[PYTHON] Visualize by adding "a bite" to the "boxplot" (boxen / swarm / violin)

Overview

motivation

Boxplot

If you try Swarmplot

Add "a bite" to the boxplot

Try changing functions and options

If you do etc., you can add "a bite"

Putting it all together (cheat sheet)

option boxenplot swarmplot violinplot
Not specified download.png download.png download.png
hue="Sex" download.png download.png download.png
hue="Sex"
split=True
None download.png download.png

"Which" should be used "when"?

Boxplot vs boxenplot

Boxplot(boxplot) boxenplot
display download.png download.png
Feature Quartile, Maximum, minimum
You can also see the situation of outliers
ThanFine quantileCan be seen
Hard to see as outliers

Boxplot vs swarmplot

Boxplot(boxplot) swarmplot
display download.png download.png
Feature section(Quantile)To catch as
plot
Low cost
Awareness of the individualAnd continuously capture the data
dataDifferences by number and seriesCan be understood
しかし、plotHigh cost

swarmplot vs violinplot

swarmplot violinplot
display download.png download.png
Feature Awareness of the individualAnd continuously capture the data
dataDifferences by number and seriesCan be understood
But the plotHigh cost
Awareness of the individualせず、dataのI can't see the numberBut,
Continuous understanding of overall trends
plotKeep costs downCan do things.

Summary

Interval vs continuous How to add "Hitomi" What visualization method should I choose?
Data**section
(Quantile)**Treated with
OutliersIf you want to be aware of Boxplot(boxplot)download.png
From the quartileDetailedIn the display, boxenplotdownload.png
DataContinuouslyHandle, ThatNumber and densityIf you want to show swarmplotdownload.png
Keep plot costs down
Overall trendIf you want to show
violinplotdownload.png

Recommended Posts

Visualize by adding "a bite" to the "boxplot" (boxen / swarm / violin)
Create a function to visualize / evaluate the clustering result
Create a 2D array by adding a row to the end of an empty array with numpy
I tried to visualize the Beverage Preference Dataset by tensor decomposition.
I tried to visualize Boeing of violin performance by pose estimation
Prepare a URL to display the image uploaded by Active Storage by yourself
Find the minimum value of a function by particle swarm optimization (PSO)
I tried to verify the result of A / B test by chi-square test
How to sort by specifying a column in the Python Numpy array.