In: Statistics and Probability
Let’s say that we have a data set with N data points:
X – {X1, X2, X3……….. XN}
The formula for quartiles is given by:
What it basically means is that in a data set with N data points:
Let's say we have a dependent variable Y and two independent variable X1 and X2. Using the above formula, we can compute Quartiles.
The box Plot shows you how your data is spread out. Five pieces of information (the “five-number summary“) are generally included in the Plot:
The above figure depicts the simple example of Box Plot.
If we have two independent and one dependent variable we can make a side-by-side Box Plot of each independent variable with the dependent variable. For example :