Question

In: Computer Science

Explain how a data analyst would interpret the boxplot. Whatdoes it mean when you look...

Explain how a data analyst would interpret the boxplot. What does it mean when you look at it?

Solutions

Expert Solution

box plot is a very powerful tool that we have for understanding our data. Using box plots we can better understand our data by understanding its distribution, outliers, mean, median and variance. Box plot packs all of this information about our data in a single concise diagram. It allows us to understand the nature of our data at a single glance.

Consider the diagram below:

Every box-plot has two parts, a box and whiskers as you can see in the figure above. That’s why it is also sometimes called the box and whiskers plot. The start of the box i.e the lower quartile represents the 25% of our data set. So by looking at the diagram we can instantly conclude that 25% of our data has a value less than 6.2, similarly the end of the box i.e the upper quartile represents 75% of our data. So again from the diagram we can conclude that 75% of our data is less than 8.8. The bold black line in the box represents the median value of our data. In our example the median lies at about 7.8. The difference between the lower quartile and upper quartile is called the inter-quartile range. So basically the entire red box represents the inter-quartile range.

The following diagram will explain the quartiles even further:

Now for outliers

Now lets talk about the whiskers of boxplot and how do we visualize outliers in a boxplot. In box plot the whiskers are generally defined as 1.5 times the inter-quartile range. Anything this outside the whiskers is considered as an outlier.

Identify Skewness

We can also identify the skewness of our data by observing the shape of the box plot. If the box plot is symmetric it means that our data follows a normal distribution. If our box plot is not symmetric it shows that our data is skewed. You can get a better understanding by looking at the diagrams below:

Here is a box plot with respect to the distribution curve:


Related Solutions

How do you interpret a t value? When would it be appropriate to use a t...
How do you interpret a t value? When would it be appropriate to use a t test of dependent samples? Please provide an example.
Explain using examples, ‘how’ and ‘why’ you would collect Sensitivity and Specificity data when performing a...
Explain using examples, ‘how’ and ‘why’ you would collect Sensitivity and Specificity data when performing a Validation Study on a new DNA STR Profiling Kit.
In terms of ethics how would you interpret the decision by the government to impose a...
In terms of ethics how would you interpret the decision by the government to impose a binding price floor (in part (a) above) from a Consequentialist point of view? What about from a Deontological perspective? Briefly explain
When you look at a company's financial statements, what do these mean for a CEO or...
When you look at a company's financial statements, what do these mean for a CEO or CFO versus a potential investor (debt to equity, stock price, net revenue, etc)?
You are an analyst at a bank that has been asked to look into a project...
You are an analyst at a bank that has been asked to look into a project that is being undertaken by ABC Corp with a life of 10 years. The project will return cash flows of $2 million every year for 5 years and 4 million for the remaining 5 years after an initial investment of $10 million. The firm has a beta of 0.5 and the expected return of the market is 11%. There are currently 20000 shares outstanding...
Explain why it is difficult to see dimly-lit objects when you look directly at them. How...
Explain why it is difficult to see dimly-lit objects when you look directly at them. How could this information be used to improve our vision at night?
how do i interpret mean, mode and median?
how do i interpret mean, mode and median?
2. A. How would you interpret a current ratio of 0.8? B. Is it good or...
2. A. How would you interpret a current ratio of 0.8? B. Is it good or bad for the firm? C. What information you would need to extra to interpret it?
How do you understand and interpret mean, median, mode, standard deviation, and variance?
How do you understand and interpret mean, median, mode, standard deviation, and variance?
what would a morning and evening gratitude practice look like to you? and how would it...
what would a morning and evening gratitude practice look like to you? and how would it improve your sleep hygiene, stress management, and overall outlook on life?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT