Discussion Board Forum 1/Project 2 Instructions
Standard Deviation and Outliers
Thread:
For this assignment, you will use the Project 2 Excel Spreadsheet to answer the questions below. In each question, use the spreadsheet to create the graphs as described and then answer the question.
Put all of your answers into a thread posted in Discussion Board Forum 1/Project 2.
This course uses the Post-First feature in all Discussion Board Forums. This means you will only be able to read and interact with your classmates’ threads after you have submitted your own thread in response to the provided prompt. This is intentional: you must use your own work for answers to Questions 1–5. For additional information on Post-First, see the Post-First tutorial. If something happens that leads you to want to make a second post for any of your answers to Questions 1–5, you must first get permission from your instructor.
What is the impact of the new point on the standard deviation? Do not just give a numerical value for the change. Explain in sentence form what happened to the standard deviation. (4 points)
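As an illustration only, the effect of one added point on the standard deviation can be sketched in Python. The values below are hypothetical and are not taken from the Project 2 spreadsheet, and the sketch assumes the sample standard deviation (the kind Excel's STDEV.S reports); the spreadsheet may use a different formula.

```python
import statistics

# Hypothetical values, NOT the data from the Project 2 spreadsheet.
original_points = [10, 11, 12, 13, 14]
new_point = 40  # a value far away from the rest of the data

# Sample standard deviation before and after adding the new point.
sd_before = statistics.stdev(original_points)
sd_after = statistics.stdev(original_points + [new_point])

print(f"SD before adding the point: {sd_before:.2f}")
print(f"SD after adding the point:  {sd_after:.2f}")
```

Changing `new_point` to a value close to the others (for example 12) is a quick way to see how the size of the change depends on how far the new point sits from the rest of the data.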
B. Create a data set with 8 points in it that has a mean of approximately 10 and a standard deviation of approximately 1. Use the second chart to create a second data set with 8 points that has a mean of approximately 10 and a standard deviation of approximately 4. What did you do differently to create the data set with the larger standard deviation? (4 points)
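One possible way to check candidate data sets outside the spreadsheet is sketched below. The two lists are made-up examples rather than expected answers, and the sketch assumes the sample standard deviation; if the spreadsheet uses the population formula, the values will differ slightly.

```python
import statistics

# Hypothetical 8-point data set clustered tightly around 10.
tight = [8.5, 9, 9.5, 10, 10, 10.5, 11, 11.5]

# Same shape, but every point is moved 4 times farther from 10.
wide = [10 + 4 * (x - 10) for x in tight]

for name, data in (("tight", tight), ("wide", wide)):
    mean = statistics.mean(data)
    sd = statistics.stdev(data)  # sample standard deviation (assumption)
    print(f"{name}: n={len(data)}, mean={mean:.2f}, sd={sd:.2f}")
```

Both lists keep the same mean because the points are stretched symmetrically around 10; only their distances from the mean change.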
50, 50, 50, 50, 50.
Notice that the standard deviation is 0. Explain in words, without showing the calculation, why the standard deviation is zero when all of the points are the same. If you don’t know why, try doing the calculation by hand to see what is happening. If that does not make it clear, do a little research on what standard deviation measures and then look again at the data set for this question.
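The hand calculation suggested above can be followed step by step in a short Python sketch. It assumes the sample formula (dividing by n - 1); the variable names are illustrative only.

```python
import math

data = [50, 50, 50, 50, 50]

# Step 1: the mean of the data.
mean = sum(data) / len(data)

# Step 2: how far each point is from the mean.
deviations = [x - mean for x in data]

# Step 3: square the deviations and average them (dividing by n - 1).
squared_deviations = [d ** 2 for d in deviations]
sample_variance = sum(squared_deviations) / (len(data) - 1)

# Step 4: the standard deviation is the square root of the variance.
sd = math.sqrt(sample_variance)

print("mean:", mean)
print("deviations:", deviations)
print("standard deviation:", sd)
```

Looking at the `deviations` list is the useful part here: it shows what the later steps have to work with.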
Data set 1: 0, 0, 0, 100, 100, 100
Data set 2: 0, 20, 40, 60, 80, 100
Data set 3: 0, 40, 45, 55, 60, 100
Note that all three data sets have a median of 50. Notice how spread out the points are in each data set and compare this to the standard deviations for the data sets. Describe the relationship you see between the amount of spread and the size of the standard deviation and explain why this connection exists. Do not give your calculations in your answer—explain in sentence form. (8 points)
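To compare the three data sets outside the spreadsheet, a minimal Python sketch such as the following could be used. It assumes the sample standard deviation, so the numbers may differ slightly from the spreadsheet if it uses the population formula; the ordering of the three values is the same either way.

```python
import statistics

data_sets = {
    "Data set 1": [0, 0, 0, 100, 100, 100],
    "Data set 2": [0, 20, 40, 60, 80, 100],
    "Data set 3": [0, 40, 45, 55, 60, 100],
}

results = {}
for name, data in data_sets.items():
    median = statistics.median(data)
    sd = statistics.stdev(data)  # sample standard deviation (assumption)
    results[name] = (median, sd)
    print(f"{name}: median={median}, sd={sd:.2f}")
```

Printing each point's distance from the mean alongside these values can help connect the numbers to the spread visible in the charts.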
For the last 2 questions, use the Project 1 Data Set.