In: Statistics and Probability
A study was done to look at the relationship between number of movies people watch at the theater each year and the number of books that they read each year. The results of the survey are shown below. Movies 6 10 1 2 3 1 7 7 6 5 8 Books 6 0 8 7 7 7 2 1 6 7 5 Find the correlation coefficient: r = Round to 2 decimal places. The null and alternative hypotheses for correlation are: H 0 : = 0 H 1 : ≠ 0 The p-value is: Round to 4 decimal places. Use a level of significance of α = 0.05 to state the conclusion of the hypothesis test in the context of the study. There is statistically significant evidence to conclude that a person who watches more movies will read fewer books than a person who watches fewer movies. There is statistically insignificant evidence to conclude that there is a correlation between the number of movies watched per year and the number of books read per year. Thus, the use of the regression line is not appropriate. There is statistically significant evidence to conclude that a person who watches fewer movies will read fewer books than a person who watches fewer movies. There is statistically significant evidence to conclude that there is a correlation between the number of movies watched per year and the number of books read per year. Thus, the regression line is useful. r 2 = (Round to two decimal places) Interpret r 2 : There is a large variation in the number books people read each year, but if you only look at people who watch a fixed number of movies each year, this variation on average is reduced by 66%. Given any fixed number of movies watched per year, 66% of the population reads the predicted number of books per year. 66% of all people watch about the same number of movies as they read books each year. There is a 66% chance that the regression line will be a good predictor for the number of books people read based on the number of movies they watch each year. The equation of the linear regression line is: ˆ y = + x (Please show your answers to two decimal places) Use the model to predict the number of books read per year for someone who watches 8 movies per year. Books per year = (Please round your answer to the nearest whole number.) Interpret the slope of the regression line in the context of the question: As x goes up, y goes down. The slope has no practical meaning since people cannot read a negative number of books. For every additional movie that people watch each year, there tends to be an average decrease of 0.75 books read. Interpret the y-intercept in the context of the question: If someone watches 0 movies per year, then that person will read 9 books this year. The best prediction for a person who doesn't watch any movies is that they will read 9 books each year. The average number of books read per year is predicted to be 9 books. The y-intercept has no practical meaning for this study.
ΣX | ΣY | Σ(x-x̅)² | Σ(y-ȳ)² | Σ(x-x̅)(y-ȳ) | |
total sum | 56.00 | 56.00 | 88.91 | 76.91 | -67.09 |
mean | 5.09 | 5.09 | SSxx | SSyy | SSxy |
correlation coefficient , r =
SSxy/√(SSx.SSy) = -0.81
Ho: ρ = 0
Ha: ρ ╪ 0
p value=0.0024
c)
d)
R² = (SSxy)²/(SSx.SSy) = 0.66
e)
There is a large variation in the number books people read each year, but if you only look at people who watch a fixed number of movies each year, this variation on average is reduced by 66%
f)
sample size , n = 11
here, x̅ = Σx / n= 5.091 ,
ȳ = Σy/n = 5.091
SSxx = Σ(x-x̅)² = 88.9091
SSxy= Σ(x-x̅)(y-ȳ) = -67.1
estimated slope , ß1 = SSxy/SSxx = -67.1
/ 88.909 = -0.75460
intercept, ß0 = y̅-ß1* x̄ =
8.93252
so, regression line is Ŷ = 8.93
+ -0.75 *x
g)
Predicted Y at X= 8 is
Ŷ = 8.9325 +
-0.7546 *8= 2.89571
≈ 3
h)
For every additional movie that people watch each year, there tends to be an average decrease of 0.75 books read.
i)
The best prediction for a person who doesn't watch any movies is that they will read 9 books each year.