In: Statistics and Probability
A sales manager collected the following data on x = years of experience and y = annual sales ($1,000s). The estimated regression equation for these data is
ŷ = 83 + 4x.
| Salesperson | Years of Experience  | 
Annual Sales ($1,000s)  | 
|---|---|---|
| 1 | 1 | 80 | 
| 2 | 3 | 97 | 
| 3 | 4 | 102 | 
| 4 | 4 | 107 | 
| 5 | 6 | 103 | 
| 6 | 8 | 116 | 
| 7 | 10 | 119 | 
| 8 | 10 | 123 | 
| 9 | 11 | 127 | 
| 10 | 13 | 136 | 
(a)
Compute SST, SSR, and SSE.
SST=SSR=SSE=
(b)
Compute the coefficient of determination
r2.
(Round your answer to three decimal places.)
r2
=
Comment on the goodness of fit. (For purposes of this exercise, consider a proportion large if it is at least 0.55.)
The least squares line did not provide a good fit as a large proportion of the variability in y has been explained by the least squares line.The least squares line provided a good fit as a small proportion of the variability in y has been explained by the least squares line. The least squares line did not provide a good fit as a small proportion of the variability in y has been explained by the least squares line.The least squares line provided a good fit as a large proportion of the variability in y has been explained by the least squares line.
(c)
What is the value of the sample correlation coefficient? (Round your answer to three decimal places.)
Part a)
| Experience (X) | Sales (Y) | X * Y | X2 | Ŷ | SSE =Σ(Y - Ŷ)2 | SST = Σ(Yi - Y̅ )2 | SSR = Σ( Ŷ - Y̅ )2 | |
| 1 | 80 | 80 | 1 | 87 | 49 | 961 | 576 | |
| 3 | 97 | 291 | 9 | 95 | 4 | 196 | 256 | |
| 4 | 102 | 408 | 16 | 99 | 9 | 81 | 144 | |
| 4 | 107 | 428 | 16 | 99 | 64 | 16 | 144 | |
| 6 | 103 | 618 | 36 | 107 | 16 | 64 | 16 | |
| 8 | 116 | 928 | 64 | 115 | 1 | 25 | 16 | |
| 10 | 119 | 1190 | 100 | 123 | 16 | 64 | 144 | |
| 10 | 123 | 1230 | 100 | 123 | 0 | 144 | 144 | |
| 11 | 127 | 1397 | 121 | 127 | 0 | 256 | 256 | |
| 13 | 136 | 1768 | 169 | 135 | 1 | 625 | 576 | |
| Total | 70 | 1110 | 8338 | 632 | 3863.971 | 160 | 2432 | 2272 | 
X̅ = Σ (Xi / n ) = 70/10 = 7
Y̅ = Σ (Yi / n ) = 1110/10 = 111
SST = Σ(Yi - Y̅ )2 =  2432
SSR = Σ( Ŷ - Y̅ )2  = 2272
SSE =Σ(Y - Ŷ)2 = 160
Part b)
r2 = SSR / SST = 0.934
The least squares line provided a good fit as a large proportion of the variability in y has been explained by the least squares line.
Part c)
r = √ r2 = 0.967