In: Statistics and Probability
a. Estimate the regression equation and determine the predicted values of y
b. Use the predicted values and the actual values of y to calculate the residuals
c. Plot the residuals (vertical axis) against the predicted values (horizontal axis).
d. Does it appear that heteroscedasticity is a problem? Explain
e. Draw a histogram of the residuals. Does it appear that the errors are normally distributed? Explain.
f. Use the residuals to identify possible outlier(s)
| Weight | Price |
| 27 | 12.02 |
| 28.5 | 12.04 |
| 30.8 | 12.32 |
| 31.3 | 12.27 |
| 31.9 | 12.49 |
| 34.5 | 12.7 |
| 34.7 | 13 |
| 37 | 13 |
| 41 | 13.17 |
| 41 | 13.19 |
| 38.8 | 13.22 |
| 39.3 | 13.27 |
| 34 | 12.8 |
please do not use minitab or any statistical software
a. Estimate the regression equation and determine the predicted values of y:
We are given:
| Obs. | WeightWeight | PricePrice | X_i^2Xi2 | Y_i^2Yi2 | X_i \cdot Y_iXi⋅Yi |
| 1 | 27 | 12.02 | 729 | 144.4804 | 324.54 |
| 2 | 28.5 | 12.04 | 812.25 | 144.9616 | 343.14 |
| 3 | 30.8 | 12.32 | 948.64 | 151.7824 | 379.456 |
| 4 | 31.3 | 12.27 | 979.69 | 150.5529 | 384.051 |
| 5 | 31.9 | 12.49 | 1017.61 | 156.0001 | 398.431 |
| 6 | 34.5 | 12.7 | 1190.25 | 161.29 | 438.15 |
| 7 | 34.7 | 13 | 1204.09 | 169 | 451.1 |
| 8 | 37 | 13 | 1369 | 169 | 481 |
| 9 | 41 | 13.17 | 1681 | 173.4489 | 539.97 |
| 10 | 41 | 13.19 | 1681 | 173.9761 | 540.79 |
| 11 | 38.8 | 13.22 | 1505.44 | 174.7684 | 512.936 |
| 12 | 39.3 | 13.27 | 1544.49 | 176.0929 | 521.511 |
| 13 | 34 | 12.8 | 1156 | 163.84 | 435.2 |
| Sum = | 449.8 | 165.49 | 15818.46 | 2109.1937 | 5750.275 |
=1/n∑Xi=449.8/13=34.6
=34.6
=1/n∑Yi=165.49/13=12.73
=12.73
SSXX=∑Xi2−1/n(∑Xi)2=15818.46−1/13(449.8)2=255.38
SSXX=255.38
SSYY=∑Yi2−1/n(∑Yi)2=2109.1937−1/13(165.49)2=2.506
SSYY=2.506
SSXY=∑XiYi−1/n(∑Xi)(∑Yi)=5750.275−1/13(449.8)(165.49)=24.321
SSXY=24.321
=SSXY/SSXX=24.321/255.38=0.0952
=0.0952
=
=12.73−0.0952×34.6=9.4349
=9.4349
the regression equation is:

PREDICTED VALUES:
| Obs. | Weight | Price | Predicted Values |
| 1 | 27 | 12.02 | 9.4349+0.0952×27=12.006 |
| 2 | 28.5 | 12.04 | 9.4349+0.0952×28.5=12.149 |
| 3 | 30.8 | 12.32 | 9.4349+0.0952×30.8=12.368 |
| 4 | 31.3 | 12.27 | 9.4349+0.0952×31.3=12.416 |
| 5 | 31.9 | 12.49 | 9.4349+0.0952×31.9=12.473 |
| 6 | 34.5 | 12.7 | 9.4349+0.0952×34.5=12.72 |
| 7 | 34.7 | 13 | 9.4349+0.0952×34.7=12.74 |
| 8 | 37 | 13 | 9.4349+0.0952×37=12.959 |
| 9 | 41 | 13.17 | 9.4349+0.0952×41=13.34 |
| 10 | 41 | 13.19 | 9.4349+0.0952×41=13.34 |
| 11 | 38.8 | 13.22 | 9.4349+0.0952×38.8=13.13 |
| 12 | 39.3 | 13.27 | 9.4349+0.0952×39.3=13.178 |
| 13 | 34 | 12.8 | 9.4349+0.0952×34=12.673 |
b. Use the predicted values and the actual values of y to calculate the residuals:
| Price(observed value) | Predicted Values | Residuals=observed value-predicted value |
| 12.02 | 12.006 | 0.014 |
| 12.04 | 12.149 | -0.109 |
| 12.32 | 12.368 | -0.048 |
| 12.27 | 12.416 | -0.146 |
| 12.49 | 12.473 | 0.017 |
| 12.7 | 12.72 | -0.02 |
| 13 | 12.74 | 0.26 |
| 13 | 12.959 | 0.041 |
| 13.17 | 13.34 | -0.17 |
| 13.19 | 13.34 | -0.15 |
| 13.22 | 13.13 | 0.09 |
| 13.27 | 13.178 | 0.092 |
| 12.8 | 12.673 | 0.127 |
c. Plot the residuals (vertical axis) against the predicted values (horizontal axis):

d. Does it appear that heteroscedasticity is a problem? Explain:
Yes,heteroscedasticity is a problem Heteroscedasticity means unequal scatter.From the Scatter-plot we observe that the variance of the residuals is lower for lower predicted values and higher for higher predicted values.which shows a highly unequal scatter. hence, yes heteroscedasticity is a problem.
e. Draw a histogram of the residuals. Does it appear that the errors are normally distributed? Explain.
| Frequency Table | |
| Class | Count |
| -0.2--0.071 | 4 |
| -0.07-0.059 | 5 |
| 0.06-0.189 | 3 |
| 0.19-0.319 | 1 |

The normal distribution curve is symmetric that is but since the above histrogram is positively skewed hence,the errors are not normally distributed.
please rate my answer and comment for doubts.