Question

In: Statistics and Probability

a. Estimate the regression equation and determine the predicted values of y b. Use the predicted...

a. Estimate the regression equation and determine the predicted values of y

b. Use the predicted values and the actual values of y to calculate the residuals

c. Plot the residuals (vertical axis) against the predicted values (horizontal axis).

d. Does it appear that heteroscedasticity is a problem? Explain

e. Draw a histogram of the residuals. Does it appear that the errors are normally distributed? Explain.

f. Use the residuals to identify possible outlier(s)

Weight Price
27 12.02
28.5 12.04
30.8 12.32
31.3 12.27
31.9 12.49
34.5 12.7
34.7 13
37 13
41 13.17
41 13.19
38.8 13.22
39.3 13.27
34 12.8

please do not use minitab or any statistical software

Solutions

Expert Solution

a. Estimate the regression equation and determine the predicted values of y:

We are given:

Obs. WeightWeight PricePrice X_i^2Xi2​ Y_i^2Yi2​ X_i \cdot Y_iXi​⋅Yi​
1 27 12.02 729 144.4804 324.54
2 28.5 12.04 812.25 144.9616 343.14
3 30.8 12.32 948.64 151.7824 379.456
4 31.3 12.27 979.69 150.5529 384.051
5 31.9 12.49 1017.61 156.0001 398.431
6 34.5 12.7 1190.25 161.29 438.15
7 34.7 13 1204.09 169 451.1
8 37 13 1369 169 481
9 41 13.17 1681 173.4489 539.97
10 41 13.19 1681 173.9761 540.79
11 38.8 13.22 1505.44 174.7684 512.936
12 39.3 13.27 1544.49 176.0929 521.511
13 34 12.8 1156 163.84 435.2
Sum = 449.8 165.49 15818.46 2109.1937 5750.275

=1/n∑​Xi​=449.8/13​=34.6

=34.6

=1/n∑Yi​=165.49​/13=12.73

=12.73

SSXX​​=∑Xi2​−1/n​(∑Xi​)2=15818.46−1/13​(449.8)2=255.38​

SSXX=255.38

SSYY​​=∑​Yi2​−1/n​(∑​Yi​)2=2109.1937−1​/13(165.49)2=2.506​

SSYY=2.506

SSXY​​=∑​Xi​Yi​−1/n​(∑​Xi​)(∑​Yi​)=5750.275−1/13​(449.8)(165.49)=24.321​

SSXY​​=24.321

​​=​SSXY/SSXX​​=24.321/255.38=​0.0952​

=0.0952

​​==12.73−0.0952×34.6=9.4349​

=9.4349

the regression equation is:

PREDICTED VALUES:

Obs. Weight Price Predicted Values
1 27 12.02 9.4349+0.0952×27=12.006
2 28.5 12.04 9.4349+0.0952×28.5=12.149
3 30.8 12.32 9.4349+0.0952×30.8=12.368
4 31.3 12.27 9.4349+0.0952×31.3=12.416
5 31.9 12.49 9.4349+0.0952×31.9=12.473
6 34.5 12.7 9.4349+0.0952×34.5=12.72
7 34.7 13 9.4349+0.0952×34.7=12.74
8 37 13 9.4349+0.0952×37=12.959
9 41 13.17 9.4349+0.0952×41=13.34
10 41 13.19 9.4349+0.0952×41=13.34
11 38.8 13.22 9.4349+0.0952×38.8=13.13
12 39.3 13.27 9.4349+0.0952×39.3=13.178
13 34 12.8 9.4349+0.0952×34=12.673

b. Use the predicted values and the actual values of y to calculate the residuals:

Price(observed value) Predicted Values Residuals=observed value-predicted value
12.02 12.006 0.014
12.04 12.149 -0.109
12.32 12.368 -0.048
12.27 12.416 -0.146
12.49 12.473 0.017
12.7 12.72 -0.02
13 12.74 0.26
13 12.959 0.041
13.17 13.34 -0.17
13.19 13.34 -0.15
13.22 13.13 0.09
13.27 13.178 0.092
12.8 12.673 0.127

c. Plot the residuals (vertical axis) against the predicted values (horizontal axis):

d. Does it appear that heteroscedasticity is a problem? Explain:

Yes,heteroscedasticity is a problem Heteroscedasticity means unequal scatter.From the Scatter-plot we observe that the variance of the residuals is lower for lower predicted values and higher for higher predicted values.which shows a highly unequal scatter. hence, yes heteroscedasticity is a problem.

e. Draw a histogram of the residuals. Does it appear that the errors are normally distributed? Explain.

Frequency Table
Class Count
-0.2--0.071 4
-0.07-0.059 5
0.06-0.189 3
0.19-0.319 1

The normal distribution curve is symmetric that is but since the above histrogram is positively skewed hence,the errors are not normally distributed.

please rate my answer and comment for doubts.


Related Solutions

Use the dataset and the predicted regression equation to calculate the expected value of Y for...
Use the dataset and the predicted regression equation to calculate the expected value of Y for each known value of X. Then, use this information to calculate the standard error of estimate. Round all values to the nearest whole number. Y’ = 2.5 + .75(ice cream sales) Ice Cream Sales Observed Crime Predicted Crime 10 5 20 20 30 30 20 25 30 15 10 10 Now, calculate the standard error. The standard error is _________ PLEASE show your work...
Determine the regression equation (write the regression equation) of the following data, where Y is the...
Determine the regression equation (write the regression equation) of the following data, where Y is the dependant variable and X is the predictor: X Y 5 85 4 103 6 70 5 82 5 89 5 98 6 66 6 95 2 169 7 70 7 48 b) Interpret the regression coefficients. c) Graph the regression line. d) Use the regression equation to predict Y for X=3. e) Compute the coefficient of determination. f) Interpret the coefficient of determination. g)...
9) Use the following data to estimate a linear regression equation between y and x. Interpret...
9) Use the following data to estimate a linear regression equation between y and x. Interpret the estimated slope coefficient. Predict y for an x value of 9. Calculate and interpret the model’s R-squared. x y 21 12 17 10 11 8 3 5 13 15
What is the Regression Equation (i.e., what are the values of a and b)?
What is the Regression Equation (i.e., what are the values of a and b)? What is the forecast value for November? What is the Standard Error of the Estimate?    
Curve Fitting and Linear Regression a) Determine the linear regression equation for the measured values in...
Curve Fitting and Linear Regression a) Determine the linear regression equation for the measured values in the table above. ?? 1 2 3 4 Value 1 (????) 0 3 7 10 Value 2 (????) 2 4 9 11 b) Plot the points and the linear regression curve. c) Determine the Linear Correlation Coefficient (i.e., Pearson’s r) for the dataset in the table above.
What determines how well a regression equation will predict values of the y-variable from known values...
What determines how well a regression equation will predict values of the y-variable from known values of the x-variable? (A) What values or points make-up the regression line? (B) What two points are always found on the regression line? (Hint: These two points can be used to quickly draw the regression line) Explain why the regression line the “best fitting” line that can be plotted through the points in a scatter diagram? Describe how predicted y-values plotted near the point...
Use Excel to estimate a simple linear regression model for the following data (Y is a...
Use Excel to estimate a simple linear regression model for the following data (Y is a dependent variable and X is an independent variable): Y X 0 -2 0 -1 1 0 1 1 3 2 Fill in Multiple Blanks: What is the slope of the estimated line?  In your answer, show one (1) digit to the right of the decimal point, for example, 1.0, 1.2. Apply the appropriate rounding rule if necessary. What is the Y-intercept?
Solve for the predicted values of y and the residuals for the following data. x 12...
Solve for the predicted values of y and the residuals for the following data. x 12 21 28 8 20 y 17 15 22 19 24 Do not round the intermediate values. (Round your answers to 3 decimal places.) x y Predicted (ŷ) Residuals (y-ŷ) 12 17 21 15 28 22 8 19 20 24
Linear Regression When we use a least-squares line to predict y values for x values beyond...
Linear Regression When we use a least-squares line to predict y values for x values beyond the range of x values found in the data, are we extrapolating or interpolating? Are there any concerns about such predictions?
Use the following data set to calculate the regression equation. What would you estimate the happiness...
Use the following data set to calculate the regression equation. What would you estimate the happiness rating for someone who has 5 children to be? Number of children Happiness Rating 2 1 5 5 8 6 4 3 5 5 6 7 1 2 6 5 3 2
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT