Question

Consider the following data for two variables, x and y.

xi    135   110   130   145   175   160   120
yi    145   105   120   115   130   130   110

(a)

Compute the standardized residuals for these data. (Round your answers to two decimal places.)

xi     yi     Standardized Residuals
135    145    ____
110    105    ____
130    120    ____
145    115    ____
175    130    ____
160    130    ____
120    110    ____

Do the data include any outliers? Explain. (Round your answers to two decimal places.)

The standardized residual with the largest absolute value is ____, corresponding to yi = ____. Since this residual is [less than −2 / between −2 and +2 / greater than +2], it [is definitely not / could be] an outlier.
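The calculation asked for in part (a) can be sketched in plain Python. This is a minimal sketch, assuming the usual textbook definitions for simple linear regression: leverage hi = 1/n + (xi − x̅)²/Sxx, standard error s = √(SSE/(n − 2)), and standardized residual ei/(s·√(1 − hi)). The function name `standardized_residuals` and the |r| > 2 flag are illustrative choices, not part of the original question.

```python
import math

# Data from the question.
x = [135, 110, 130, 145, 175, 160, 120]
y = [145, 105, 120, 115, 130, 130, 110]

def standardized_residuals(x, y):
    """Return (fitted values, standardized residuals) for the least squares line."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx                      # slope
    b0 = y_bar - b1 * x_bar             # intercept
    fitted = [b0 + b1 * xi for xi in x]
    resid = [yi - fi for yi, fi in zip(y, fitted)]
    # Standard error of the estimate: s = sqrt(SSE / (n - 2)).
    s = math.sqrt(sum(e ** 2 for e in resid) / (n - 2))
    # Leverage of each observation.
    lev = [1 / n + (xi - x_bar) ** 2 / sxx for xi in x]
    std = [e / (s * math.sqrt(1 - h)) for e, h in zip(resid, lev)]
    return fitted, std

fitted, std_resid = standardized_residuals(x, y)
for xi, yi, r in zip(x, y, std_resid):
    # Assumed rule of thumb: |standardized residual| > 2 marks a possible outlier.
    flag = "possible outlier" if abs(r) > 2 else ""
    print(f"{xi:>4} {yi:>4} {r:6.2f}  {flag}")
```

Rounding each printed value to two decimal places gives the entries for the table, and the flagged row identifies the observation to name in the fill-in sentence.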

(b)

Plot the standardized residuals against ŷ.

A standardized residual plot has 7 points plotted on it. The horizontal axis ranges from 105 to 140 and is labeled: y hat. The vertical axis ranges from −2.5 to 2.5 and is labeled: Standardized Residuals. There is a horizontal line that spans the graph at 0 on the vertical axis. There are 4 points below the line and 3 points above it. Six of the points appear to vary randomly between −0.8 and 0.1 on the vertical axis; however, the maximum residual is at approximately (121, 2.2).

A standardized residual plot has 7 points plotted on it. The horizontal axis ranges from 105 to 140 and is labeled: y hat. The vertical axis ranges from −2.5 to 2.5 and is labeled: Standardized Residuals. There is a horizontal line that spans the graph at 0 on the vertical axis. There are 3 points below the line and 4 points above it. Six of the points appear to vary randomly between −0.1 and 0.8 on the vertical axis; however, the minimum residual is at approximately (121, −2.2).

A standardized residual plot has 7 points plotted on it. The horizontal axis ranges from 105 to 140 and is labeled: y hat. The vertical axis ranges from −2.5 to 2.5 and is labeled: Standardized Residuals. There is a horizontal line that spans the graph at 0 on the vertical axis. There are 4 points below the line and 3 points above it. The points are plotted from left to right in a downward, diagonal direction starting from the upper left corner of the graph. Most of the points are between −0.8 and 0.1 on the vertical axis; however, the maximum residual is at approximately (112, 2.2).

A standardized residual plot has 7 points plotted on it. The horizontal axis ranges from 105 to 140 and is labeled: y hat. The vertical axis ranges from −2.5 to 2.5 and is labeled: Standardized Residuals. There is a horizontal line that spans the graph at 0 on the vertical axis. There are 4 points below the line and 3 points above it. The points are plotted from left to right in an upward, diagonal direction starting from the lower left corner of the graph. Most of the points are between −0.8 and 0.1 on the vertical axis; however, the maximum residual is at approximately (134, 2.2).

Does this plot reveal any outliers?

The plot shows no possible outliers.
The plot shows one possible outlier.
The plot shows two possible outliers.
The plot shows more than two possible outliers.
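To compare the four plot descriptions against the data, it helps to compute the plotted points (ŷ, standardized residual) and summarize them. The sketch below does that with the standard library only. The helper names `residual_plot_points` and `draw_plot` are hypothetical, and `draw_plot` assumes matplotlib is installed; it is defined but not called here.

```python
import math

# Data from the question.
x = [135, 110, 130, 145, 175, 160, 120]
y = [145, 105, 120, 115, 130, 130, 110]

def residual_plot_points(x, y):
    """Return a list of (y_hat, standardized residual) pairs."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
    b0 = y_bar - b1 * x_bar
    fitted = [b0 + b1 * xi for xi in x]
    resid = [yi - fi for yi, fi in zip(y, fitted)]
    s = math.sqrt(sum(e ** 2 for e in resid) / (n - 2))
    return [
        (fi, e / (s * math.sqrt(1 - (1 / n + (xi - x_bar) ** 2 / sxx))))
        for xi, fi, e in zip(x, fitted, resid)
    ]

points = residual_plot_points(x, y)
above = sum(1 for _, r in points if r > 0)   # points above the zero line
below = sum(1 for _, r in points if r < 0)   # points below the zero line
extreme = max(points, key=lambda p: abs(p[1]))
print(f"above: {above}, below: {below}")
print(f"most extreme point: ({extreme[0]:.0f}, {extreme[1]:.1f})")

def draw_plot(points):
    """Optional: draw the plot. Assumes matplotlib is available."""
    import matplotlib.pyplot as plt
    fig, ax = plt.subplots()
    ax.scatter([p[0] for p in points], [p[1] for p in points])
    ax.axhline(0)                      # reference line at zero
    ax.set_xlabel("y hat")
    ax.set_ylabel("Standardized Residuals")
    plt.show()
```

The counts above and below zero, together with the location of the most extreme point, are enough to tell the four descriptions apart without drawing anything; calling `draw_plot(points)` gives the picture if one is wanted.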

(c)

Develop a scatter diagram for these data.

A scatter diagram has 7 points plotted on it. The horizontal axis ranges from 100 to 180 and is labeled: x. The vertical axis ranges from 90 to 150 and is labeled: y. The points are plotted from left to right in an upward, diagonal direction starting from the lower left corner of the diagram. The points are between 110 and 175 on the horizontal axis and between 105 and 145 on the vertical axis. Most of the points are plotted reasonably close together, but the fourth point from the left is noticeably higher than the others at 145 on the vertical axis.

A scatter diagram has 7 points plotted on it. The horizontal axis ranges from 100 to 180 and is labeled: x. The vertical axis ranges from 90 to 150 and is labeled: y. The points are plotted from left to right in a downward, diagonal direction starting from the upper left corner of the diagram. The points are between 110 and 175 on the horizontal axis and between 105 and 145 on the vertical axis. The points are fairly scattered, though the seventh point from the left is slightly farther away from the others at 120 on the vertical axis.

A scatter diagram has 7 points plotted on it. The horizontal axis ranges from 100 to 180 and is labeled: x. The vertical axis ranges from 90 to 150 and is labeled: y. The points are plotted from left to right in a downward, diagonal direction starting from the upper left corner of the diagram. The points are between 110 and 175 on the horizontal axis and between 105 and 145 on the vertical axis. The points are fairly scattered, though the second point from the left is noticeably farther away from the others at 105 on the vertical axis.

A scatter diagram has 7 points plotted on it. The horizontal axis ranges from 100 to 180 and is labeled: x. The vertical axis ranges from 90 to 150 and is labeled: y. The points are plotted from left to right in an upward, diagonal direction starting from the lower left corner of the diagram. The points are between 110 and 175 on the horizontal axis and between 105 and 145 on the vertical axis. The points are reasonably close together and each consecutive point is higher than or just as high on the diagram as the previous point.

Does the scatter diagram indicate any outliers in the data?

The diagram indicates that there are no possible outliers.
The diagram indicates that there is one possible outlier.
The diagram indicates that there are two possible outliers.
The diagram indicates that there are more than two possible outliers.

In general, what implications does this finding have for simple linear regression?

For simple linear regression, we must calculate standardized residuals, plot a standardized residual plot, and construct a scatter diagram to identify an outlier.
For simple linear regression, we can determine an outlier by looking at the scatter diagram.
For simple linear regression, it is impossible to determine whether there is an outlier using standardized residuals, a standardized residual plot, or a scatter diagram.

Solutions

Expert Solution

SOLUTION:

Using the estimated regression equation ŷ = b0 + b1x, the predicted values and residuals are:

Predicted ŷ    Residual ei = yi − ŷi
122.58          22.42
112.64          -7.64
120.59           4.41
126.56          -6.56
138.49          -3.49
132.52          -2.52
116.62          -6.62

SSE = Syy − (Sxy)²/Sxx = 685.698
s² = SSE/(n − 2) = 137.1395
Standard error: s = √s² = 11.7107

With leverage hi = 1/n + (xi − x̅)²/Sxx and residual standard deviation si = s·√(1 − hi), the standardized residual is ei/si:

hi        si         Standardized residual = ei/si
0.1488    10.8041     2.08
0.4221     8.9025    -0.86
0.1709    10.6629     0.41
0.1535    10.7745    -0.61
0.5581     7.7844    -0.45
0.2826     9.9192    -0.25
0.2640    10.0469    -0.66
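As a cross-check on the posted tables, the sketch below recomputes the leverages from the x values and combines them with the posted residuals and standard error to reproduce the standardized residuals. The variable names are illustrative. One caveat: adding each posted residual to its predicted value gives y = 145, 105, 125, 120, 135, 130, 110, which differs from the y row in the question in three entries. This check therefore only confirms that the posted tables are internally consistent, not that they answer the question as stated.

```python
import math

# x values from the question; residuals, standard error, and standardized
# residuals copied from the posted solution.
x = [135, 110, 130, 145, 175, 160, 120]
posted_resid = [22.42, -7.64, 4.41, -6.56, -3.49, -2.52, -6.62]
posted_std = [2.08, -0.86, 0.41, -0.61, -0.45, -0.25, -0.66]
s = 11.7107

n = len(x)
x_bar = sum(x) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)

# Leverage: hi = 1/n + (xi - x_bar)^2 / Sxx
leverage = [1 / n + (xi - x_bar) ** 2 / sxx for xi in x]

# Standardized residual: ei / (s * sqrt(1 - hi))
recomputed = [e / (s * math.sqrt(1 - h)) for e, h in zip(posted_resid, leverage)]

for h, r_new, r_old in zip(leverage, recomputed, posted_std):
    print(f"h = {h:.4f}   recomputed = {r_new:6.2f}   posted = {r_old:6.2f}")
```

Leverage depends only on the x values, so the hi column is the same for any variant of this exercise that shares these x values, whatever the y values are.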
