Question

In: Statistics and Probability

Consider the following data for two variables, x and y. x1 = 4 5 7 8...

Consider the following data for two variables, x and y.

x1 = 4 5 7 8 10 12 12 22

y1 = 12 14 16 15 18 20 24 19

b. Compute the standardized residuals for these data (to 2 decimals, if necessary). Enter negative values as negative numbers.

Observation 1
Observation 2
Observation 3
Observation 4
Observation 5
Observation 6
Observation 7
Observation 8

Do the data include any outliers?
- Select your answer -Yes, there appear to be 3 outliersYes, there appear to be 2 outliersYes, there appears to be an outlierNo, there do not appear to be any outliersItem 11

c. Compute the leverage values for these data (to 2 decimals). Enter negative values as negative numbers.

Observation 1
Observation 2
Observation 3
Observation 4
Observation 5
Observation 6
Observation 7
Observation 8

Does there appear to be any influential observations in these data?
- Select your answer -Yes, observation 8 is an influential observation Yes, observation 6 is an influential observation Yes, observation 3 is an influential observation No, there do not appear to be any influential observations

Solutions

Expert Solution

Answer:

Leverage is a measure of how much the estimated value of the x changes when that point is removed. If leverage for an observation is high, it means it forces the regression line to be close to the point. In these data points, influential observation in these data is observation 8 as the rule for identifying the influential observation is:

if the leverage value of an observation is greater than 3times larger than the mean of leverage values

so, mean of the leverage value = 0.25000 and three times = 3*0.25000 = 0.75000

by looking at the leverage value in the table, observation 8 (whose x value =22 is extreme) has leverage value 0.76, which is higher than 0.75. so observation 8 is the influential observation. so it means removing this datapoint form the equation would have a greater impact on the regression analysis.

Create a scatterplot of yI on xI. How to create: choose xI and yI data set and in excel, go to insert bar, choose scatter

we would see scatterplot below which shows datapoint (x= 22, y = 19) is extreme with respect to other x values. this means scatter diagram indicate influential observation. ((loook at the extreme right point big in size in diagram below which stands out)

Observation xI yI Standard Residuals (xI-mean of x)2 leverage  
1 4 12 -0.917182874 36 0.284292
2 5 14 -0.382347324 25 0.235619
3 7 16 0.008262909 9 0.164823
4 8 15 -0.475492842 4 0.142699
5 10 18 0.254647825 0 0.125
6 12 20 0.645258058 4 0.142699
7 12 24 2.003379792 4 0.142699
8 22 19 -1.136525544 144 0.762168

To compute the standardized residuals in excel (Regress Y on X), Choose Tools, Data Analysis, Regression

Highlight the column Y then column X, click on standardized residuals and click OK

Similarly, the leverage value of each observation can be computed as:

hii = 1/n + (xi ? x¯)2 /(xi ? x¯)2 .

here n =8, so the leverage value of point 1 is: h11 = 1/8 + (36/226) = 0.284292

we can calculate variance in excel by choosing function in excel (insert function)

Here 226 is denominator of leverage formula and the calculation is : sum of column 5 below


Related Solutions

3. Consider the following data for two variables, x and y. x   2 3 4 5 7...
3. Consider the following data for two variables, x and y. x   2 3 4 5 7 7 7 8 9 y 4 5 4 6 4 6 9 5 11 a. Does there appear to be a linear relationship between x and y? Explain. b. Develop the estimated regression equation relating x and y. c. Plot the standardized residuals versus yˆ for the estimated regression equation developed in part (b). Do the model assumptions appear to be satisfied? Explain. d....
3. Consider the following data for two variables, x and y.x   2 3 4 5 7...
3. Consider the following data for two variables, x and y. x   2 3 4 5 7 7 7 8 9 y 4 5 4 6 4 6 9 5 11 a. Does there appear to be a linear relationship between x and y? Explain.(f-test, to do f-test for the overall significance) b. Develop the estimated regression equation relating x and y. c. Plot the standardized residuals versus yˆ for the estimated regression equation developed in part (b). Do the model...
Consider the following data for two variables, x and y.
  Consider the following data for two variables, x and y. x 22 24 26 30 35 40 y 11 20 33 34 39 36 (a) Develop an estimated regression equation for the data of the form ŷ = b0 + b1x. (Round b0 to one decimal place and b1 to three decimal places.)ŷ = −8.3+1.259x    (b) Use the results from part (a) to test for a significant relationship between x and y. Use α = 0.05. Find the...
Consider the following data for two variables, x and y.
Consider the following data for two variables, x and y. x   2 3 4 5 7 7 7 8 9 y 4 5 4 6 4 6 9 5 11 a. Does there appear to be a linear relationship between x and y? Explain.(f-test, to do f-test for the overall significance) b. Develop the estimated regression equation relating x and y. c. Plot the standardized residuals versus yˆ for the estimated regression equation developed in part (b). Do the model assumptions...
Consider the following data for two variables,  and . x 8 27 21 16 22 y 8...
Consider the following data for two variables,  and . x 8 27 21 16 22 y 8 30 23 12 26 a. Develop an estimated regression equation for the data of the form y=b0+b1x . Comment on the adequacy of this equation for predicting . Enter negative value as negative number. The regression equation is Y=_______+_______ (to 2 decimals) s= (to 3 decimals) R^2= % (to 1 decimal) R^2 adj= % (to 1 decimal) Analysis of Variance SOURCE DF SS (to...
Consider the following data for a dependent variable y and two independent variables, x2 and x1....
Consider the following data for a dependent variable y and two independent variables, x2 and x1. x1 x2 y 30 12 94 46 10 109 24 17 113 50 17 179 41 5 94 51 19 175 75 8 171 36 12 118 59 13 143 77 17 212 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating y to x1 . Predict y if x1=45....
Consider the following data for a dependent variable y and two independent variables, x1 and x2.
You may need to use the appropriate technology to answer this question. Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 93 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 211 The estimated regression equation for these data is ŷ = −18.89 + 2.02x1 + 4.74x2. Here, SST = 15,276.0,...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 94 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 209 The estimated regression equation for these data is ŷ = −17.02 + 1.99x1 + 4.70x2. Here, SST = 14,902.9, SSR = 13,773.1, sb1 = 0.2470, and sb2 = 0.9480. (1a) Test...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 13 95 46 10 108 25 18 113 50 16 179 40 5 95 51 20 176 74 7 170 36 12 117 59 13 142 77 16 211 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating y to x1. ŷ =_________ +___________ x1 Predict...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 95 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 212 The estimated regression equation for these data is ŷ = −18.52 + 2.01x1 + 4.75x2. Here, SST = 15,234.1, SSR = 14,109.8, sb1 = 0.2464, and sb2 = 0.9457. (a)Test for...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT