Question

Consider the following data for two variables, x and y.

x:  2  3  4  5  7  7  7  8   9
y:  4  5  4  6  4  6  9  5  11

a. Does there appear to be a linear relationship between x and y? Explain. (Use an F-test for overall significance.)

b. Develop the estimated regression equation relating x and y.

c. Plot the standardized residuals versus ŷ for the estimated regression equation developed in part (b). Do the model assumptions appear to be satisfied? Explain.

d. Perform a logarithmic transformation on the dependent variable y. Develop an estimated regression equation using the transformed dependent variable. Do the model assumptions appear to be satisfied by using the transformed dependent variable? Does a reciprocal transformation work better in this case? Explain.

 

Solutions

Expert Solution

a. A scatter plot of the two variables x and y shows whether the relationship between them is linear.

In the scatter plot for these data, the relationship does not appear to be perfectly linear.

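b) The estimated regression equation reported for the two variables is

x = 2.15 + 0.604 y

This fit takes x as the response and y as the predictor. With y as the dependent variable, as the question specifies, the least-squares equation is ŷ = 2.32 + 0.637 x.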
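The coefficients can be checked directly from the data. The following is a minimal sketch in plain Python, not the software used for the original solution; the helper name `least_squares` is made up for illustration. It fits both orientations, because the equation above takes x as the response while the question names y as the dependent variable.

```python
x = [2, 3, 4, 5, 7, 7, 7, 8, 9]
y = [4, 5, 4, 6, 4, 6, 9, 5, 11]

def least_squares(pred, resp):
    """Return (intercept, slope) of the least-squares line resp = b0 + b1 * pred."""
    n = len(pred)
    pbar = sum(pred) / n
    rbar = sum(resp) / n
    # Corrected sums of cross-products and squares.
    s_pr = sum((p - pbar) * (r - rbar) for p, r in zip(pred, resp))
    s_pp = sum((p - pbar) ** 2 for p in pred)
    slope = s_pr / s_pp
    return rbar - slope * pbar, slope

b0, b1 = least_squares(x, y)  # y as the response (the question's orientation)
a0, a1 = least_squares(y, x)  # x as the response (the equation quoted above)
print(f"y-hat = {b0:.2f} + {b1:.3f} x")
print(f"x-hat = {a0:.2f} + {a1:.3f} y")
```

The two lines are not algebraic inverses of each other, so the choice of response variable changes the reported equation.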

c) In the standardized residual plot, the assumptions of independence and homoscedasticity of the residuals do not appear to be met: the residuals are widely spread at the low end of the horizontal axis and narrow as the values along that axis increase.
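For reference, standardized residuals for a simple linear regression can be computed as each residual divided by s·sqrt(1 − h), where s is the standard error of the estimate and h is the leverage of the observation. The sketch below assumes y is the response, as in the question; the function name `standardized_residuals` is hypothetical, and the printed pairs are the points one would plot.

```python
import math

x = [2, 3, 4, 5, 7, 7, 7, 8, 9]
y = [4, 5, 4, 6, 4, 6, 9, 5, 11]

def standardized_residuals(pred, resp):
    """Return (fitted values, standardized residuals) for resp = b0 + b1 * pred."""
    n = len(pred)
    pbar = sum(pred) / n
    rbar = sum(resp) / n
    s_pp = sum((p - pbar) ** 2 for p in pred)
    s_pr = sum((p - pbar) * (r - rbar) for p, r in zip(pred, resp))
    b1 = s_pr / s_pp
    b0 = rbar - b1 * pbar
    fitted = [b0 + b1 * p for p in pred]
    resid = [r - f for r, f in zip(resp, fitted)]
    # Standard error of the estimate: sqrt(SSE / (n - 2)).
    s = math.sqrt(sum(e ** 2 for e in resid) / (n - 2))
    # Leverage of each observation in simple linear regression.
    leverage = [1 / n + (p - pbar) ** 2 / s_pp for p in pred]
    std = [e / (s * math.sqrt(1 - h)) for e, h in zip(resid, leverage)]
    return fitted, std

fitted, std_resid = standardized_residuals(x, y)  # assumption: y is the response
for f, r in zip(fitted, std_resid):
    print(f"fitted = {f:6.3f}   standardized residual = {r:6.2f}")
```

Swapping the arguments, `standardized_residuals(y, x)`, gives the residuals for the fit with x as the response.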

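The F-test for the above model is:

Analysis of Variance
Source       DF       SS       MS      F    P-value
Regression    1   17.521   17.521   4.37      0.075
Residual      7   28.035    4.005
Total         8   45.556

Since the p-value of 0.075 exceeds 0.05, the linear relationship is not statistically significant at the 5% level.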
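The sums of squares and the F statistic in this table can be reproduced by hand. The sketch below is a plain-Python check under two assumptions: the function name `anova` is made up for illustration, and the critical value 5.59 is the tabulated F value for a 0.05 significance level with 1 and 7 degrees of freedom (the 0.05 level is not stated in the question). The table corresponds to the fit with x as the response; in simple linear regression the F statistic is the same when y is the response.

```python
x = [2, 3, 4, 5, 7, 7, 7, 8, 9]
y = [4, 5, 4, 6, 4, 6, 9, 5, 11]

def anova(pred, resp):
    """Return (SSR, SSE, SST, F) for the simple regression of resp on pred."""
    n = len(pred)
    pbar = sum(pred) / n
    rbar = sum(resp) / n
    s_pp = sum((p - pbar) ** 2 for p in pred)
    s_pr = sum((p - pbar) * (r - rbar) for p, r in zip(pred, resp))
    sst = sum((r - rbar) ** 2 for r in resp)  # total sum of squares
    ssr = s_pr ** 2 / s_pp                    # regression sum of squares
    sse = sst - ssr                           # residual sum of squares
    f_stat = ssr / (sse / (n - 2))            # MSR / MSE with 1 and n - 2 df
    return ssr, sse, sst, f_stat

ssr, sse, sst, f_stat = anova(y, x)  # x as the response, matching the table
F_CRIT = 5.59                        # assumed: table value F(0.05; 1, 7)
print(f"SSR = {ssr:.3f}, SSE = {sse:.3f}, SST = {sst:.3f}, F = {f_stat:.2f}")
print("significant at the 0.05 level:", f_stat > F_CRIT)
```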

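d) The regression equation is
x = -1.28 + 9.40 log10(y), where log10(y) is the base-10 logarithm of y. (The slope of 9.40 corresponds to base 10; with the natural logarithm the same fit is x = -1.28 + 4.08 ln(y).)

The model assumptions seem to be slightly better satisfied than without the log transformation.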

The F-test results for the log-transformed model are:

Analysis of Variance
Source       DF       SS       MS      F    P-value
Regression    1   17.575   17.575   4.40      0.074
Residual      7   27.980    3.997
Total         8   45.556

Next, consider the reciprocal transformation.

The regression equation is
x = 10.4 - 24.6 (1/y), where 1/y is the reciprocal of the variable y.

The model assumptions seem to be better satisfied than with the previous model.

The F-test for the reciprocal model is:

Analysis of Variance
Source       DF       SS       MS      F    P-value
Regression    1   17.046   17.046   4.19      0.080
Residual      7   28.510    4.073
Total         8   45.556
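The two transformed fits can be compared side by side. The following sketch reproduces the coefficients and F statistics reported above, assuming (as the numbers in the tables imply) that x is the response and the transformed y is the predictor, and that the logarithm is base 10. The helper name `fit` and the dictionary `results` are illustrative only.

```python
import math

x = [2, 3, 4, 5, 7, 7, 7, 8, 9]
y = [4, 5, 4, 6, 4, 6, 9, 5, 11]

def fit(pred, resp):
    """Return (intercept, slope, F) for the simple regression of resp on pred."""
    n = len(pred)
    pbar = sum(pred) / n
    rbar = sum(resp) / n
    s_pp = sum((p - pbar) ** 2 for p in pred)
    s_pr = sum((p - pbar) * (r - rbar) for p, r in zip(pred, resp))
    sst = sum((r - rbar) ** 2 for r in resp)
    slope = s_pr / s_pp
    ssr = s_pr ** 2 / s_pp
    f_stat = ssr / ((sst - ssr) / (n - 2))
    return rbar - slope * pbar, slope, f_stat

transforms = {
    "log10(y)": [math.log10(v) for v in y],  # assumption: base-10 logarithm
    "1/y": [1 / v for v in y],
}
results = {name: fit(pred, x) for name, pred in transforms.items()}
for name, (b0, b1, f_stat) in results.items():
    print(f"x-hat = {b0:.2f} + ({b1:.2f}) * {name},  F = {f_stat:.2f}")
```

To follow the wording of part (d) literally, with the transformed y as the dependent variable, the same helper can be called as `fit(x, transforms["log10(y)"])` or `fit(x, transforms["1/y"])`.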
