Question

In: Statistics and Probability

(27 pts total) What follows is hypothetical Excel output for doing regression of y on x....

(27 pts total) What follows is hypothetical Excel output for doing regression of y on x. The regression equation is: ^y = 54:6429 + 0:9451x Source DF SS MS F P Regression 1 130.0396 130.0396 20.7812 0.0019 Residual Error Total 9 180.1000 (a) (3 pts) Complete the ANOVA table above by lling in the \Residual Error" row above. (b) (6 pts) What are the values of the regression standard error s and the coecient of determination r2? (c) (6 pts) Write down a null hypothesis H0 and an alternative hypothesis Ha that the P-value 0:0019 in this chart is referring to. Should you reject the null hypothesis at the 1% level of signcance? To what type of error are you now subject to? (d) (4 pts) Suppose SSxx = 145:6. Calculate the t statistic for the test in part (c) and conrm the P-value using the t-table. There are a couple dierent ways this could be done. Choose one, but show your work and/or explain. (e) (8 pts) Suppose x = 20:8 and SSxx = 145:6. Give a 90% prediction interval for the value of y when x = 24. Describe the meaning of this interval in relation to the population

Solutions

Expert Solution

(a)

The complete ANOVA table is,

Source DF SS MS F P

Regression 1 130.0396 130.0396 20.7812 0.0019

Residual Error 8 50.0604 6.25755

Total 9 180.1000

DF for Residual Error = DF Total - DF for Regression = 9 - 1 = 8

SS for Residual Error = SS Total - SS for Regression = 180.1000 - 130.0396 = 50.0604

MS for Residual Error = SS for Residual Error / DF for Residual Error = 50.0604 / 8 = 6.25755

(b)

Regression standard error s =

= 2.50151

Coefficient of determination r2 = SS for Regression / SS for Total = 130.0396 / 180.1000 = 0.722

(c)

where is the coefficient for x in the model.

Since p-value is less than 0.01 significance level, we reject null hypothesis H0 and conclude that there is significant evidence that .

Since we are rejecting the null hypothesis, there may be chance that the null hypothesis is true and we may commit Type I error.

(d)

Standard error of = s / = 2.50151 / = 0.2073106

t statistic = Coeff Estimate / Standard error = 0.9451 / 0.2073106 = 4.56

P-value = 2 * P(t > 4.56, df = 8) = 0.0018

which is approximately equal to P-value 0:0019 in Anova table.

(e)

Given,

= 20.8

SSxx = 145.6

y when x = 24 is,

^y = 54.6429 + 0.9451 * 24 = 77.3253

Critical value of t at 90% confidence level and df = 8 is, 1.86

90% prediction interval for the value of y when x = 24 is,

= (77.3253 - 5.0335,  77.3253 + 5.0335)

= (72.2918, 82.3588)

We are 90% confident that the next new observation (y) will fall within (72.2918, 82.3588) when x is 24.


Related Solutions

The following is part of regression output produced by Excel ( for Y vs X1 and...
The following is part of regression output produced by Excel ( for Y vs X1 and X2): Y 12.9 6.1 1.1 39.7 3.4 5.9 8.9 15 7.3 X1 0.9 0.8 1.0 0.3 0.4 0.7 0.71 0.5 0.9 X2 4.2 3.1 1.2 15.7 2.5 0.7 5.0 6.4 3.0 A) write out the estimated regression equation showing that depends on X1 and X2. b)if. X1=0.58 and X2=7.0, what is the value predicted for y c)write the number which is the standard error...
Shown below is a portion of an Excel output for regression analysis relating Y (dependent variable)...
Shown below is a portion of an Excel output for regression analysis relating Y (dependent variable) and X (independent variable). ANOVA df SS Regression 1 39947.80 Residual (Error) 10 8280.81 Total 11 48228.61 Coefficients Standard Error t Stat P-value Intercept 69.190 26.934 2.569 0.02795 X 2.441 0.351 6.946 0.00004 1.   What is the estimated regression equation that relates Y to X? (2 Points) 2.   Is the regression relationship significant? Use the p-value approach and alpha = 0.05 to answer this question. (2...
1- The regression of X on Y is not the same as the regression of Y...
1- The regression of X on Y is not the same as the regression of Y on X. Why is this? Select one: a. Because the regression minimises the residuals of y, not the residuals of x. b. Because unlike correlation, regression assumes X causes Y. c. Because one goes through (mean x, mean y) whereas the other goes through (mean y, mean x). d. Because the F test divides MSy by MSx, not the other way round. 2- Using...
3.Below is the regression input and output (from Excel) for the regression in which the percentage...
3.Below is the regression input and output (from Excel) for the regression in which the percentage change in the exchange rate is the dependent variable (Y) and the inflation differential (inflation rate home – inflation rate foreign) is the independent variable (X). According to the Purchasing Power Parity theory, the hypothesized coefficients (null hypothesis) in the regression are 0 (intercept coefficient) and 1 (inflation differential coefficient).   The directions for Problem Set 1 instruct you to first read sections of the...
Assume that an operation * is defined as follows: x * y = x' + y...
Assume that an operation * is defined as follows: x * y = x' + y Using Boolean algebra theorems and postulates (don’t use K-maps), check whether the operation * is associative or not?
Assume that total output is determined by this formula: TOTAL OUTPUT = NUMBER OF WORKERS X...
Assume that total output is determined by this formula: TOTAL OUTPUT = NUMBER OF WORKERS X PRODUCTIVITY assume there are 100 workers and each worker produces $100 of output. QUESTION: If the workforce is growing by 1% a year but productivity does not improve, how fast can output increase? Remember, the number of workers grows by 1%. Productivity does not change. Also remember the percentage change formula which is: [(new value - original value) / original value] x 100 a1%...
The multiple regression model is estimated in Excel and part of the output is provided below....
The multiple regression model is estimated in Excel and part of the output is provided below. ANOVA df SS MS F Significance F Regression 3 3.39E+08 1.13E+08 1.327997 0.27152899 Residual 76 6.46E+09 85052151 Total 79 6.8E+09 Question 8 (1 point) Use the information from the ANOVA table to complete the following statement. To test the overall significance of this estimated regression model, the hypotheses would state there is    between attendance and the group of all explanatory variables, jointly. there is...
The following data are provided. (Nonlinear regression using excel) x 1 2 3 4 5 y...
The following data are provided. (Nonlinear regression using excel) x 1 2 3 4 5 y 2.2 2.8 3.6 4.5 5.5 Fit the model y=a+bx+c/x, where a, b and c are constants for the model. Perform nonlinear regression in EXCEL using solver to get model constants. Plot y model (model and data) vs x to see the quality of fit.
2. The joint pmf of ? and ? is given by ??,? (?, ?) = (x+y)/27  ???...
2. The joint pmf of ? and ? is given by ??,? (?, ?) = (x+y)/27  ??? ? = 0, 1,2; ? = 1, 2, 3, and ??,? (?, ?) = 0 otherwise. a. Find ?(?|? = ?) for all ? = 0,1, 2. b. Find ?(3 + 0.2?|? = 2).
1. The slope coefficient for a regression of Y on X is
Consider the data in the table below.YX5810555491969105952798Answer the following questions to two decimal places.1. The slope coefficient for a regression of Y on X is2. The constant of a regression of Y on X is3. The residual for the first observation in the table is4. The correlation of the residuals and X is
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT