Question

In: Statistics and Probability

Suppose you run a regression containing observations for each of the 74 kinds of cars released...

Suppose you run a regression containing observations for each of the 74 kinds of cars released in 1978 in the United States, and you regress price (in dollars) on weight (in pounds). You get the following results:
ˆ β0 is -6.71, with SE 1174.4. ˆ β1 (slope coefficient on weight) is 2.04 with SE .377.
• (a) Say, in words, what the slope coefficient means in this case, without taking a stand on causality.

• (b) Suppose I give you the following information: the sum of squares total is roughly equal to 635 million, and the sum of squares explained is equal to 185 million. Please report the R2 and the correlation coefficient between price and weight.

• (c) Use your regression model to predict what the price would be for a car that weighs 3000 pounds.

• (d) Use the SER to add uncertainty and make this prediction an interval. • (5 points) Build a 95% confidence interval for the slope coefficient and report the p-value for comparing it to 0. Interpret your results.

Solutions

Expert Solution

Suppose you run a regression containing observations for each of the 74 kinds of cars released in 1978 in the United States, and you regress price (in dollars) on weight (in pounds). You get the following results:
ˆ β0 is -6.71, with SE 1174.4. ˆ β1 (slope coefficient on weight) is 2.04 with SE .377.
• (a) Say, in words, what the slope coefficient means in this case, without taking a stand on causality.

The regression equation we get from the information provided, we get

price = -6.71 + 2.04 weight

Interpretation for the slope : One pound increase in the weight increase the price by 2.04

• (b) Suppose I give you the following information: the sum of squares total is roughly equal to 635 million, and the sum of squares explained is equal to 185 million. Please report the R2 and the correlation coefficient between price and weight.

Coefficient of determination(rsqaure) = 0.2913
It is the measure of the amount of variability in y explained by x. Its value lies between 0 and 1. Greater the value, better is the model. In this case, it 29.13%, hence the model is bad

• (c) Use your regression model to predict what the price would be for a car that weighs 3000 pounds.

We are given weight = 3000 pounds.
We substitute this value in the regression

price = -6.71 + 2.04*(3000) = 6113.29

d.(5 points) Build a 95% confidence interval for the slope coefficient and report the p-value for comparing it to 0. Interpret your results.

95% confidence interval for the slope


Calculate the p-value for the slope coefficient.

Significance of the Slope coefficient

To understand if the slope coefficient is significant, we conduct the following hypothesis


We check the pvalue associated with that variable,
if the pvalue is less than 0.01(level of significance specified), then we reject the null hypothesis and conclude that the variable is significant or is statistically different from zero. Hence it is a significant predictor of y.

if the pvalue is greater than 0.01(level of significance specified), then we fail to reject the null hypothesis and conclude that the variable is not significant or is not statistically different from zero. Hence it is not a significant predictor of y and can be dropped from the regression.

We see that the pvalue for the slope in our case is 0.00, which is less than 0.05, hence we conclude that weight is an important predictor of y.


Related Solutions

Suppose for a multiple regression on just 5 observations you are given the following portion of...
Suppose for a multiple regression on just 5 observations you are given the following portion of an excel regression output: RESIDUAL OUTPUT Observation Predicted y(hat) Residuals 1 73.61 0.39 2 93.03 -1.03 3 58.97 -0.97 4 85.21 -0.21 5 78.18 1.82 Test the model for autocorrelation at a 10% level of significance. Test the model for heteroskedasticity using a level of significance of 5%
Suppose for a multiple regression on just 5 observations you are given the following portion of...
Suppose for a multiple regression on just 5 observations you are given the following portion of an excel regression output: RESIDUAL OUTPUT observations Predicted y(hat) Residuals 1 73.61 0.39 2 93.03 -1.03 3 58.97 -0.97 4 85.21 -0.21 5 78.18 1.82 Test the model for autocorrelation at a 10% level of significance. Test the model for heteroskedasticity using a level of significance of 5%
Suppose, for simplicity, that there are two kinds of used cars in the market. Exactly 50%...
Suppose, for simplicity, that there are two kinds of used cars in the market. Exactly 50% of the cars are good cars worth $10,000, and 50% are bad cars (lemons) worth only $5,000. Also suppose that only the car owner truly knows the type of car that it is. A.If both kinds of cars are sold, what is your expected value from buying a used car? B.Now suppose that current owners of good cars (i.e., the sellers) have a reservation...
(3) Suppose that you run a regression of wage on a set of independent variables: age,...
(3) Suppose that you run a regression of wage on a set of independent variables: age, years of education, experience, and a dummy variable married. There are 100 observations, i.e., n=100. You want to test whether all the coefficient and intercept estimates in the equation for married and unmarried are equal to each other. i.e., Married: wage = β0 + β1*age + β2*educ + β3*exper + uUnmarried: wage = α0 + α1*age + α2*educ + α3*exper + v (a) Write...
1. Suppose you run a regression on past data to find the magnitude of consumption responses...
1. Suppose you run a regression on past data to find the magnitude of consumption responses to changes in interest rates. What is the main error that you are making if you simply use this number to determine the optimal interest rate change to respond to an economic shock? 2. Why can limiting the choices of policy makers result in better long-term outcomes?
ou run the following regression on 189 observations collectedusing a random sample to examine which factors...
ou run the following regression on 189 observations collectedusing a random sample to examine which factors affect the weight of babiesat birth:y=β0+β1x1+β2x2+β3x3+εwhere•yis the birth weight in grams•x1is number of times the mother visited the physician during the 1sttrimester•x2is mother’s weight at last menstrual period in kilograms•x3is the age of the mother in yearsYou obtain the following results Variable Estimate, bi Standard Error Sbi x1 15.66 51.103 x2 4.124 1.757 x3 7.356 10.282 bo 2225.513 301.536 (a) Interpret the estimateb1= 15.660...
Suppose you run a regression and the Serial Correlation LM Test, has a p-value of 0.0000....
Suppose you run a regression and the Serial Correlation LM Test, has a p-value of 0.0000. How would you interpret the results using an F-test?
Consider a regression model Yi=β0+β1Xi+ui and suppose from a sample of 10 observations you are provided...
Consider a regression model Yi=β0+β1Xi+ui and suppose from a sample of 10 observations you are provided the following information: ∑10i=1Yi=71;  ∑10i=1Xi=42;  ∑10i=1XiYi=308; ∑10i=1X2i=196 Given this information, what is the predicted value of Y, i.e.,Yˆ for x = 12? 1. 14 2. 11 3. 13 4. 12 5. 15
For an experiment with one factor containing 7 groups, with 8 observations in each group, fill...
For an experiment with one factor containing 7 groups, with 8 observations in each group, fill in the missing results in the following ANOVA table. Source df SS MS F Among Groups 63 Within Groups 10 Please provide the explanation for each blanks
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression equation y=0.843x+6.762 with a correlation coefficient of r=0.954 which is significant at α=0.01 You want to predict what value (on average) for the explanatory variable will give you a value of 150 on the response variable. What is the predicted explanatory value? x = (Report answer accurate to one decimal place.) Here is a bivariate data set. Find the regression equation for the response...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT