Question

In: Math

Suppose you perform the following multiple regression: Y = B0 + B1X1 + B2X2 + B3X3....

Suppose you perform the following multiple regression: Y = B0 + B1X1 + B2X2 + B3X3. You find that X1 and X3 have a near perfect correlation. How would you conclude on the utility of your regression result? This is a problem of multicollinearity which renders the entire regression invalid. This is a problem of multicollinearity which nevertheless does not invalidate the utility of the model as a whole This is NOT a regression problem and inferences made using the model and the respective coefficients remain valid. This is a problem of multicollinearity. However inferences made concerning the individual contribution of the model coefficients remain valid

Solutions

Expert Solution

We need to see the value of R square value and the adjusted R square of a given data. There shouldn't be much difference between these. A difference of 5 -10% is accepted (5% is much better). If there is a huge difference between these then the regression model doesn't fits the data and on the contrary, it fits the data.

For example,

In case of the above output, value of R square is 0.8 but the value of adjusted R square is 0.6. As there is a huge difference, the model doesn't fits the data and hence the utility is less. In order to solve this we might consider more relevant independent variable or reduce some in order to bring them within a difference of 5-10% at most. If the difference can be reduced more, the model becomes fit for the data.

One of the accepted solution is shown below:

Here the difference between the value of R square value and the adjusted R square is very less hence it is an accepted model.


Related Solutions

Y^ = b0 + b1X1 The following table shows the calculations for regression line: The following...
Y^ = b0 + b1X1 The following table shows the calculations for regression line: The following table shows the calculations for regression line: Customers (in 1000s), X Line Maintenance Expense (in $1000s), Y X^2 Y^2 XY 25.3 484.6 640.09 234837.16 12260.38 36.4 672.3 1324.96 451987.29 24471.72 37.9 839.4 1436.41 704592.36 31813.26 45.9 694.9 2106.81 482886.01 31895.91 53.4 836.4 2851.56 699564.96 44663.76 66.8 681.9 4462.24 464987.61 45550.92 78.4 1037 6146.56 1075369 81300.8 82.6 1095.6 6822.76 1200339.36 90496.56 93.8 1563.1 8798.44 2443281.61...
estimate the following linear regression equation using the data below: y=B0+B1X In this exercise, you need...
estimate the following linear regression equation using the data below: y=B0+B1X In this exercise, you need to estimate ŷ values for given y and x values using Excel. What are the estimated values of B0 and B1 ? x = 2, 4, 6, 8, 10 y = 10, 12, 20, 25, 40 ŷ =
Simple Linear Regression: Suppose a simple linear regression analysis provides the following results: b0 = 6.000,    b1...
Simple Linear Regression: Suppose a simple linear regression analysis provides the following results: b0 = 6.000,    b1 = 3.000,    sb0 = 0.750, sb1 = 0.500,  se = 1.364 and n = 24. Use this information to answer the following questions. (a) State the model equation. ŷ = β0 + β1x ŷ = β0 + β1x + β2sb1    ŷ = β0 + β1x1 + β2x2 ŷ = β0 + β1sb1 ŷ = β0 + β1sb1 x̂ = β0 + β1sb1 x̂ = β0 +...
5. Suppose you have performed a simple linear regression model and ended up with = b0...
5. Suppose you have performed a simple linear regression model and ended up with = b0 + b1 x. (a) In your own words, describe clearly what the coefficient of determination, , measures.   (b) Suppose that your calculations produce = 0.91. What can you conclude from this value? Furthermore, what can you say about the strength and direction of the relationship between the predictor and the response variable?
Suppose for a multiple regression on just 5 observations you are given the following portion of...
Suppose for a multiple regression on just 5 observations you are given the following portion of an excel regression output: RESIDUAL OUTPUT Observation Predicted y(hat) Residuals 1 73.61 0.39 2 93.03 -1.03 3 58.97 -0.97 4 85.21 -0.21 5 78.18 1.82 Test the model for autocorrelation at a 10% level of significance. Test the model for heteroskedasticity using a level of significance of 5%
Suppose for a multiple regression on just 5 observations you are given the following portion of...
Suppose for a multiple regression on just 5 observations you are given the following portion of an excel regression output: RESIDUAL OUTPUT observations Predicted y(hat) Residuals 1 73.61 0.39 2 93.03 -1.03 3 58.97 -0.97 4 85.21 -0.21 5 78.18 1.82 Test the model for autocorrelation at a 10% level of significance. Test the model for heteroskedasticity using a level of significance of 5%
Suppose a multiple regression model is given by modifying above y with caretequals0.21x 1minus9.52x 2minus28.56. What...
Suppose a multiple regression model is given by modifying above y with caretequals0.21x 1minus9.52x 2minus28.56. What would an interpretation of the coefficient of x 1 be? Fill in the blank below. An interpretation of the coefficient of x 1 would be, "if x 1 decreases by ?? unit, then the response variable will decrease by nothing units, on average, while holding x 2 constant." By how many units?
Consider the multiple linear regression of Yi=B0+B1X1i+B2X2i+ui Show mathematical procedure of how to calculate the slope...
Consider the multiple linear regression of Yi=B0+B1X1i+B2X2i+ui Show mathematical procedure of how to calculate the slope coefficients of B1 or B2
Consider the multiple linear regression of Yi=B0+B1X1i+B2X2i+ui Show mathematical procedure of how to calculate the slope...
Consider the multiple linear regression of Yi=B0+B1X1i+B2X2i+ui Show mathematical procedure of how to calculate the slope coefficients of B1 or B2
In this problem, we will perform multiple regression on the Boston housing data. The data contains...
In this problem, we will perform multiple regression on the Boston housing data. The data contains 506 records with 14 variables. The variable medv is the response variable. Solve the following problems in R and print out the commands and outputs : To assess the data use library(MASS) data(Boston) (a) First perform a multiple regression with all the variables, what can you say about the significance of the variables based on only the p-values. Next use the ”step” function to...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT