Question

In: Statistics and Probability

Consider the following multiple regression equation relating a machinist's performance rating on a new machine (RATING)...

Consider the following multiple regression equation relating a machinist's performance rating on a new machine (RATING) to the following three independent variables.

WKEX - number of years work experience as a machinist

TSCORE - mechanical aptitude score

Years - age

^Rating = 12.5 + 0.8 WKEX + 0.32 TSCORE + 0.3 YEARS

a) Explain all of the steps for finding the value of R^2 (coefficient of determination) which are associated with finding the variance inflation factor (VIF) associated with the independent variable YEARS.

Solutions

Expert Solution

Coefficient of determination, R2, a measure that assesses the ability of a model to predict or explain an outcome in the linear regression setting. More specifically, R2 indicates the proportion of the variance in the dependent variable (Y) that is predicted or explained by linear regression and the predictor variable (X) [ also known as the independent variable]

In general, a high R2 value indicates that the model is a good fit for the data, although interpretations of fit depending on the context of analysis.

Now

R2 = MSS/TSS = (TSS RSS)/TSS,

where MSS is the model sum of squares (also known as ESS, or explained sum of squares), which is the sum of the squares of the prediction from the linear regression minus the mean for that variable; TSS is the total sum of squares associated with the outcome variable, which is the sum of the squares of the measurements minus their mean; and RSS is the residual sum of squares, which is the sum of the squares of the measurements minus the prediction from the linear regression

A variance inflation factor(VIF) detects multicollinearity in regression analysis. Multicollinearity is when there’s correlation between predictors (i.e. independent variables) in a model; it’s presence can adversely affect your regression results. The VIF estimates how much the variance of a regression coefficient is inflated due to multicollinearity in the model

VIFs are calculated by taking a predictor and regressing it against every other predictor in the model. This gives you the R-squared values, which can then be plugged into the VIF formula. “i” is the predictor you’re looking at (e.g. x1 or x2):

Here the predictor i corresponds to the independant variable Years


Related Solutions

DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE DEGREES OF FREDOM FOR THE REGRESSION IS EQUAL TO: THE TEST TO CONFIRM WHETHER THE DEPENDENT VARIABLE CAN BE ESTIMATED WITHOUT RELYING ON THE INDEPENDENT VARIABLES IS REFERRED TO AS: WHAT STATISITIC WOULD WE USE FOR TESTING TO SEE IF AT LEAST ONE INDEPENDENT REGRESSION COEFFICIENTS IS SIGNIFICANT? TO TEST INDEPENDENT VARIABLES INDIVIDUALLY TO DETERMINE WHETHER THE REGRESSION COEFFICIENTS DIFFER FROM ZERO WOULD BE:...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE DEGREES OF FREDOM FOR THE REGRESSION IS EQUAL TO: THE TEST TO CONFIRM WHETHER THE DEPENDENT VARIABLE CAN BE ESTIMATED WITHOUT RELYING ON THE INDEPENDENT VARIABLES IS REFERRED TO AS: WHAT STATISITIC WOULD WE USE FOR TESTING TO SEE IF AT LEAST ONE INDEPENDENT REGRESSION COEFFICIENTS IS SIGNIFICANT? TO TEST INDEPENDENT VARIABLES INDIVIDUALLY TO DETERMINE WHETHER THE REGRESSION COEFFICIENTS DIFFER FROM ZERO WOULD BE:...
In a multiple linear regression with 40 observations, the following sample regression equation is obtained: yˆy^...
In a multiple linear regression with 40 observations, the following sample regression equation is obtained: yˆy^ = 12.5 + 2.4x1 − 1.0x2 with se = 5.41. Also, when x1 equals 16 and x2 equals 5, se(yˆ0)se(y^0) = 2.60. [You may find it useful to reference the t table.] a. Construct the 95% confidence interval for E(y) if x1 equals 16 and x2 equals 5. (Round intermediate calculations to at least 4 decimal places, "tα/2,df" value to 3 decimal places, and...
Consider the following data:   and   What is the sample regression equation?
Consider the following data:   and   What is the sample regression equation?
a) State the Multiple Regression Equation. b) Interpret the meaning of the slopes of this equation...
a) State the Multiple Regression Equation. b) Interpret the meaning of the slopes of this equation c)Predict the gasoline mileage for an automobile that has a length of 195 inches and a weight of 3000 pounds. e)Is there a significant relationship between the gasoline mileage and the two independent variables (Length and weight) at the 0.05 level of significance? g) Interpret the meaning of the coefficient of multiple determination in this problem i) At the 0.05 level of significance, determine...
The following estimated regression equation was developed relating yearly income (y in $1000s) of 30 individuals...
The following estimated regression equation was developed relating yearly income (y in $1000s) of 30 individuals with their age (x1) and their gender (x2) (0 if male and 1 if female). Ŷ = 30 + .7x1 + 3x2 Also provided are SST = 1200 and SSE = 384. The test statistic for testing the significance of the model is
The following estimated regression equation was developed relating yearly income (y in $1000s) of 30 individuals...
The following estimated regression equation was developed relating yearly income (y in $1000s) of 30 individuals with their age (x1) and their gender (x2) (0 if male and 1 if female). Ŷ = 30 + .7x1 + 3x2 Also provided are SST = 1200 and SSE = 384. If we want to test for the significance of the model, the critical value of F at α = .05 is a. 3.33. b. 3.35. c. 3.34. d. 2.96.
1. What would the regression output (analysis) look like using this multiple regression equation and the...
1. What would the regression output (analysis) look like using this multiple regression equation and the following data? Daily Gross Revenue= total daily income+b1*daily tour income+b2*number of tourists+b3*Friday+b4*Saturday 2. What's the multiple regression equation with the numbers from the output? Years Weekend Daily Tour Income Number of Tourists Daily Gross Revenue Total Daily Income 1 Friday 3378 432 4838.95 8216.95 1 Saturday 1198 139 3487.78 4685.78 1 Sunday 3630 467 4371.3 8001.3 2 Friday 4550 546 6486.48 11036.48 2 Saturday...
Consider the following regression model, relating birth weight to the number of cigarettes smoked during pregnancy:...
Consider the following regression model, relating birth weight to the number of cigarettes smoked during pregnancy: bwgt=B0+B0cigs+ε (a) Do you think that cigsi and εi are correlated? Explain why or why not. Would the OLS estimator be BLUE in this case? (b) Suppose the mother’s level of education is also related to birth weight. However, you still estimate the equation above, using only cigsi as an independent variable. Do you think the OLS estimator B1 accurately estimates the association between...
Determine the regression equation (write the regression equation) of the following data, where Y is the...
Determine the regression equation (write the regression equation) of the following data, where Y is the dependant variable and X is the predictor: X Y 5 85 4 103 6 70 5 82 5 89 5 98 6 66 6 95 2 169 7 70 7 48 b) Interpret the regression coefficients. c) Graph the regression line. d) Use the regression equation to predict Y for X=3. e) Compute the coefficient of determination. f) Interpret the coefficient of determination. g)...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT