Question

In: Statistics and Probability

3.5          Drop/remove the insignificant independent variable from the regression model, and develop and show an updated...

3.5          Drop/remove the insignificant independent variable from the regression model, and develop and show an updated estimated regression equation that can be used to predict the average annual salary for salaried employees given the average annual salary for hourly employees and the size of the company. Again, use the F test and α = 0.05 to test for overall significance. Also use the t test and α = 0.05 to determine the significance of the independent variables in this updated estimated regression equation How much percentage of the variability in y is explained by the updated estimated regression equation? (15 points)

Rank Company Size Salaried ($1000s) Hourly ($1000s) Midsize SmallSize
4 Wegmans Food Markets Large 56 29 0 0
6 NetApp Midsize 143 76 1 0
7 Camden Property Trust Small 71 37 0 1
8 Recreational Equipment (REI) Large 103 28 0 0
10 Quicken Loans Midsize 78 54 1 0
11 Zappos.com Midsize 48 25 1 0
12 Mercedes-Benz USA Small 118 50 0 1
20 USAA Large 96 47 0 0
22 The Container Store Midsize 71 45 1 0
25 Ultimate Software Small 166 56 0 1
37 Plante Moran Small 73 45 0 1
42 Baptist Health South Florida Large 126 80 0 0
50 World Wide Technology Small 129 31 0 1
53 Methodist Hospital Large 100 83 0 0
58 Perkins Coie Small 189 63 0 1
60 American Express Large 114 35 0 0
64 TDIndustries Small 93 47 0 1
66 QuikTrip Large 69 44 0 0
72 EOG Resources Small 189 81 0 1
75 FactSet Research Systems Small 103 51 0 1
80 Stryker Large 71 43 0 0
81 SRC Small 84 33 0 1
84 Booz Allen Hamilton Large 105 77 0 0
91 CarMax Large 57 34 0 0
93 GoDaddy.com Midsize 105 71 1 0
94 KPMG Large 79 59 0 0
95 Navy Federal Credit Union Midsize 77 39 1 0
97 Schweitzer Engineering Labs Small 99 28 0 1
99 Darden Restaurants Large 57 24 0 0
100 Intercontinental Hotels Group Large 63 26 0 0

Solutions

Expert Solution

From the data it is seen that "Midsize" and "SmallSize" are two dummy variables created from the "Size" variable. Now while fitting regression model, it automatically creates dummy variables for categorical variables, so I am dropping the "Midsize" and "SmallSize" variables to avoid multicollinearity.

So finally fitting the linear regression model where response is "Salaried" and explanatory variables are "Hourly" and "Size". Below is the result of the model summary.

So from the result, it is seen that the F-statistic value is 11.72 and corresponding p-value is 4.817e-05 which is very very less than 0.05. This strongly tells that the model is overall significant.

Moreover, the p-value of the explanatory variables are less than 0.05 informing that both the Hourly and Size variable are significant in predicting Salaried variable.

R2 = 0.5749. Then 57.49% of variability of y is explained by the updated regression Model.


Related Solutions

Provide an interpretation of this regression model based on this output received from SPSS. Independent variable...
Provide an interpretation of this regression model based on this output received from SPSS. Independent variable is verbal IQ score. Dependent variable is full IQ score. Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1 .875a .766 .764 7.003 a. Predictors: (Constant), verbiq Coefficientsa Model Unstandardized Coefficients Standardized Coefficients t Sig. 95.0% Confidence Interval for B B Std. Error Beta Lower Bound Upper Bound 1 (Constant) 12.502 3.987 3.136 .002 4.610 20.395 verbiq .928...
1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from...
1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from the computer output. Lannie Karner- GPA 3.6 Income 75k Courtney Sheperd Gpa 3.3 Income 74K Zenobia Roussel- GPA 2.9 Income 66K Elaine Doody- GPA 3.8 Income 80k Maudie Hocker-GPA 3.1 Income 65k Rick Hoover-GPA 3.2 Income 53k Franinca Ortez-GPA 2.7 Income 65k Li Kinder-GPA 3.3 Income 71k Brad Clem-GPA 3.8 Income 80k Soon Nettleton-GPA 4.0 Income 95k Vertie Yousesef-GPA 3.9 Income 110k Love Au-GPA...
1)________ regression brings one independent variable into the model at a time and does not allow...
1)________ regression brings one independent variable into the model at a time and does not allow any variables to leave once they have entered. Best subsets General stepwise Backward elimination Forward selection 2)The Centers for Disease Control (CDC) would like to test the hypothesis that the proportion of obese adults in the U.S. has increased this year when compared to last year. A random sample of 125 adults this year found that 56 were obese. Last year, a random sample...
Develop estimated regression equations, first using annual income as the independent variable and then using household...
Develop estimated regression equations, first using annual income as the independent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings - Income ($1000s) Household Size Amount Charged ($) 54 3 4,016 30 2 3,159 32 4 5,100 50 5 4,742 31 2 1,864 55 2 4,070 37 1 2,731 40 2 3,348 66 4 4,764 51 3 4,110 25 3 4,208 48 4 4,219 27...
Develop estimated regression equations, first using annual income as the independent variable and then using household...
Develop estimated regression equations, first using annual income as the independent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings - Income ($1000s) Household Size Amount Charged ($) 54 3 4,016 30 2 3,159 32 4 5,100 50 5 4,742 31 2 1,864 55 2 4,070 37 1 2,731 40 2 3,348 66 4 4,764 51 3 4,110 25 3 4,208 48 4 4,219 27...
Use the following data to develop a multiple regression model to predict from and . Discuss...
Use the following data to develop a multiple regression model to predict from and . Discuss the output, including comments about the overall strength of the model, the significance of the regression coefficients, and other indicators of model fit. y x1 x2 198 29 1.64 214 71 2.81 211 54 2.22 219 73 2.70 184 67 1.57 167 32 1.63 201 47 1.99 204 43 2.14 190 60 2.04 222 32 2.93 197 34 2.15 Appendix A Statistical Tables *(Round...
Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient.
Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient. What could cause this? Your judgement is wrong. Statistics don't lie The software package made an error The homoscedasticity assumption has been violated The model may have correlated independent variables The heteroscedasticity assumption has been violated
Data was collected from 40 employees to develop a regression model to predict the employee’s annual...
Data was collected from 40 employees to develop a regression model to predict the employee’s annual salary using their years with the company (Years), their starting salary in thousands (Starting), and their Gender (Male = 0, Female = 1). The level of significance is .01. The results from Excel regression analysis are shown below: SUMMARY OUTPUT Regression Statistics Multiple R 0.718714957 R Square 0.516551189 Adjusted R Square 0.476263788 Standard Error 10615.63461 Observations 40 ANOVA Df SS MS F Significance F...
a. If they are going to run a linear regression, identify which variable should be the independent variable and which should be the dependent variable in a regression equation.
In seeking to determine how influential advertising is, the management of a recently established retail chain collected data on sales revenue and advertising expenditure from its' stores over the last ten (10) weeks. The table below shows the data collected: Advertising Expenditure ($ 000) Sales ($ 000) 3 5 76 50 250 700 450 3.5 75 4 150 4.5 7 200 750 7.5 800 8.5 1,100 a. If they are going to run a linear regression, identify which variable should...
1. Consider simple regression model (one independent variable) a) How do you find a confidence interval...
1. Consider simple regression model (one independent variable) a) How do you find a confidence interval for the coefficient of X? b) How do you conduct a test if the coefficient of X is 10 or not? c) How do you find a CI for the mean of the dependent variable if x=2?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT