3.5 Drop/remove the insignificant independent variable from the regression model, and develop and show an updated...

3.5 Drop/remove the insignificant independent variable from the regression model, and develop and show an updated estimated regression equation that can be used to predict the average annual salary for salaried employees given the average annual salary for hourly employees and the size of the company. Again, use the F test and α = 0.05 to test for overall significance. Also use the t test and α = 0.05 to determine the significance of the independent variables in this updated estimated regression equation How much percentage of the variability in y is explained by the updated estimated regression equation? (15 points)

Rank	Company	Size	Salaried ($1000s)	Hourly ($1000s)	Midsize	SmallSize
4	Wegmans Food Markets	Large	56	29	0	0
6	NetApp	Midsize	143	76	1	0
7	Camden Property Trust	Small	71	37	0	1
8	Recreational Equipment (REI)	Large	103	28	0	0
10	Quicken Loans	Midsize	78	54	1	0
11	Zappos.com	Midsize	48	25	1	0
12	Mercedes-Benz USA	Small	118	50	0	1
20	USAA	Large	96	47	0	0
22	The Container Store	Midsize	71	45	1	0
25	Ultimate Software	Small	166	56	0	1
37	Plante Moran	Small	73	45	0	1
42	Baptist Health South Florida	Large	126	80	0	0
50	World Wide Technology	Small	129	31	0	1
53	Methodist Hospital	Large	100	83	0	0
58	Perkins Coie	Small	189	63	0	1
60	American Express	Large	114	35	0	0
64	TDIndustries	Small	93	47	0	1
66	QuikTrip	Large	69	44	0	0
72	EOG Resources	Small	189	81	0	1
75	FactSet Research Systems	Small	103	51	0	1
80	Stryker	Large	71	43	0	0
81	SRC	Small	84	33	0	1
84	Booz Allen Hamilton	Large	105	77	0	0
91	CarMax	Large	57	34	0	0
93	GoDaddy.com	Midsize	105	71	1	0
94	KPMG	Large	79	59	0	0
95	Navy Federal Credit Union	Midsize	77	39	1	0
97	Schweitzer Engineering Labs	Small	99	28	0	1
99	Darden Restaurants	Large	57	24	0	0
100	Intercontinental Hotels Group	Large	63	26	0	0

Expert Solution

From the data it is seen that "Midsize" and "SmallSize" are two dummy variables created from the "Size" variable. Now while fitting regression model, it automatically creates dummy variables for categorical variables, so I am dropping the "Midsize" and "SmallSize" variables to avoid multicollinearity.

So finally fitting the linear regression model where response is "Salaried" and explanatory variables are "Hourly" and "Size". Below is the result of the model summary.

So from the result, it is seen that the F-statistic value is 11.72 and corresponding p-value is 4.817e-05 which is very very less than 0.05. This strongly tells that the model is overall significant.

Moreover, the p-value of the explanatory variables are less than 0.05 informing that both the Hourly and Size variable are significant in predicting Salaried variable.

R² = 0.5749. Then 57.49% of variability of y is explained by the updated regression Model.

orchestra answered 2 years ago

Provide an interpretation of this regression model based on this output received from SPSS. Independent variable...

Provide an interpretation of this regression model based on this output received from SPSS. Independent variable is verbal IQ score. Dependent variable is full IQ score. Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1 .875a .766 .764 7.003 a. Predictors: (Constant), verbiq Coefficientsa Model Unstandardized Coefficients Standardized Coefficients t Sig. 95.0% Confidence Interval for B B Std. Error Beta Lower Bound Upper Bound 1 (Constant) 12.502 3.987 3.136 .002 4.610 20.395 verbiq .928...

1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from...

1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from the computer output. Lannie Karner- GPA 3.6 Income 75k Courtney Sheperd Gpa 3.3 Income 74K Zenobia Roussel- GPA 2.9 Income 66K Elaine Doody- GPA 3.8 Income 80k Maudie Hocker-GPA 3.1 Income 65k Rick Hoover-GPA 3.2 Income 53k Franinca Ortez-GPA 2.7 Income 65k Li Kinder-GPA 3.3 Income 71k Brad Clem-GPA 3.8 Income 80k Soon Nettleton-GPA 4.0 Income 95k Vertie Yousesef-GPA 3.9 Income 110k Love Au-GPA...

1)________ regression brings one independent variable into the model at a time and does not allow...

1)________ regression brings one independent variable into the model at a time and does not allow any variables to leave once they have entered. Best subsets General stepwise Backward elimination Forward selection 2)The Centers for Disease Control (CDC) would like to test the hypothesis that the proportion of obese adults in the U.S. has increased this year when compared to last year. A random sample of 125 adults this year found that 56 were obese. Last year, a random sample...

Develop estimated regression equations, first using annual income as the independent variable and then using household...

Develop estimated regression equations, first using annual income as the independent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings - Income ($1000s) Household Size Amount Charged ($) 54 3 4,016 30 2 3,159 32 4 5,100 50 5 4,742 31 2 1,864 55 2 4,070 37 1 2,731 40 2 3,348 66 4 4,764 51 3 4,110 25 3 4,208 48 4 4,219 27...

Develop estimated regression equations, first using annual income as the independent variable and then using household...

Develop estimated regression equations, first using annual income as the independent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings - Income ($1000s) Household Size Amount Charged ($) 54 3 4,016 30 2 3,159 32 4 5,100 50 5 4,742 31 2 1,864 55 2 4,070 37 1 2,731 40 2 3,348 66 4 4,764 51 3 4,110 25 3 4,208 48 4 4,219 27...

Use the following data to develop a multiple regression model to predict from and . Discuss...

Use the following data to develop a multiple regression model to predict from and . Discuss the output, including comments about the overall strength of the model, the significance of the regression coefficients, and other indicators of model fit. y x1 x2 198 29 1.64 214 71 2.81 211 54 2.22 219 73 2.70 184 67 1.57 167 32 1.63 201 47 1.99 204 43 2.14 190 60 2.04 222 32 2.93 197 34 2.15 Appendix A Statistical Tables *(Round...

Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient.

Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient. What could cause this? Your judgement is wrong. Statistics don't lie The software package made an error The homoscedasticity assumption has been violated The model may have correlated independent variables The heteroscedasticity assumption has been violated

Data was collected from 40 employees to develop a regression model to predict the employee’s annual...

Data was collected from 40 employees to develop a regression model to predict the employee’s annual salary using their years with the company (Years), their starting salary in thousands (Starting), and their Gender (Male = 0, Female = 1). The level of significance is .01. The results from Excel regression analysis are shown below: SUMMARY OUTPUT Regression Statistics Multiple R 0.718714957 R Square 0.516551189 Adjusted R Square 0.476263788 Standard Error 10615.63461 Observations 40 ANOVA Df SS MS F Significance F...

a. If they are going to run a linear regression, identify which variable should be the independent variable and which should be the dependent variable in a regression equation.

In seeking to determine how influential advertising is, the management of a recently established retail chain collected data on sales revenue and advertising expenditure from its' stores over the last ten (10) weeks. The table below shows the data collected: Advertising Expenditure ($ 000) Sales ($ 000) 3 5 76 50 250 700 450 3.5 75 4 150 4.5 7 200 750 7.5 800 8.5 1,100 a. If they are going to run a linear regression, identify which variable should...

1. Consider simple regression model (one independent variable) a) How do you find a confidence interval...

1. Consider simple regression model (one independent variable) a) How do you find a confidence interval for the coefficient of X? b) How do you conduct a test if the coefficient of X is 10 or not? c) How do you find a CI for the mean of the dependent variable if x=2?

Question

3.5 Drop/remove the insignificant independent variable from the regression model, and develop and show an updated...

Solutions

Expert Solution

Related Solutions

Provide an interpretation of this regression model based on this output received from SPSS. Independent variable...

1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from...

1)________ regression brings one independent variable into the model at a time and does not allow...

Develop estimated regression equations, first using annual income as the independent variable and then using household...

Develop estimated regression equations, first using annual income as the independent variable and then using household...

Use the following data to develop a multiple regression model to predict from and . Discuss...

Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient.

Data was collected from 40 employees to develop a regression model to predict the employee’s annual...

a. If they are going to run a linear regression, identify which variable should be the independent variable and which should be the dependent variable in a regression equation.

1. Consider simple regression model (one independent variable) a) How do you find a confidence interval...