Question

In: Statistics and Probability

3. Fit a multiple regression model that relates the salary to education, work experience, and time spent at the bank so far.

SALARY EDUC EXPER TIME
39000 12 0 1
40200 10 44 7
42900 12 5 30
43800 8 6 7
43800 8 8 6
43800 12 0 7
43800 12 0 10
43800 12 5 6
44400 15 75 2
45000 8 52 3
45000 12 8 19
46200 12 52 3
48000 8 70 20
48000 12 6 23
48000 12 11 12
48000 12 11 17
48000 12 63 22
48000 12 144 24
48000 12 163 12
48000 12 228 26
48000 12 381 1
48000 16 214 15
49800 8 318 25
51000 8 96 33
51000 12 36 15
51000 12 59 14
51000 15 115 1
51000 15 165 4
51000 16 123 12
51600 12 18 12
52200 8 102 29
52200 12 127 29
52800 8 90 11
52800 8 190 1
52800 12 107 11
54000 8 173 34
54000 8 228 33
54000 12 26 11
54000 12 36 33
54000 12 38 22
54000 12 82 29
54000 12 169 27
54000 12 244 1
54000 15 24 13
54000 15 49 27
54000 15 51 21
54000 15 122 33
55200 12 97 17
55200 12 196 32
55800 12 133 30
56400 12 55 9
57000 12 90 23
57000 12 117 25
57000 15 51 17
57000 15 61 11
57000 15 241 34
60000 12 121 30
60000 15 79 13
61200 12 209 21
63000 12 87 33
63000 15 231 15
46200 12 12 22
50400 15 14 3
51000 12 180 15
51000 12 315 2
52200 12 29 14
54000 12 7 21
54000 12 38 11
54000 12 113 3
54000 15 18 8
54000 15 359 11
57000 15 36 5
60000 8 320 21
60000 12 24 2
60000 12 32 17
60000 12 49 8
60000 12 56 33
60000 12 252 11
60000 12 272 19
60000 15 25 13
60000 15 36 32
60000 15 56 12
60000 15 64 33
60000 15 108 16
60000 16 46 3
63000 15 72 17
66000 15 64 16
66000 15 84 33
66000 15 216 16
68400 15 42 7
69000 12 175 10
69000 15 132 24
81000 16 55 33
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.549576953
R Square 0.302034828
Adjusted R Square 0.278507912
Standard Error 6027.28285
Observations 93
ANOVA
df SS MS F Significance F
Regression 3 1399124701 466374900.2 12.83784192 4.80E-07
Residual 89 3233204332 36328138.56
Total 92 4632329032
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 31794.79804 3834.248408 8.29231564 1.09E-12 24176.22813 39413.36796 24176.22813 39413.368
X Variable 1 1396.093359 277.1248641 5.037777335 2.45E-06 845.4521573 1946.734561 845.4521573 1946.73456
X Variable 2 14.84048274 6.970537846 2.129029792 0.036013792 0.990172517 28.69079297 0.990172517 28.690793
X Variable 3 206.290769 61.54401075 3.351922738 0.001179572 84.0041306 328.5774075 84.0041306 328.577407

3. Fit a multiple regression model that relates the salary to education, work experience, and time spent at the bank so far.

a - State what your model is.

b - Determine whether the independent variables are significant, or not, at a level of significance of 5%.

c - Which independent variable is most significant in explaining salary? Which is least significant?

d - Is your overall model significant? Provide statistical proof by conducting an F-test for overall fit of the regression. State the hypothesis to be tested, the p-value for your F-statistic, and your decision. How much weight of evidence is there in rejecting the null hypothesis?

Solutions

Expert Solution

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.549576953
R Square 0.302034828
Adjusted R Square 0.278507912
Standard Error 6027.28285
Observations 93
ANOVA
df SS MS F Significance F
Regression 3 1399124701 466374900.2 12.83784192 4.80329E-07
Residual 89 3233204332 36328138.56
Total 92 4632329032
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 31794.79804 3834.248408 8.29231564 1.08909E-12 24176.22813 39413.36796 24176.22813 39413.36796
EDUC 1396.093359 277.1248641 5.037777335 2.44893E-06 845.4521573 1946.734561 845.4521573 1946.734561
EXPER 14.84048274 6.970537846 2.129029792 0.036013792 0.990172517 28.69079297 0.990172517 28.69079297
TIME 206.290769 61.54401075 3.351922738 0.001179572 84.0041306 328.5774075 84.0041306 328.5774075

a)

Multuple regression:

SALARY =  31794.7980+1396.0934*EDUC+14.8405*EXPER+206.2908*TIME

b)

All independent variables are significanct at 5% significance level because all p values < 0.05

c)

Most significant variable is EDUCATION, Its P value is lessthan 0.00001

Least significant variable is EXPERIANCE, Its P value is between 0.01 < P value < 0.05

d)

Hypothesis:

H0: β1 = β2 = β3

Ha: At least one β is different

F stat = 12.8378

P value = 0

P value < 0.05

There is a strong evidence to conclude that Regression is significant at 5% significance level


Related Solutions

Fit a multiple regression model that relates the salary to education, work experience, and time spent...
Fit a multiple regression model that relates the salary to education, work experience, and time spent at the bank so far. a - State what your model is. b - Determine whether the independent variables are significant, or not, at a level of significance of 5%. c - Which independent variable is most significant in explaining salary? Which is least significant? d - Is your overall model significant? Provide statistical proof by conducting an F-test for overall fit of the...
Fit a multiple regression model that relates the salary to education, work experience, and time spent...
Fit a multiple regression model that relates the salary to education, work experience, and time spent at the bank so far. a - State what your model is. b - Determine whether the independent variables are significant, or not, at a level of significance of 5%. c - Which independent variable is most significant in explaining salary? Which is least significant? d - Is your overall model significant? Provide statistical proof by conducting an F-test for overall fit of the...
A multiple regression model is to be constructed to model the time spent using the internet...
A multiple regression model is to be constructed to model the time spent using the internet per week among internet users. The explanatory variables are age, hours spent working per week and annual income. Data has been collected on 30 randomly selected individuals: Time using internet (minutes) Age Hours working per week Annual income ('000) 140 56 39 28 257 35 31 79 163 35 35 34 115 33 52 27 182 45 36 37 214 51 57 80 187...
Using the data, fit an appropriate regression model to determine whether time spent studying (hours) is...
Using the data, fit an appropriate regression model to determine whether time spent studying (hours) is a useful predictor of the chance of passing the exam (result, 0=fail 1=pass). Formally assess the overall fit of the model. DATA three; INPUT result hours; /* result=0 is fail; result=1 is pass */ cards; 0 0.8 0 1.6 0 1.4 1 2.3 1 1.4 1 3.2 0 0.3 1 1.7 0 1.8 1 2.7 0 0.6 0 1.1 1 2.1 1 2.8 1...
Use the least squares method to fit a simple linear model that relates the salary (dependent...
Use the least squares method to fit a simple linear model that relates the salary (dependent variable) to education (independent variable). a- What is your model? State the hypothesis that is to be tested, the decision rule, the test statistic, and your decision, using a level of significance of 5%. b – What percentage of the variation in salary has been explained by the regression? c – Provide a 95% confidence interval estimate for the true slope value. d -...
Fit a multiple regression model using MPG as the dependent variable and DISP, HP, and WT...
Fit a multiple regression model using MPG as the dependent variable and DISP, HP, and WT as the independent variables. Is the overall regression model significant? Test at the α = 0.05 level of significance. State the null hypothesis, the alternative hypothesis =, the test statistic calculated and critical values and your test conclusion. mpg disp hp wt 21 160 110 2.62 21 160 110 2.875 22.8 108 93 2.32 21.4 258 110 3.215 18.7 360 175 3.44 18.1 225...
In the​ one-period model, education can be represented as time spent by the representative consumer that...
In the​ one-period model, education can be represented as time spent by the representative consumer that is neither leisure nor time applied to producing output. What the economy gains in the future is that the representative consumer then has more time​ available, as measured in terms of effective units of labour time​ (adjusted for skill​ level, or what economists call human​ capital). That​ is, the aggregate production function can be seen as Y=zF(K, eN)​, where e is the quantity of...
Fit a multiple linear regression model of the form y=β0 + β1 x1 + β2 x2...
Fit a multiple linear regression model of the form y=β0 + β1 x1 + β2 x2 + β3 x3 + ε. Here, ε is the random error term that is assumed to be normally distributed with 0 mean and constant variance. State the estimated regression function. How are the estimates of the three regression coefficients interpreted here? Provide your output, and interpretations in a worksheet titled “Regression Output.” Obtain the residuals and prepare a box-plot of the residuals. What information...
3. The time spent commuting from home to work for all employees of a very large...
3. The time spent commuting from home to work for all employees of a very large company has a normal distribution with a mean of 43.4 minutes and a standard deviation of 12.1 minutes. Calculate the probability that the mean time spent commuting from home to work by a sample of 21 employees will be in between 43.26 and 49.35.
Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient.
Your experience tells you that an independent variable is positively correlated to the dependent variable but a multiple regression model give it a negative coefficient. What could cause this? Your judgement is wrong. Statistics don't lie The software package made an error The homoscedasticity assumption has been violated The model may have correlated independent variables The heteroscedasticity assumption has been violated
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT