Question

In: Statistics and Probability

The R output below gives the summary of the multiple regression model for birth weight based...

The R output below gives the summary of the multiple regression model for birth weight based on both gestation length and smoking status:

lm(formula = Weight ~ Weeks + SmokingStatus, data = births)

Coefficients:

Estimate Std. Error t value Pr(>|t|)

(Intercept) -1724.42 558.84 -3.086 0.00265 **

Weeks 130.05 14.52 8.957 2.39e-14 ***

SmokingStatusSmoker -294.40 135.78 -2.168 0.03260 *

---

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 484.6 on 97 degrees of freedom

Multiple R-squared: 0.4636, Adjusted R-squared: 0.4525

F-statistic: 41.92 on 2 and 97 DF, p-value: 7.594e-14

(e) Based on the model output, what is the estimated birth weight for a birth at 35 weeks gestation to a non-smoking mother? [1 mark]

(f) Briefly interpret the value ‘130.05’ in the output. [1 mark]

g) Why do the residuals have 97 degrees of freedom? [1 mark]

h) Based on the multiple regression model, is there any evidence of a difference in mean birth weight between smoking and non-smoking mothers? Justify your conclusion with reference to the R output above. [2 marks]

(i) Briefly explain why the conclusion from the multiple regression model might be different to the conclusion from the two-sample t-test in (d). [2 marks]

Solutions

Expert Solution

The R output below gives the summary of the multiple regression model for birth weight based on both gestation length and smoking status:

lm(formula = Weight ~ Weeks + SmokingStatus, data = births)

Coefficients:

Estimate Std. Error t value Pr(>|t|)

(Intercept) -1724.42 558.84 -3.086 0.00265 **

Weeks 130.05 14.52 8.957 2.39e-14 ***

SmokingStatusSmoker -294.40 135.78 -2.168 0.03260 *

---

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 484.6 on 97 degrees of freedom

Multiple R-squared: 0.4636, Adjusted R-squared: 0.4525

F-statistic: 41.92 on 2 and 97 DF, p-value: 7.594e-14

(e) Based on the model output, what is the estimated birth weight for a birth at 35 weeks gestation to a non-smoking mother? [1 mark]

The estimated regression line is

Birth weight = -1724.42+130.05*weeks-294.40*smokingstatus.

Estimated Birth weight = -1724.42+130.05*35 -294.40*0

=2827.33

(f) Briefly interpret the value ‘130.05’ in the output. [1 mark]

When weeks increases by 1 unit, the birth weight increases by 130.05.

g) Why do the residuals have 97 degrees of freedom? [1 mark]

total sample size 100, therefore total Df = 100-1 =99 and df for regression is 2

df for residuals = 99-2=97.

h) Based on the multiple regression model, is there any evidence of a difference in mean birth weight between smoking and non-smoking mothers? Justify your conclusion with reference to the R output above. [2 marks]

calculated t = -2.168, P= 0.03260 which is < 0.05 level of significance. Regression coefficient for SmokingStatus is significant. Therefore there is sufficient evidence of a difference in mean birth weight between smoking and non-smoking mothers.

(i) Briefly explain why the conclusion from the multiple regression model might be different to the conclusion from the two-sample t-test in (d). [2 marks]

Multiple regression model might be different to the conclusion from the two-sample t-test because it considers the two independent variables together. two-sample t-test considers the two independent variables one at a time.


Related Solutions

The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression...
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression Statistics R-Square                                                       0.558 Regression Standard Error (S)                  863.100 Observations                                               35                                Coeff        StdError          t-Stat    Intercept               1283.000    352.000           3.65    X1                             25.228        8.631                       X2                               0.861        0.372           Questions: Interpret each coefficient.
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression...
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression Statistics R-Square                                                       0.558 Regression Standard Error (S)                  863.100 Observations                                               35                                Coeff        StdError          t-Stat    Intercept               1283.000    352.000           3.65    X1                             25.228        8.631                       X2                               0.861        0.372           Question: 1. A. Write the fitted regression equation. B. Write the estimated intercepts and slopes, associated with their corresponding standard errors. C. Interpret each coefficient.
Regression equation for Case 3.0: SUMMARY OUTPUT Regression Statistics Multiple R 0.957 R Square 0.915 Adjusted...
Regression equation for Case 3.0: SUMMARY OUTPUT Regression Statistics Multiple R 0.957 R Square 0.915 Adjusted R Square 0.908 Standard Error 5.779 Observations 52 ANOVA df SS MS F Significance F Regression 4 16947.86487 4236.9662 126.8841 1.45976E-24 Residual 47 1569.442824 33.392401 Total 51 18517.30769 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 39.08190 15.31261 2.55227 0.014012 8.27693 69.88687 X-Price -7.37039 0.98942 -7.44921 1.71E-09 -9.36084 -5.37994 Y-Price -5.42813 0.33793 -16.06289 1.03E-20 -6.10796 -4.74831 Z-Price 4.05067 0.33949 11.93173 7.95E-16...
The multiple regression model is estimated in Excel and part of the output is provided below....
The multiple regression model is estimated in Excel and part of the output is provided below. ANOVA df SS MS F Significance F Regression 3 3.39E+08 1.13E+08 1.327997 0.27152899 Residual 76 6.46E+09 85052151 Total 79 6.8E+09 Question 8 (1 point) Use the information from the ANOVA table to complete the following statement. To test the overall significance of this estimated regression model, the hypotheses would state there is    between attendance and the group of all explanatory variables, jointly. there is...
Dep.= Mileage Indep.= Octane SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard...
Dep.= Mileage Indep.= Octane SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 7.0000 ANOVA Significance df SS MS F F Regression 9.1970 Residual Total 169.4286 Standard Coefficients Error t Stat P-value Lower 95% Upper 95% Intercept -115.6768 Octane 1.5305 SE CI CI PI PI Predicted Predicted Lower Upper Lower Upper x0 Value Value 95% 95% 95% 95% 89.0000 1.4274 87.0000 2.0544 Is there a relationship between a car's gas MILEAGE (in miles/gallon) and the...
SUMMARY OUTPUT Regression Statistics Multiple R 0.727076179 R Square 0.528639771 Adjusted R Square 0.525504337 Standard Error...
SUMMARY OUTPUT Regression Statistics Multiple R 0.727076179 R Square 0.528639771 Adjusted R Square 0.525504337 Standard Error 3.573206748 Observations 455 ANOVA df SS MS F Significance F Regression 3 6458.025113 2152.67504 168.601791 2.7119E-73 Residual 451 5758.280717 12.7678065 Total 454 12216.30583 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 99.0% Upper 99.0% Intercept -0.250148858 0.359211364 -0.6963835 0.48654745 -0.9560846 0.45578693 -1.1793476 0.67904987 RBUK 0.025079378 0.023812698 1.05319345 0.29281626 -0.0217182 0.07187699 -0.0365187 0.08667745 RSUS 0.713727515 0.042328316 16.8617037 8.0578E-50 0.6305423 0.79691273 0.60423372 0.82322131...
SUMMARY OUTPUT Regression Statistics Multiple R 0.72707618 R Square 0.52863977 Adjusted R Square 0.52550434 Standard Error...
SUMMARY OUTPUT Regression Statistics Multiple R 0.72707618 R Square 0.52863977 Adjusted R Square 0.52550434 Standard Error 3.57320675 Observations 455 ANOVA df SS MS F Significance F Regression 3 6458.02511 2152.67504 168.601791 2.7119E-73 Residual 451 5758.28072 12.7678065 Total 454 12216.3058 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 99.0% Upper 99.0% Intercept -0.2501489 0.35921136 -0.6963835 0.48654745 -0.9560846 0.45578693 -1.1793476 0.67904987 RUK 0.02507938 0.0238127 1.05319345 0.29281626 -0.0217182 0.07187699 -0.0365187 0.08667745 RSUS 0.71372752 0.04232832 16.8617037 8.0578E-50 0.6305423 0.79691273 0.60423372 0.82322131...
SUMMARY OUTPUT Regression Statistics Multiple R 0.195389 R Square 0.038177 Adjusted R Square 0.037333 Standard Error...
SUMMARY OUTPUT Regression Statistics Multiple R 0.195389 R Square 0.038177 Adjusted R Square 0.037333 Standard Error 13.69067 Observations 1142 ANOVA df SS MS F Significance F Regression 1 8481.255 8481.255 45.2492 2.74E-11 Residual 1140 213675.2 187.4344 Total 1141 222156.4 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 40.19631 0.596741 67.35967 0 39.02547 41.36714 39.02547 41.36714 X Variable 1 7.31E-05 1.09E-05 6.726752 2.74E-11 5.18E-05 9.45E-05 5.18E-05 9.45E-05 Discuss the statistical significance of the model...
SUMMARY OUTPUT Regression Statistics Multiple R 0.396235 R Square 0.157002 Adjusted R Square 0.156262 Standard Error...
SUMMARY OUTPUT Regression Statistics Multiple R 0.396235 R Square 0.157002 Adjusted R Square 0.156262 Standard Error 18.42647 Observations 1142 ANOVA df SS MS F Significance F Regression 1 72088.71 72088.71 212.3161 3.12E-44 Residual 1140 387069.6 339.5348 Total 1141 459158.4 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 26.35917 0.803163 32.8192 7.4E-167 24.78333 27.93501 24.78333 27.93501 X Variable 1 0.000213 1.46E-05 14.57107 3.12E-44 0.000184 0.000242 0.000184 0.000242 a. Write the reqression equation. Discuss the...
SUMMARY OUTPUT Regression Statistics Multiple R 0.195389 R Square 0.038177 Adjusted R Square 0.037333 Standard Error...
SUMMARY OUTPUT Regression Statistics Multiple R 0.195389 R Square 0.038177 Adjusted R Square 0.037333 Standard Error 36578.71 Observations 1142 ANOVA df SS MS F Significance F Regression 1 6.05E+10 6.05E+10 45.2492 2.74E-11 Residual 1140 1.53E+12 1.34E+09 Total 1141 1.59E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 17779.38 3518.846 5.052617 5.07E-07 10875.24 24683.53 10875.24 24683.53 X Variable 1 522.0407 77.60665 6.726752 2.74E-11 369.7728 674.3086 369.7728 674.3086 Income using age Write the regression equation....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT