Question

In: Statistics and Probability

9.13 Using the SHHS data in Table 2.10,fit all possible multiple regression models (without interactions) that...

9.13

Using the SHHS data in Table 2.10,fit all possible multiple regression models (without interactions) that predict the y variable serum total cholesterol from diastolic blood pressure,systolic blood pressure,alcohol,carbon monoxide and cotinine. Scrutinize your results to understand how the x variables act in conjuction.For these data,which is the "best " multiple regression model for cholesterol? What percentage of variation does it explain?

Serum total cholesrerol (mmol/l) Diastolic blood pressure (mmHg) Systolic blood pressure (mmHg) Alcohol (g/day) Cigarettes (no./day) Carbon monoxide(ppm) Cotinine (ng/ml) CHD (1=yes,2=no)
5.75 80 121 5.4 0 6 13 2
6.76 83 139 64.6 0 4 3 2
6.47 76 113 21.5 20 21 284 2
7.11 79 124 8.2 40 57 395 2
5.42 100 127 24.4 20 29 283 2
7.04 79 148 13.6 0 3 0 2
5.75 79 124 54.6 0 3 1 2
7.14 100 127 6.2 0 1 0 2
6.1 79 138 0 0 1 3 2
6.55 85 133 2.4 0 2 0 2
6.29 92 141 0 0 7 0 2
5.98 100 183 21.5 20 55 245 1
5.71 78 119 50.2 0 14 424 2
6.89 90 143 16.7 0 4 0 1
4.9 85 132 40.6 4 7 82 2
6.23 88 139 16.7 25 24 324 2
7.71 109 154 7.2 1 3 11 1
5.73 93 136 10.8 0 2 0 1
6.54 100 149 26 0 3 0 2
7.16 73 107 2.9 25 29 315 1
6.13 92 132 23.9 0 2 2 2
6.25 87 123 31.1 0 7 10 2
5.19 97 141 12 0 3 4 1
6.05 74 118 23.9 0 3 0 2
7.12 85 133 24.4 0 2 0 2
5.71 88 121 45.4 0 8 2 2
6.19 69 129 24.8 15 40 367 1
6.73 98 129 52.6 15 21 233 2
5.34 70 123 38.3 1 2 7 2
4.79 82 127 23.9 0 2 1 2
6.78 74 104 4.8 0 4 7 2
6.1 88 123 86.1 0 3 1 1
4.35 88 128 15.5 20 11 554 2
7.1 79 136 7.4 10 9 189 1
5.85 102 150 4.1 0 6 0 2
6.74 68 109 1.2 15 15 230 2
7.55 80 135 92.1 25 29 472 2
7.86 78 131 23.9 6 55 407 1
6.92 101 137 2.5 0 3 0 2
6.64 97 139 119.6 40 16 298 2
6.46 76 142 62.2 40 31 404 1
5.99 73 108 0 0 2 4 2
5.39 77 112 11 30 11 251 2
6.35 81 133 16.2 0 3 0 2
5.86 88 147 88.5 0 3 0 2
5.64 65 111 0 20 16 271 2
6.6 102 149 65.8 0 3 1 2
6.76 75 140 12.4 0 2 0 2
5.51 75 125 0 25 16 441 2
7.15 92 131 31.1 20 36 434 1

Solutions

Expert Solution

Here I try to fit a multiple linear regression model using spss. I have seen that none of the variables are significant. Due to this the R squared value is very low . Only 14.5% of the total variation can be explained by the proposed model.  I am attatching the model summary here.

But the model satisfies all the assumptions for the multiple linear regression model. I attaching that also

From the table it is clear that there is no multicollinearity in the data.

the residuals are normal also.


Related Solutions

The data presented in Problem 7 are analyzed using multiple linear regression analysis and the models...
The data presented in Problem 7 are analyzed using multiple linear regression analysis and the models are shown here. In the models, the data are coded as 1 = new medication and 0 = standard medication, and age 65 and older is coded as 1 = yes and 0 = no. ŷ = 53.85 − 23.54 (Medication) ŷ = 45.31 − 19.88 (Medication) + 14.64 (Age 65 +) ŷ = 45.51 − 20.21 ( Medication ) + 14.29 ( Age...
Fit a multiple regression model using MPG as the dependent variable and DISP, HP, and WT...
Fit a multiple regression model using MPG as the dependent variable and DISP, HP, and WT as the independent variables. Is the overall regression model significant? Test at the α = 0.05 level of significance. State the null hypothesis, the alternative hypothesis =, the test statistic calculated and critical values and your test conclusion. mpg disp hp wt 21 160 110 2.62 21 160 110 2.875 22.8 108 93 2.32 21.4 258 110 3.215 18.7 360 175 3.44 18.1 225...
For data DEMOG, fit three simple linear regression models of the per capita income on each...
For data DEMOG, fit three simple linear regression models of the per capita income on each of the three predictor variables. Does a linear regression model appear to provide a good fit for each of the three predictor variables? Use all appropriate tests, descriptive measures, and plots to conclude your findings here. Which predictor variable leads to significant effect on the per capita income? usborn cap.income home pop Alabama 0.98656 21442 75.9 4040587 Alaska 0.93914 25675 34.0 550043 Arizona 0.90918...
The table below summarizes nested multiple regression models used to predict a person’s quality of life...
The table below summarizes nested multiple regression models used to predict a person’s quality of life score. Model 1 Model 2 Model 3 est. sig. est. sig. est. sig. intercept 16.68 <.001 9.16 <.001   7.57 <.001 size of social network   0.59 0.027 0.44 0.076   0.43 0.059 college degree — 3.73 0.014   3.87 0.030 time (yrs) at current job — —   0.91 0.046 # of siblings — — –0.68 0.146 R2R2 3.65% 8.01% 9.60% ΔR2ΔR2 3.65% 4.36% 1.59% FF (for ΔR2ΔR2)...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE DEGREES OF FREDOM FOR THE REGRESSION IS EQUAL TO: THE TEST TO CONFIRM WHETHER THE DEPENDENT VARIABLE CAN BE ESTIMATED WITHOUT RELYING ON THE INDEPENDENT VARIABLES IS REFERRED TO AS: WHAT STATISITIC WOULD WE USE FOR TESTING TO SEE IF AT LEAST ONE INDEPENDENT REGRESSION COEFFICIENTS IS SIGNIFICANT? TO TEST INDEPENDENT VARIABLES INDIVIDUALLY TO DETERMINE WHETHER THE REGRESSION COEFFICIENTS DIFFER FROM ZERO WOULD BE:...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE...
DEFINE THE COMPONENTS OF A GENERAL MULTIPLE REGRESSION EQUATION. IN A MULTIPLE REGRESSION ANOVA TABLE, THE DEGREES OF FREDOM FOR THE REGRESSION IS EQUAL TO: THE TEST TO CONFIRM WHETHER THE DEPENDENT VARIABLE CAN BE ESTIMATED WITHOUT RELYING ON THE INDEPENDENT VARIABLES IS REFERRED TO AS: WHAT STATISITIC WOULD WE USE FOR TESTING TO SEE IF AT LEAST ONE INDEPENDENT REGRESSION COEFFICIENTS IS SIGNIFICANT? TO TEST INDEPENDENT VARIABLES INDIVIDUALLY TO DETERMINE WHETHER THE REGRESSION COEFFICIENTS DIFFER FROM ZERO WOULD BE:...
USING MATLAB: Using the data from table below fit a fourth-order polynomial to the data, but...
USING MATLAB: Using the data from table below fit a fourth-order polynomial to the data, but use a label for the year starting at 1 instead of 1872. Plot the data and the fourth-order polynomial estimate you found, with appropriate labels. What values of coefficients did your program find? What is the LMS loss function value for your model on the data? Year Built SalePrice 1885 122500 1890 240000 1900 150000 1910 125500 1912 159900 1915 149500 1920 100000 1921...
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. 1. If an incoming student has Letters = 7 and SAT = 1000 what would his predicted College GPA be? a. 1.99 b. 2.01 c. ​​2.18 d. 2.39 2. What is the approximate error of this prediction? a. 0.58 b. ​​0.61 c....
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. C ollege GPA HighSchl GPA SAT Letters 2.04 2.01 1070 5 2.56 3.4 1254 6 3.75 3.68 1466 6 1.1 1.54 706 4 3 3.32 1160 5 0.05 0.33 756 3 1.38 0.36 1058 2 1.5 1.97 1008 7 1.38 2.03 1104...
I'm using 2005 NFL stats to come up with a multiple linear regression analysis models with...
I'm using 2005 NFL stats to come up with a multiple linear regression analysis models with the winning percentage being the dependent variable. My question would be, what are the most significant variables that are used in deciding an NFL team's capacity to win? Passing yards, rushing game, defense or field goals are some of my independent variables. But I’m considering adding the defensive stats to the regression. How do I complete the introduction and model subtopics for my presentation?...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT