In: Statistics and Probability
A -year study conducted by the American Heart Association provided data on how age, blood pressure, and smoking relate to the risk of strokes. Assume that the following data are from a portion of this study. Risk is interpreted as the probability (times ) that the patient will have a stroke over the next -year period. For the smoking variable, define a dummy variable with indicating a smoker and indicating a nonsmoker.
| Risk | Age | Pressure | Smoker | ||||
| 12 | 57 | 153 | No | ||||
| 24 | 67 | 163 | No | ||||
| 13 | 58 | 153 | No | ||||
| 56 | 86 | 175 | Yes | ||||
| 28 | 59 | 194 | No | ||||
| 51 | 76 | 188 | Yes | ||||
| 18 | 56 | 157 | Yes | ||||
| 31 | 78 | 121 | No | ||||
| 37 | 80 | 136 | Yes | ||||
| 15 | 78 | 98 | No | ||||
| 22 | 71 | 154 | No | ||||
| 36 | 70 | 173 | Yes | ||||
| 15 | 67 | 135 | Yes | ||||
| 48 | 77 | 207 | Yes | ||||
| 15 | 60 | 199 | No | ||||
| 36 | 82 | 117 | Yes | ||||
| 8 | 66 | 168 | No | ||||
| 34 | 80 | 124 | Yes | ||||
| 3 | 62 | 115 | No | ||||
| 37 | 59 | 209 | Yes | ||||
a. Develop an estimated regression equation that relates risk of a stroke to the person’s age, blood pressure, and whether the person is a smoker.
Please round your answers to three decimal places. Enter negative value as negative number.
b. Is smoking a significant factor in the risk of a stroke? Explain. Use .
Compute the test statistic for the significance of smoking (to 2 decimals). Use t table.
The -value is - Select your answer -less than .01between .01 and .025between .025 and .05between .05 and .10greater than .10Item 6
What is your conclusion?
Smoking - Select your answer -isis notItem 7 significant factor.
c. What is the probability of a stroke over the next years for Art Speen, a -year-old smoker who has blood pressure of ?
(to 2 decimals).
Which of the following factors will the doctor recommend Art work on?
- Select your answer -b1 and b2b1 and b3b2 and b3b1, b2 and b3Item 9
Using Excel
0/1 for smoker - =IF(H2="YES",1,0)
| Risk | Age | Pressure | Smoker | 
| 12 | 57 | 153 | 0 | 
| 24 | 67 | 163 | 0 | 
| 13 | 58 | 153 | 0 | 
| 56 | 86 | 175 | 1 | 
| 28 | 59 | 194 | 0 | 
| 51 | 76 | 188 | 1 | 
| 18 | 56 | 157 | 1 | 
| 31 | 78 | 121 | 0 | 
| 37 | 80 | 136 | 1 | 
| 15 | 78 | 98 | 0 | 
| 22 | 71 | 154 | 0 | 
| 36 | 70 | 173 | 1 | 
| 15 | 67 | 135 | 1 | 
| 48 | 77 | 207 | 1 | 
| 15 | 60 | 199 | 0 | 
| 36 | 82 | 117 | 1 | 
| 8 | 66 | 168 | 0 | 
| 34 | 80 | 124 | 1 | 
| 3 | 62 | 115 | 0 | 
| 37 | 59 | 209 | 1 | 
| SUMMARY OUTPUT | |||||
| Regression Statistics | |||||
| Multiple R | 0.932917464 | ||||
| R Square | 0.870334994 | ||||
| Adjusted R Square | 0.846022805 | ||||
| Standard Error | 5.827840282 | ||||
| Observations | 20 | ||||
| ANOVA | |||||
| df | SS | MS | F | Significance F | |
| Regression | 3 | 3647.530442 | 1215.843481 | 35.7982988 | 2.50915E-07 | 
| Residual | 16 | 543.4195576 | 33.96372235 | ||
| Total | 19 | 4190.95 | |||
| Coefficients | Standard Error | t Stat | P-value | Lower 95% | |
| Intercept | -92.4208 | 15.5591 | -5.9400 | 0.0000 | -125.4046 | 
| Age | 1.0875 | 0.1691 | 6.4330 | 0.0000 | 0.7291 | 
| Pressure | 0.2515 | 0.0460 | 5.4644 | 0.0001 | 0.1539 | 
| Smoker | 8.7356 | 3.0405 | 2.8731 | 0.0110 | 2.2900 | 
y^= -92.421 + 1.088 Age + 0.252 Pressure + 8.736 Smoker
b)
p-value of smoker = 0.011
p-value between 0.01 and 0.025
Conclusion
we reject the null hypothesis
Smoking is a significant factor
c)
for smoker
y^= -92.421 + 1.088 Age + 0.252 Pressure + 8.736
just put the value of age and pressure to get y^
p-value of all three factors are less than 0.05
hence
all three b1,b2,b3