Question

In a medical study, a doctor is interested in determining what factors affect forced expiratory volume (FEV) values. The doctor randomly samples 15 patients and records their smoking status and age. Using the data, create the regression output (including the y-intercept and both independent variables), then answer the questions below. Note that smoking status and age are recorded as categorical variables.

  • Write out the estimated regression line. To receive full credit, you must label all variables/categories.

  • Run an overall F-test to determine if the multiple regression model is significant at a 5% significance level. To receive full credit, you must give me the test statistic, the p-value, and whether the model is significant (Yes or No).

  • Predict the FEV value for a smoker over 50 years old.
  • Estimate the average FEV for non-smokers between 30-40 years old.

FEV     Smoking Status   Age
0.79    smoker           < 30
0.81    smoker           30 - 40
0.93    non-smoker       30 - 40
0.59    smoker           40 - 50
0.77    non-smoker       > 50
0.61    smoker           40 - 50
0.95    non-smoker       < 30
0.87    non-smoker       < 30
0.63    smoker           40 - 50
0.88    non-smoker       30 - 40
0.61    smoker           > 50
0.64    smoker           < 30
0.67    smoker           30 - 40
0.87    non-smoker       40 - 50
0.93    non-smoker       < 30

Solutions

Expert Solution

Let y = forced expiratory volume (FEV)

    x1 = smoking status

    x2 = age group

For calculation purposes, let's code x1 and x2 numerically.

If smoker then x1 = 1; if non-smoker then x1 = 0.

If age < 30 then x2 = 1; if age is 30 - 40 then x2 = 2;

if age is 40 - 50 then x2 = 3; if age > 50 then x2 = 4.

Note that this codes age group as a single equally spaced score (1 to 4), not as a separate dummy variable for each age category.
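The coding step can be sketched in Python. This is only an illustration: the names `SMOKING_CODE`, `AGE_CODE`, and `encode` are hypothetical, and the rows are copied from the question's data table.

```python
# Hypothetical lookup tables mirroring the numeric coding described above.
SMOKING_CODE = {"smoker": 1, "non-smoker": 0}
AGE_CODE = {"< 30": 1, "30 - 40": 2, "40 - 50": 3, "> 50": 4}

# (FEV, smoking status, age group) rows from the question's data table.
rows = [
    (0.79, "smoker", "< 30"), (0.81, "smoker", "30 - 40"),
    (0.93, "non-smoker", "30 - 40"), (0.59, "smoker", "40 - 50"),
    (0.77, "non-smoker", "> 50"), (0.61, "smoker", "40 - 50"),
    (0.95, "non-smoker", "< 30"), (0.87, "non-smoker", "< 30"),
    (0.63, "smoker", "40 - 50"), (0.88, "non-smoker", "30 - 40"),
    (0.61, "smoker", "> 50"), (0.64, "smoker", "< 30"),
    (0.67, "smoker", "30 - 40"), (0.87, "non-smoker", "40 - 50"),
    (0.93, "non-smoker", "< 30"),
]


def encode(row):
    """Return (y, x1, x2) for one raw (FEV, status, age) row."""
    fev, status, age = row
    return fev, SMOKING_CODE[status], AGE_CODE[age]


encoded = [encode(r) for r in rows]
print(encoded[0])  # (0.79, 1, 1)
```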

So the regression equation is as below:

y = m1x1 + m2x2 + b          ............ (1)

where m1 and m2 are the coefficients of x1 and x2 respectively, and b is the constant (intercept).

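The detailed regression output was produced with the Regression tool in Excel's Data Analysis ToolPak.

From that output we get the values below:

Test Statistic: F = 32.28562325 (with 2 and 12 degrees of freedom)
P-Value: the p-value of the overall F-test is the "Significance F" entry in the ANOVA table, approximately 0.000015. (The value 0.0000000000035370710 in the output is the p-value for the intercept, not for the overall test.)

Is the model significant?

                                    Yes

Since the p-value is far below 0.05, the model is significant at the 5% significance level.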

From the above output we get the regression equation as:

y = (-0.199808774)x1 + (-0.045748031)x2 + 0.977210349          ............ (2)

Now we want to predict FEV (y) for a smoker (x1) over 50 years old (x2).

Here x1 = 1 and x2 = 4, using the coding defined above.

Substituting these values into the regression equation gives:

y = (-0.199808774)*1 + (-0.045748031)*4 + 0.977210349

y = 0.594409 ≈ 0.59 (2 decimals)

Hence the predicted FEV for a smoker over 50 years old = 0.59
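The substitution can be written as a small function. This is a sketch using the coefficients reported in the solution; `predict_fev` is a hypothetical name.

```python
def predict_fev(x1, x2):
    """Predicted FEV from the fitted equation y = m1*x1 + m2*x2 + b.

    x1: 1 for smoker, 0 for non-smoker.
    x2: age-group code, 1 (< 30) through 4 (> 50).
    """
    m1, m2, b = -0.199808774, -0.045748031, 0.977210349
    return m1 * x1 + m2 * x2 + b


# Smoker (x1 = 1) over 50 years old (x2 = 4).
prediction = predict_fev(1, 4)
print(round(prediction, 2))  # 0.59
```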

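Now we want to estimate the average FEV for non-smokers between 30-40 years old.

Here x1 = 0 and x2 = 2, so from regression equation (2):

y = (-0.199808774)*0 + (-0.045748031)*2 + 0.977210349

y = 0.885714 ≈ 0.89 (2 decimals)

As a check against the raw data, there are 7 non-smoker patients in total. Of these 7, there are 2 patients between 30-40 years old, as below:

FEV Smoking Status Age
0.93 non-smoker 30 - 40
0.88 non-smoker 30 - 40

Their sample average FEV = (0.93 + 0.88)/2 = 0.905, which is close to the regression estimate.

Estimated average FEV for non-smokers between 30-40 years old = 0.89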
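A sample average for any group can be read straight from the data table. The question asks about non-smokers aged 30 - 40, so the sketch below computes that group alongside the smoker group for comparison. `group_mean` is a hypothetical helper, and the rows are copied from the question.

```python
# (FEV, smoking status, age group) rows from the question's data table.
rows = [
    (0.79, "smoker", "< 30"), (0.81, "smoker", "30 - 40"),
    (0.93, "non-smoker", "30 - 40"), (0.59, "smoker", "40 - 50"),
    (0.77, "non-smoker", "> 50"), (0.61, "smoker", "40 - 50"),
    (0.95, "non-smoker", "< 30"), (0.87, "non-smoker", "< 30"),
    (0.63, "smoker", "40 - 50"), (0.88, "non-smoker", "30 - 40"),
    (0.61, "smoker", "> 50"), (0.64, "smoker", "< 30"),
    (0.67, "smoker", "30 - 40"), (0.87, "non-smoker", "40 - 50"),
    (0.93, "non-smoker", "< 30"),
]


def group_mean(rows, status, age):
    """Average FEV of the patients matching a smoking status and age group."""
    values = [fev for fev, s, a in rows if s == status and a == age]
    if not values:
        raise ValueError("no patients in this group")
    return sum(values) / len(values)


non_smoker_mean = group_mean(rows, "non-smoker", "30 - 40")
smoker_mean = group_mean(rows, "smoker", "30 - 40")
print(round(non_smoker_mean, 3))  # 0.905
print(round(smoker_mean, 2))      # 0.74
```

With only two patients per group, these raw averages are noisy, which is one reason to prefer the regression-based estimate of the mean response.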

