Question

In: Statistics and Probability

part of a long-term study of individuals aged 65 years or older, sociologists and physicians at...

part of a long-term study of individuals aged 65 years or older, sociologists and physicians at the Wentworth Medical Center in upstate New York investigated the relationship between geographic location and depression. A sample of 60 individuals, all in reasonably good health, was selected; 20 individuals were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. Each of the individuals sampled was given a standardized test to measure depression; higher test scores indicate higher levels of depression. The dataset can be found in the Worksheet titled “MEDICAL1”.

a) What is your response variable in this experiment, what is the factor, and what are the factor levels (treatments)?

b) You’re interested if there is a difference in Depression Score depending on the State. What are your null and alternative hypotheses for this?

c) Conduct an ANOVA at the alpha = 0.05 level to see if the State has an effect on the depression score. Show the ANOVA table including F statistic and p value. What is your conclusion?

d) To see which states are significantly different in the previous ANOVA (part c), you now want to conduct a multiple comparisons procedure. Run a Least Significant Difference test at the level alpha = 0.1 and determine which states, if any, are significantly different from each other.

e) Finally, we talked about comparison-wise vs experiment-wise Type 1 error in class. Run the LSD test (see part d) again, now including the Bonferroni adjustment. Consult the specification of the LSD test (?LSD.test) on how to do this. Using the alpha = 0.1 level, what changes in your conclusion when you compare the results with part d?

State Depression_Score
Florida 3
Florida 7
Florida 7
Florida 3
Florida 8
Florida 8
Florida 8
Florida 5
Florida 5
Florida 2
Florida 6
Florida 2
Florida 6
Florida 6
Florida 9
Florida 7
Florida 5
Florida 4
Florida 7
Florida 3
New.York 8
New.York 11
New.York 9
New.York 7
New.York 8
New.York 7
New.York 8
New.York 4
New.York 13
New.York 10
New.York 6
New.York 8
New.York 12
New.York 8
New.York 6
New.York 8
New.York 5
New.York 7
New.York 7
New.York 8
North.Carolina 10
North.Carolina 7
North.Carolina 3
North.Carolina 5
North.Carolina 11
North.Carolina 8
North.Carolina 4
North.Carolina 3
North.Carolina 7
North.Carolina 8
North.Carolina 8
North.Carolina 7
North.Carolina 3
North.Carolina 9
North.Carolina 8
North.Carolina 12
North.Carolina 6
North.Carolina 3
North.Carolina 8
North.Carolina 11

Solutions

Expert Solution

a) The Response Variable in this experiment is the "Depression Score", the factor variable is "State" and the factor levels (treatments) are "Florida", "New York" and "North Carolina".

b) The null and alternative hypotheses for this are:

H0 : The mean (average value of the dependent variable that is the "Depression Score") is the same for all the States.

H1 : The mean (average value of the dependent variable that is the "Depression Score") is not same for atleast one of the States.

c) The R-Code is:

depression_score = c(3,7,7,3,8, 8, 8, 5, 5, 2, 6, 2, 6, 6, 9, 7, 5, 4, 7, 3, 8, 11, 9, 7, 8, 7, 8, 4, 13, 10, 6, 8, 12, 8, 6, 8, 5, 7, 7, 8, 10, 7, 3, 5, 11, 8, 4, 3, 7, 8, 8, 7, 3, 9, 8, 12, 6, 3, 8, 11)
state = c(rep("Florida",20),rep("New York",20),rep("North Carolina",20))
summary(aov(depression_score~state))

---

Df Sum Sq Mean Sq F value Pr(>F)

state 2 61.0 30.517 5.241 0.00814 **
Residuals 57 331.9 5.823
---

qf(.95, df1=2, df2=57) #R-Code for obtaining the tabulated F-Value.
thus, Tabulated F-Value = 3.158843 < Calculated F-Value = 5.241

Conclusion:

Hence, we do not accept the null hypothesis that is the mean (average value of the dependent variable that is the "Depression Score") is not same for atleast one of the "States".

Again, the p-value = 0.00814 < 0.05 (level of significance) and hence, we do not accept the null hypothesis that is the mean (average value of the dependent variable that is the "Depression Score") is not same for atleast one of the "States".

d) The R-Code:

out = LSD.test(y, trt = "state", alpha = 0.1, p.adj="none")
out
$statistics
MSerror Df Mean CV t.value LSD
5.822807 57 6.866667 35.14149 1.672029 1.27588

$parameters
test p.ajusted name.t ntr alpha
Fisher-LSD none state 3 0.1

$means
depression_score std r LCL UCL Min Max Q25 Q50 Q75
Florida 5.55 2.139233 20 4.647816 6.452184 2 9 3.75 6.0 7.00
New York 8.00 2.200478 20 7.097816 8.902184 4 13 7.00 8.0 8.25
North Carolina 7.05 2.837252 20 6.147816 7.952184 3 12 4.75 7.5 8.25

$comparison
NULL

$groups
depression_score groups
New York 8.00 a
North Carolina 7.05 a
Florida 5.55 b

attr(,"class")
[1] "group"

Hence, we observe that the states "New York" and "North Carolina" are not significantly different from each other while "Florida" is significantly different from the other states.

e) The R-Code:

out = LSD.test(y, trt = "state", alpha = 0.1, p.adj="bonferroni")

out

$statistics
MSerror Df Mean CV t.value MSD
5.822807 57 6.866667 35.14149 2.180883 1.664173

$parameters
test p.ajusted name.t ntr alpha
Fisher-LSD bonferroni state 3 0.1

$means
depression_score std r LCL UCL Min Max Q25 Q50 Q75
Florida 5.55 2.139233 20 4.647816 6.452184 2 9 3.75 6.0 7.00
New York 8.00 2.200478 20 7.097816 8.902184 4 13 7.00 8.0 8.25
North Carolina 7.05 2.837252 20 6.147816 7.952184 3 12 4.75 7.5 8.25

$comparison
NULL

$groups
depression_score groups
New York 8.00 a
North Carolina 7.05 ab
Florida 5.55 b

attr(,"class")
[1] "group"

Hence, we observe that the states "New York" and "North Carolina" are not significantly different from each other while "Florida" and "North Carolina" are also not significantly different from the each other but "Florida" and "New York" are significantly different from each other.


Related Solutions

As part of a long-term study of individuals 65 yars of age or older, sociologists and...
As part of a long-term study of individuals 65 yars of age or older, sociologists and physicians at the Institute of Mental Health Research in Ottawa, investigated the relationship between geographic location and depression. A random of sample individuals, all in reasonably good health, were selected from Victoria, BC; Edmonton, Alberta; Winnipeg, Manitoba; and Halifax, Nova Scotia. Each of the individuals sampled was given a standardized test to measure depressionThe data collected has been provided to you; a higher test...
30. Total serum cholesterol levels for individuals 65 years of age or older are assumed to...
30. Total serum cholesterol levels for individuals 65 years of age or older are assumed to follow a normal distribution, with a mean of 182 and a standard deviation of 14.7. a. What proportion of individuals 65 years of age and older have cholesterol levels of 175 or more? b. What proportion of individuals 65 years of age and older have cholesterol levels between 150 and 175? c. If the top 10% of the cholesterol levels are assumed to be...
Q3: A study followed a population of 2000 women aged over 65 years for 10 years...
Q3: A study followed a population of 2000 women aged over 65 years for 10 years and measured the number of cases of osteoporosis diagnosed during that time period. The investigators were interested in the effect of regular exercise on the development of osteoporosis and divided he women into two groups: 1000 women who took regular exercise and 1000 women who did not take regular exercise. The investigators recorded 800 new cases of osteoporosis over the 10 years of the...
Please show work. Total serum cholesterol levels for individuals 65 years of age and older are...
Please show work. Total serum cholesterol levels for individuals 65 years of age and older are assumed to follow a normal distribution, with a mean of 182 mg/dL and a standard deviation of 14.7 mg/dL. 10) What proportion of these individuals have cholesterol levels of 175 mg/dL or more? 11) What proportion of these individuals have cholesterol levels between 150 mg/dL and 175 mg/dL? 12) If the top 10% of the cholesterol levels are assumed to be abnormally high, what...
Walter is retired and is paying for long-term care insurance. He is 65 years old and...
Walter is retired and is paying for long-term care insurance. He is 65 years old and pays $4,200 in long-term care insurance premiums and incurred $2,350 worth of long-term care insurance expenses in 2018. If Walter's AGI is $23,500, how much of the long-term car einsurance premiums and expenses can Walter deduct on Schedule A?
In a previous? year, 59?% of females aged 15 years of age and older lived alone....
In a previous? year, 59?% of females aged 15 years of age and older lived alone. A sociologist tests whether this percentage is different today by conducting a random sample of 800 females aged 15 years of age and older and finds that 448 are living alone. Is there sufficient evidence at the alpha equals0.01 level of significance to conclude the proportion has? changed?
In 2000, 58% of females aged 15 years of age and older lived alone, according to...
In 2000, 58% of females aged 15 years of age and older lived alone, according to the U.S. Census Bureau. A sociologist tests whether this percentage is decreased today by conducting a random sample of 500 females aged 15 years of age and older and finds that 280 are living alone. Is there sufficient evidence at the α=0.05 level of significance to conclude the proportion has decreased since 2000? If a random sample of 30 females aged 15 years of...
The average age of online consumers ten years ago was 23.3 years. As older individuals gain...
The average age of online consumers ten years ago was 23.3 years. As older individuals gain confidence with the Internet, it is believed that the average age has increased. We would like to test this belief. (a) Write the appropriate hypotheses to be tested. (b) The online shoppers in our sample consisted of 40 individuals, had an average age of 24.2 years, with a standard deviation of 5.3 years. What is the test statistic and p‐value for the hypotheses being...
Adults aged 18 years old and older were randomly selected for a survey on obesity. Adults...
Adults aged 18 years old and older were randomly selected for a survey on obesity. Adults are considered obese if their body mass index (BMI) is at least 30. The researchers wanted to determine if the proportion of women in the south who are obese is less than the proportion of southern men who are obese. The results are shown in the table below. # who are obese sample size Men 42,769 155,525 Women 67,169 248,775 At the1% level of...
Your hospital wants to decrease the rate of falls in patient older than 65 years of...
Your hospital wants to decrease the rate of falls in patient older than 65 years of age. You have been asked to conduct an EBP project regarding these patient outcomes. For the Fall study discuss what descriptive research design could be used to investigate this problem. What would be the research question? Why did you select this type of study design? please include reference
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT