In: Statistics and Probability
PLEASE SHOW WORKING SOLUTION
1.Recall in our discussion of the binomial distribution the research study that examined schoolchildren developing nausea and vomiting following holiday parties. The intent of this study was to calculate probabilities corresponding to a specified number of children becoming sick out of a given sample size. Recall also that the probability, i.e. the binomial parameter "p" defined as the probability of "success" for any individual, of a randomly selected schoolchild becoming sick was given.
Suppose you are now in a different reality, in which this binomial probability parameter p is now unknown to you but you are still interested in carrying out the original study described above, though you must first estimate p with a certain level of confidence. Furthermore, you would also like to collect data from adults to examine the difference between the proportion with nausea and vomiting following holiday parties of schoolchildren and adults, which will reflect any possible age differences in becoming sick. You obtain research funding to randomly sample 18 schoolchildren and 39 adults with an inclusion criterion that a given participant must have recently attended a holiday party, and conduct a medical evaluation by a certified pediatrician and general practitioner for the schoolchildren and adults, respectively. After anxiously awaiting your medical colleagues to complete their medical assessments, they email you data contained in the following tables.
|
|
What is the estimated 95% confidence interval (CI) of the difference in proportions between schoolchildren and adults developing nausea and vomiting following holiday parties? Assign groups 1 and 2 to be schoolchildren and adults, respectively.
Please note the following: 1) in practice, you as the analyst decide how to assign groups 1 and 2 and subsequently interpret the results appropriately in the context of the data, though for the purposes of this exercise the groups are assigned for you; 2) 0 and 1 are defined as no and yes, respectively, which is a typical coding scheme in Public Health; 3) you might calculate a CI that is different from any of the multiple choice options listed below due to rounding differences, therefore select the closest match; and 4) you may copy and paste the data into Excel to facilitate analysis.
Select one:
a. -0.1838 to 0.3718
b. -0.1636 to 0.4313
c. -0.2058 to 0.3272
d. -0.2123 to 0.4183
sample #1 ----->
first sample size, n1=
18
number of successes, sample 1 = x1=
10
proportion success of sample 1 , p̂1=
x1/n1= 0.5555556
sample #2 ----->
second sample size, n2 =
39
number of successes, sample 2 = x2 =
18
proportion success of sample 1 , p̂ 2= x2/n2 =
0.461538
difference in sample proportions, p̂1 - p̂2 =
0.5556 - 0.4615 =
0.0940
level of significance, α = 0.05
Z critical value = Z α/2 =
1.960 [excel function: =normsinv(α/2)
Std error , SE = SQRT(p̂1 * (1 - p̂1)/n1 + p̂2 *
(1-p̂2)/n2) = 0.14174
margin of error , E = Z*SE = 1.960
* 0.1417 = 0.27780
confidence interval is
lower limit = (p̂1 - p̂2) - E = 0.094
- 0.2778 = -0.1837849
upper limit = (p̂1 - p̂2) + E = 0.094
+ 0.2778 = 0.3718191
so, confidence interval is (
-0.1838 < p1 - p2 <
0.3718 )
option A)