In: Biology
You do some research and find that a census of Ireland in the 1910s found 65 individuals afflicted with cystic fibrosis out of 100,000 people. You want to investigate whether the gene involved in cystic fibrosis is under Hardy-Weinberg equilibrium. To do this, you use the frequencies observed in the present to determine the expected number of individuals in each class in the 1910 study.
Frequencies observed in the present: p=.97 , q=.03
Fill in the following table (2 points):
CC |
Cc |
cc |
Total |
|
Expected number of individuals (i.e. numbers based on present dataset) |
100,000 |
|||
Observed number of individuals in 1910 |
65 |
|||
?2 = |
Use the ?2 table provided below to infer the p-value of the test (0.5 points)
Reminder: the p value is found by cross referencing the ?2value with the critical value. For example, if your ?2 value is 5.34 and the degree of freedom is 1, you would say 0.05 > p-value> 0.01.
Can you conclude that the gene involved in Cystic fibrosis is in Hardy-Weinberg Equilibrium with 95% certainty? Explain why or why not in a sentence or two. (0.5 point)
Answer:
Based on the given information:
Present:
1910:
Chi Square Test:
Genotype | Observed No. in 1910 (O) | Expected Number current (E | (O-E) | (O-E)^2/E |
CC | 94966 | 94090 | 876 | 8.2 |
Cc | 4969 | 5820 | -851 | 124.4 |
cc | 65 | 90 | -25 | 6.9 |
Total | 100000 | 100000 | Chi square (sum) | 139.5 |
Degree of freedom = Number of categories - 1 = 3 - 1 = 2
P-Value corresponding to Chi square value of 139.5 with 2 degree of freedom with 95% certainity (significance level 0.05) : P-Value is < 0.00001
Since P-value is less than the significance level (0.05), therefore Null Hypothesis is rejected. Based on the experimental evidence, there is statistically significant difference in the expected frequency (current) and observed frequency (1910) and the population deviates from Hardy Weinberg Equilibrium.