Question

In: Statistics and Probability

2. A random sample of 395 people were surveyed and each person was asked to report...

2. A random sample of 395 people were surveyed and each person was asked to report the highest education level they obtained. The data that resulted from the survey is summarized in the following table:
    High School   Bachelors   Masters   Ph.d.   Total
Female   60   54   46   41   201
Male   40   44   53   57   194
Total   100   98   99   98   395
a. Are gender and education level dependent at 5% level of significance? (6mks)
b.State and explain two methods of studying correlation                    (4mks

Solutions

Expert Solution

Solution

Part (a)

Solution is based on the theory of Chi-square Test for Independence.

Final answers are given below. Back-up Theory and Details of calculations follow at the end.

DF

3

α

0.05

χ2crit

7.8147

χ2cal

8.0061

p-value

0.0459

Reject H0

Since null hypothesis of independence is rejected, we conclude that

gender and education level are NOT independent. Answer 1

Back-up Theory and Details of calculations

Suppose we have a contingency table with r rows representing r levels/grades of one attribute and c columns representing c levels/grades of another attribute.

The Chi-square Test of Independence is designed to test if the two attributes are associated.

Hypotheses

Null H0: The two attributes, namely gender and education level are independent .

Vs

Alternative H1: The two attributes are not independent

Test Statistic

χ2 = ∑(i = 1 to r, j = 1 to c){(Oij - Eij)2/Eij}, where Oij and Eij are respectively, the observed and the expected frequencies of the ijth cell of the contingency table.

Under H0, Eij = (Oi. x O.j)/O.., where Oi.,O.j, and O.. are respectively the ith row total, jth column total and grand total.

Calculations

Oij

j1

j2

j3

j4

Oi.

i1

60

54

46

41

201

i2

40

44

53

57

194

O.j

100

98

99

98

395

Eij = (Oi. X O.j)/N

j1

j2

j3

j4

Ei.

i1

50.8861

49.8684

50.3772

49.8684

201

i2

49.1139

48.1316

48.6228

48.1316

194

E.j

100

98

99

98

395

χ2ij

j1

j2

j3

j4

Total

i1

1.6323

0.3423

0.3803

1.5771

3.9321

i2

1.6912

0.3547

0.3941

1.6340

4.0740

Total

3.3236

0.6970

0.7744

3.2111

8.0061

Distribution

Under H0, χ2 ~ χ2n, where n = {(r - 1)(s - 1)}

Critical Value

Given level of significance as α, critical value is the upper α% point of χ2n.

p-value

P(χ2n > χ2cal)

Critical value and p-value obtained using Excel Function: Statistical CHIINV and CHIDIST are given in the above table.

Decision

Since χ2cal > χ2crit, or equivalently, p-value < α, H0is rejected.

Part (b)

Two methods of studying correlation

1. Chi-square Test for Independence which will ascertain if there is association

2. Theory of correlation and regression which will first ascertain if there is association and if so establish the actual correlation relationship. Answer 2

DONE


Related Solutions

Is gender independent of education level? A random sample of 395 people were surveyed and each...
Is gender independent of education level? A random sample of 395 people were surveyed and each person was asked to report the highest education level they obtained. The data that resulted from the survey is summarized in the following table: High School Bachelors Masters Ph.d. Total Female 60 54 46 41 201 Male 40 44 53 57 194 Total 100 98 99 98 395 Question: Are gender and education level dependent at 5% level of significance? In other words, given...
9.9. Is gender independent of education level? A random sample of people were surveyed and each...
9.9. Is gender independent of education level? A random sample of people were surveyed and each person was asked to report the highest education level they obtained. Perform a hypothesis test. Include all 5 steps. High School Bachelors Masters Female 30 60 54 Male 25 40 44
A random sample of 100 adults were surveyed, and they were asked if the regularly watch...
A random sample of 100 adults were surveyed, and they were asked if the regularly watch NFL games.  They were asked in their favorite team had ever won the Super Bowl. Won Super Bowl Did not win Super Bowl Row total Watch games 24 56 80 Do not watch games 11 9 20 Column total 35 65 100 Find: a) P(watch NFL games given their favorite team won the Super Bowl) b) P(favorite team won the Super Bowl and watch NFL...
A random sample of 20 workers in a factory were asked to report the age of...
A random sample of 20 workers in a factory were asked to report the age of their car and how many miles the vehicle had on it. A computer printout resulted in the following information. Variable Coef SE Coef t-ratio Prob Constant 7288.54 6591 1.11 <0.2826 Age 11630.6 1249 9.31 <0.0001 R sq = 82% R sq adj = 81.1% s = 19280 Find the LSRL A new worker starts next week and we know that his car is 7...
In 1990 a random sample of 136 people surveyed showed that 39 were vegetarians. In 2015...
In 1990 a random sample of 136 people surveyed showed that 39 were vegetarians. In 2015 a similar survey showed that in a random sample of 95 people, 36 were vegetarians. Test the claim that the proportion of people that are vegetarians has changed. Use a 5% level of significance Ho: H1: Statistic: Conclusion:
9.9. Is gender independent of education level? A random sample of people was surveyed and each...
9.9. Is gender independent of education level? A random sample of people was surveyed and each person was asked to report the highest education level they obtained. Perform a hypothesis test. Include all 5 steps. High School Bachelors Masters Female 30 60 54 Male 25 40 44 10.9. Compute and interpret the correlation coefficient for the following grades of 6 students selected at random. Mathematical Grade 70 92 80 74 65 83 English Grade 74 84 63 87 78 90
A random sample of 25 people is surveyed, and it was found that they spent an...
A random sample of 25 people is surveyed, and it was found that they spent an average of 32 hours a week watching TV. The standard deviation of their sample was 6 hours. (a) Calculate a 95% confidence interval for the average number of hours spent watching TV per week (population mean µ - using two-tailed distribution). (b) What is the 95% confidence interval for the single 26th person to be surveyed? (c) What is the 95% confidence interval (two-sided...
A sample of World Campus students were surveyed. They were asked which of the following they...
A sample of World Campus students were surveyed. They were asked which of the following they prefer to drink: beer, water, or neither. And, their biological sex was recorded. These data are presented in the table below. [75 points] Biological Sex Female Male Beer 61 137 Wine 144 44 Neither 76 62 H. In this sample, are biological sex and drink preference independent or related? Explain why.
A sample of World Campus students were surveyed. They were asked which of the following they...
A sample of World Campus students were surveyed. They were asked which of the following they prefer to drink: beer, water, or neither. And, their biological sex was recorded. These data are presented in the table below. [75 points] Biological Sex Female Male Beer 61 137 Wine 144 44 Neither 76 62 H. In this sample, are biological sex and drink preference independent or related? Explain why.
A sample of World Campus students were surveyed. They were asked which of the following they...
A sample of World Campus students were surveyed. They were asked which of the following they prefer to drink: beer, water, or neither. And, their biological sex was recorded. These data are presented in the table below. Biological Sex Female Male Beer 61 137 Wine 144 44 Neither 76 62 In this sample, are biological sex and drink preference independent or related? Explain why.   Compute the relative risk comparing the proportion of males who prefer beer to the proportion of...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT