Question

In: Statistics and Probability

Consider the following contingency table, which you will use to conduct a χ2 (Chi2) test. Sociological...

Consider the following contingency table, which you will use to conduct a χ2 (Chi2) test.

  • Sociological tidbit: participation in pre-K (pre-school) has been shown to correlate with a LOT of later-in-life positive outcomes - from elementary school performance, to high school performance, to higher employment rates, to higher incomes, and lower numbers of arrests over the life course. Many sociologists of education agree that investing in pre-K is a huge step towards reducing inequality in society [Source (Links to an external site.)]

Here is fake data for the purposes of this problem - let's say this is data from a random sample of 30 year olds, asking about school achievements and pre-K participation.

Public Pre-K (Headstart, etc.) Private Pre-K No Pre-K TOTAL
Did Not Graduate from High School 25 50 25 100
Graduated from High School 75 150 25 250
TOTAL 100 200 50 350

A. What is your alternative hypothesis? (4 pts)
B. What are the degrees of freedom for this problem? (2 pts)
C. What is your alpha level for this problem? (2 pts)
D. How many respondents were observed to have attended public or private pre-K and graduated from high school? (Round to whole number if necessary). (4 pts)
E. How many respondents were expected to have attended public or private pre-K and did not graduate from high school? (Round to whole number if necessary). (4 pts)
F. What is your calculated χ2 (Chi2) statistic?  (Round to two decimal places). (4 pts)
G. Using the Bognar Chi2 App [Link (Links to an external site.)], what is the returned alpha level given your χ2 (Chi2) statistic and your df? (2 pts)
H. What decision do you make about the alternative hypothesis? Explain why. (2 pts)
I. Report your final results in a full sentence (as exemplified on your worksheets from class). (4 pts)
J. Based on the results of this test alone, can we say that pre-K participation CAUSES improvements in high school graduation rates? Why or why not? Explain.

Solutions

Expert Solution

Pre-K No Pre-K TOTAL
Did Not Graduate from High School 75 25 100
Graduated from High School 225 25 250
TOTAL 300 50 350

A. What is your alternative hypothesis? (4 pts)

H1: There is association between pre - K participation and school achievements.
B. What are the degrees of freedom for this problem? (2 pts)

df = (2-1)*(2-1) = 1
C. What is your alpha level for this problem? (2 pts)

0.05 (lets say)
D. How many respondents were observed to have attended public or private pre-K and graduated from high school? (Round to whole number if necessary). (4 pts)

250
E. How many respondents were expected to have attended public or private pre-K and did not graduate from high school? (Round to whole number if necessary). (4 pts)


F. What is your calculated χ2 (Chi2) statistic?  (Round to two decimal places). (4 pts)

The expected values are computed in terms of row and column totals. In fact, the formula is where Ri​ corresponds to the total sum of elements in row i, Cj​ corresponds to the total sum of elements in column j, and T is the grand total. The table below shows the calculations to obtain the table with expected values:

Expected Values Pre K No Pre K Total
Not graduate \ = 85.714350300×100​=85.714 \ = 14.28635050×100​=14.286 100
Graduate \= 214.286350300×250​=214.286 = 35.71435050×250​=35.714 250
Total 300 50 350

Based on the observed and expected values, the squared distances can be computed according to the following formula: (E - O)^2/E(E−O)2/E. The table with squared distances is shown below:

Squared Distances Pre K No Pre K
Not graduate \= 1.33985.714(75−85.714)2​=1.339 \ = 8.03614.286(25−14.286)2​=8.036
Graduate = 0.536214.286(225−214.286)2​=0.536 = 3.21435.714(25−35.714)2​=3.214

Null and Alternative Hypotheses

The following null and alternative hypotheses need to be tested:

H0​: The two variables are independent

Ha​: The two variables are dependent

This corresponds to a Chi-Square test of independence.

Rejection Region

Based on the information provided, the significance level is \α=0.05 , the number of degrees of freedom is df=(2−1)×(2−1)=1, so then the rejection region for this test is R={χ2:χ2>3.841}.

Test Statistics

The Chi-Squared statistic is computed as follows:

Decision about the null hypothesis

Since it is observed that χ2=13.125>χc2​=3.841, it is then concluded that the null hypothesis is rejected.

Conclusion

It is concluded that the null hypothesis Ho is rejected. Therefore, there is enough evidence to claim that the two variables are dependent, at the 0.05 significance level.

The corresponding p-value for the test is p=Pr(χ12​>13.125)=0.0003

G. Using the Bognar Chi2 App [Link (Links to an external site.)], what is the returned alpha level given your χ2 (Chi2) statistic and your df? (2 pts) I have no idea for this
H. What decision do you make about the alternative hypothesis? Explain why. (2 pts)

It is concluded that the null hypothesis Ho is rejected. Therefore, there is enough evidence to claim that the two variables are dependent, at the 0.05 significance level.
I. Report your final results in a full sentence (as exemplified on your worksheets from class). (4 pts)

We can say that participation in Pre-K influence success at stages of life by improving rates at school or college.
J. Based on the results of this test alone, can we say that pre-K participation CAUSES improvements in high school graduation rates? Why or why not? Explain.

Yes, as explained above.

Please rate my answer and comment for doubt


Related Solutions

When performing a χ2 test for independence in a contingency table with r rows and c​...
When performing a χ2 test for independence in a contingency table with r rows and c​ columns, determine the​ upper-tail critical value of the test statistic in each of the following circumstances. a. α=​0.05, r=6​, c=5 d. α=​0.01, r=5​, c=6 b. α=​0.01, r=3​, c=4 e. α=​0.01, r=4​, c=5 c. α=​0.01, r=3​, c=6 a. The critical value is ____. ​(Round to three decimal places as​ needed.) b. The critical value is _____. ​(Round to three decimal places as​ needed.) c. The...
Given the following contingency table, conduct a test for independence at the 1% significance level. (You...
Given the following contingency table, conduct a test for independence at the 1% significance level. (You may find it useful to reference the appropriate table: chi-square table or F table) Variable A Variable B 1 2 1 28 38 2 30 66 a. Choose the null and alternative hypotheses. H0: The two variables are independent.; HA: The two variables are dependent. H0: The two variables are dependent.; HA: The two variables are independent. b. Calculate the value of the test...
1. Given the following contingency table, conduct a test for independence at the 1% significance level....
1. Given the following contingency table, conduct a test for independence at the 1% significance level. (You may find it useful to reference the appropriate table: chi-square table or F table) Variable A Variable B 1 2 1 31 32 2 34 58 Calculate the value of the test statistic. (Round intermediate calculations to at least 4 decimal places and final answer to 3 decimal places.) 2. A market researcher for an automobile company suspects differences in preferred color between...
Use a x2 test to test the claim that in the given contingency table, the row variable and the column variable are independent.
Use a x2 test to test the claim that in the given contingency table, the row variable and the column variable are independent. 1) Use the sample data below to test whether car color affects the likelihood of being in an 1) accident. Use a significance level of 0.01 
Test the following hypotheses by using the χ2 goodness of fit test.
You may need to use the appropriate technology to answer this question. Test the following hypotheses by using the χ2 goodness of fit test. H0: pA = 0.40, pB = 0.40, and pC = 0.20 Ha: The population proportions are not pA = 0.40, pB = 0.40, and pC = 0.20. A sample of size 200 yielded 60 in category A, 120 in category B, and 20 in category C. Use α = 0.01 and test to see whether the...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test)...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test) in a business research project. What conclusion(s) could we draw from a Chi Square test?
Describe, through an example or two, the use of a contingency table analysis (Chi Square test)...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test) in a business research project. What conclusion(s) could we draw from a Chi Square test?
Use the provided contingency table and expected frequencies. At alpha equals 0.01​, test the hypothesis that...
Use the provided contingency table and expected frequencies. At alpha equals 0.01​, test the hypothesis that the variables are independent. What is the test statistic? Result stretched not stretched injury 16(20.979) 24(19.021) No injury 209(204.021) 180(184.979)
Which of the following interpretations of a Chi-square test is CORRECT? (you may use the table...
Which of the following interpretations of a Chi-square test is CORRECT? (you may use the table below as reference). a. Assume that a Chi-square test was conducted to test the goodness of fit to a 3:1 ratio and that a Chi-square value of 2.62 was obtained. The null hypothesis should be accepted. b. With the test of a 3:1 ratio, there are two degrees of freedom. c. The larger the Chi-square value, the more likely your results are real. d....
How are the expected values calculated for the contingency table of a test if independence and...
How are the expected values calculated for the contingency table of a test if independence and what is the reason that this formula works?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT