Question

In: Math

1. Define the following terms: A. Contingency table B. Chi-square test 2. List the assumptions required...

1. Define the following terms:

A. Contingency table

B. Chi-square test

2. List the assumptions required to perform a chi-square test?

Solutions

Expert Solution

solution:

1) Contingency Table

A two-way table (also called a contingency table) is a useful tool for examining relationships between categorical variables. The entries in the cells of a two-way table can be frequency counts or relative frequencies (just like a one-way table ).

Dance Sports TV Total
Men 2 10 8 20
Women 16 6 8 30
Total 18 16 16 50

The two-way table above shows the favorite leisure activities for 50 adults - 20 men and 30 women. Because entries in the table are frequency counts, the table is a frequency table .

B)

1) Chi-Square Goodness of Fit Test

A chi-square goodness of fit test attempts to answer the following question: Are sample data consistent with a hypothesized distribution?

The test is appropriate when the following conditions are met:

  • The sampling method is simple random sampling .
  • The population is at least 10 times as large as the sample.
  • The variable under study is categorical .
  • The expected value for each level of the variable is at least 5.

Here is how to conduct the test.

  • Define hypotheses. For a chi-square goodness of fit test, the hypotheses take the following form.

    H0: The data are consistent with a specified distribution.
    Ha: The data are not consistent with a specified distribution.

    Typically, the null hypothesis specifies the proportion of observations at each level of the categorical variable. The alternative hypothesis is that at least one of the specified proportions is not true.

  • Specify significance level. Often, researchers choose significance levels equal to 0.01, 0.05, or 0.10; but any value between 0 and 1 can be used.
  • Find degrees of freedom. The degrees of freedom (DF) is equal to the number of levels (k) of the categorical variable minus one: DF = k - 1 .
  • Compute expected frequency counts. The expected frequency counts at each level of the categorical variable are equal to the sample size times the hypothesized proportion from the null hypothesis

    Ei = npi

    where Ei is the expected frequency count for the ith level of the categorical variable, n is the total sample size, and pi is the hypothesized proportion of observations in level i.
  • Find test statistic. The test statistic is a chi-square random variable (Χ2) defined by the following equation.

    Χ2 = Σ [ (Oi - Ei)2 / Ei ]

    where Oi is the observed frequency count for the ith level of the categorical variable, and Ei is the expected frequency count for the ith level of the categorical variable.
  • Find P-value. The P-value is the probability of observing a sample statistic as extreme as the test statistic. Since the test statistic is a chi-square, use the Chi-Square Distribution Calculator to assess the probability associated with the test statistic. Use the degrees of freedom computed above.

If the sample findings are unlikely, given the null hypothesis, the researcher rejects the null hypothesis. Typically, this involves comparing the P-value to the significance level , and rejecting the null hypothesis when the P-value is less than the significance level.

2)

Chi-Square Test for Independence

A chi-square test for independence is applied when you have two categorical variables from a single population. It is used to determine whether there is a significant association between the two variables.

The test consists of four steps: (1) state the hypotheses, (2) formulate an analysis plan, (3) analyze sample data, and (4) interpret results.

  • State the hypotheses. A chi-square test for independence is conducted on two categorical variables. Suppose that Variable A has r levels, and Variable B has c levels. The null hypothesis states that knowing the level of Variable A does not help you predict the level of Variable B. That is, the variables are independent. The alternative hypothesis states that the variables are not independent.
  • Formulate analysis plan. The analysis plan describes how to use sample data to accept or reject the null hypothesis. The plan should specify a significance level and should identify the chi-square test for independence as the test method.
  • Analyze sample data. Using sample data, find the degrees of freedom, expected frequencies, test statistic, and the P-value associated with the test statistic.
    • Degrees of freedom. The degrees of freedom (DF) is equal to:

      DF = (r - 1) * (c - 1)

      where r is the number of levels for one catagorical variable, and c is the number of levels for the other categorical variable.
    • Expected frequencies. The expected frequency counts are computed separately for each level of one categorical variable at each level of the other categorical variable. Compute r*c expected frequencies, according to the following formula.

      Er,c = (nr * nc) / n

      where Er,c is the expected frequency count for level r of Variable A and level c of Variable B, nr is the total number of sample observations at level r of Variable A, nc is the total number of sample observations at level cof Variable B, and n is the total sample size.
    • Test statistic. The test statistic is a chi-square random variable (Χ2) defined by the following equation.

      Χ2 = Σ [ (Or,c - Er,c)2 / Er,c ]

      where Or,c is the observed frequency count at level r of Variable A and level c of Variable B, and Er,c is the expected frequency count at level r of Variable A and level c of Variable B.
    • P-value. The P-value is the probability of observing a sample statistic as extreme as the test statistic. Since the test statistic is a chi-square, use the Chi-Square Distribution Calculator to assess the probability associated with the test statistic. Use the degrees of freedom computed above.
  • Interpret results. If the sample findings are unlikely, given the null hypothesis, the researcher rejects the null hypothesis. Typically, this involves comparing the P-value to the significance level , and rejecting the null hypothesis when the P-value is less than the significance level.

3)

Chi-Square Test for Homogeneity

The chi-square test of homogeneity is applied to a single categorical variable . It is used to compare the distribution of frequency counts across different populations. It answers the following question: Are frequency counts distributed identically across different populations?

The test procedure is appropriate when the following conditions are met:

  • For each population, the sampling method is simple random sampling .
  • The population is at least 10 times as large as the sample.
  • The variable under study is categorical .
  • If sample data are displayed in a contingency table (Populations x Category levels), the expected frequency count for each cell of the table is at least 5.

This approach consists of four steps: (1) state the hypotheses, (2) formulate an analysis plan, (3) analyze sample data, and (4) interpret results.

  • State the hypothesis. Every hypothesis test requires a null hypothesis and an alternative hypothesis. Suppose that data were sampled from r populations, and assume that the categorical variable had c levels. At any specified level of the categorical variable, the null hypothesis states that each population has the same proportion of observations. Thus,

    H0: Plevel 1 of population 1 = Plevel 1 of population 2 = . . . = Plevel 1 of population r
    H0: Plevel 2 of population 1 = Plevel 2 of population 2 = . . . = Plevel 2 of population r
    . . .
    H0: Plevel c of population 1 = Plevel c of population 2 = . . . = Plevel c of population r


    The alternative hypothesis (Ha) is that at least one of the null hypothesis statements is false.
  • Formulate an analysis plan. The analysis plan describes how to use sample data to accept or reject the null hypothesis. The plan should specify the significance level and the test method (i.e., the chi-square test of homogeneity).
  • Analyze sample data. Using sample data from the contingency tables, find the degrees of freedom, expected frequency counts, test statistic, and the P-value associated with the test statistic. The analysis described in this section is illustrated in the sample problem at the end of this lesson.
    • Degrees of freedom. The degrees of freedom (DF) is equal to:

      DF = (r - 1) * (c - 1)

      where r is the number of populations, and c is the number of levels for the categorical variable.
    • Expected frequency counts. The expected frequency counts are computed separately for each population at each level of the categorical variable, according to the following formula.

      Er,c = (nr * nc) / n

      where Er,c is the expected frequency count for population r at level c of the categorical variable, nr is the total number of observations from population r, nc is the total number of observations at treatment level c, and n is the total sample size.
    • Test statistic. The test statistic is a chi-square random variable (Χ2) defined by the following equation.

      Χ2 = Σ [ (Or,c - Er,c)2 / Er,c ]

      where Or,c is the observed frequency count in population r for level c of the categorical variable, and Er,c is the expected frequency count in population r for level c of the categorical variable.
    • P-value. The P-value is the probability of observing a sample statistic as extreme as the test statistic. Since the test statistic is a chi-square, use the Chi-Square Distribution Calculator to assess the probability associated with the test statistic. Use the degrees of freedom computed above.
    • Interpret results. If the sample findings are unlikely, given the null hypothesis, the researcher rejects the null hypothesis. Typically, this involves comparing the P-value to the significance level , and rejecting the null hypothesis when the P-value is less than the significance level.

Related Solutions

3. How many qualitative variables is the chi-square test of a contingency table a test of...
3. How many qualitative variables is the chi-square test of a contingency table a test of independence for? 11.3 Chi-Square Test for Independence 4. How many expected cell frequencies satisfy subsequent assumptions, so that the chi-square test of a contingency table is valid? 11.3 Chi-Square Test for Independence Page 368 Test Review Questions Chapter 12 1. What is the name of the variable that's used to predict another variable? 12.1 The Simple Linear Regression Model 2. What is the “standard...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test)...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test) in a business research project. What conclusion(s) could we draw from a Chi Square test?
Describe, through an example or two, the use of a contingency table analysis (Chi Square test)...
Describe, through an example or two, the use of a contingency table analysis (Chi Square test) in a business research project. What conclusion(s) could we draw from a Chi Square test?
When we run a chi-square test of independence on a 2x2 table, the resulting Chi-square test...
When we run a chi-square test of independence on a 2x2 table, the resulting Chi-square test statistic would be equal to the square of the Z-test statistic from the Z-test of two independent proportions. True or false
In Chi-square goodness of fit and contingency table tests, it is sometimes necessary to reduce the...
In Chi-square goodness of fit and contingency table tests, it is sometimes necessary to reduce the number of classifications used in order to do what? Explain why. Statistics
1) What’s the difference among the chi-square test for goodness of fit, the chi-square test for...
1) What’s the difference among the chi-square test for goodness of fit, the chi-square test for independence, and the chi-square test for homogeneity 2) State the requirements to perform a chi-square test
For each of the following examples, state whether the chi-square goodness-of-fit test or the chi-square test...
For each of the following examples, state whether the chi-square goodness-of-fit test or the chi-square test for independence is appropriate, and state the degrees of freedom (df) for the test. Part A) An instructor tests whether class attendance (low, high) and grade point average (low, average, high) are independent. - State whether the chi-square goodness-of-fit test or the chi-square test for independence is appropriate. - State the degrees of freedom for the test. df = Part B) A student tests...
by utilizing this chi-Square Test answer the following question and explain what the table means. Are...
by utilizing this chi-Square Test answer the following question and explain what the table means. Are Americans in favor of the use of unmanned drones by police agencies in the United States? Table 4 Chi-Square Test for Association: Gender, Opinion Gender -1 0 1 Total 39 17 44 100 Male 39.2 25.13 35.68 0.0010 2.6278 1.9409 39 33 27 99 Female 38.8 24.87 35.32 0.0010 2.6544 1.9605 Total 78 50 71 199 Note. Possible responses (-1, 0, and 1). -1...
Which of the following interpretations of a Chi-square test is CORRECT? (you may use the table...
Which of the following interpretations of a Chi-square test is CORRECT? (you may use the table below as reference). a. Assume that a Chi-square test was conducted to test the goodness of fit to a 3:1 ratio and that a Chi-square value of 2.62 was obtained. The null hypothesis should be accepted. b. With the test of a 3:1 ratio, there are two degrees of freedom. c. The larger the Chi-square value, the more likely your results are real. d....
Answer True or False for the following: 1. Chi-square requires assumptions about the shape of the...
Answer True or False for the following: 1. Chi-square requires assumptions about the shape of the population distribution from which a sample is drawn. 2. Goodness of fit refers to how close the observed data are to those predicted from a hypothesis. 3. The null hypothesis (H0) states that no association exists between the two cross-tabulated variables in the population, and therefore the variables are statistically independent. 4. High chi square values indicate a high probability that the observed deviations...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT