Question

In: Math

Explain the different hypothesis tests one could use when assessing the distribution of a categorical variable...

Explain the different hypothesis tests one could use when assessing the distribution of a categorical variable (e.g. smoking status) with only two levels (e.g. levels: smoker and non-smoker) vs. more than two levels (e.g. levels: heavy smoker, moderate smoker, occasional smoker, non-smoker).

Be precise. Use the language of the textbook to identify the appropriate test and how you would conduct it. NOTE: Minimum of 150 words for primary post and 50 words for each of three replies to your peers.

Solutions

Expert Solution

The hypothesis test that should be used in this case is the Chi Square Test of Goodness of Fit.

This test is used to test the validity of a distribution assumed for a random phenomenon.

It evaluates the null hypotheses H0 (that the data are governed by the assumed distribution) against the alternative (that the data are not drawn from the assumed distribution).

Let p1, p2, ..., pk denote the probabilities hypothesized for k possible outcomes.

In n independent trials, let Y1, Y2, ..., Yk denote the observed counts of each outcome which are to be compared to the expected counts np1, np2, ..., npk.

The chi-square test statistic is qk-1 =

= (Y1 - np1)²  + (Y2 - np2)² + ... + (Yk - npk)²
  ----------    ----------          --------
     np1           np2                npk

H0 is rejected if this value exceeds the upper critical value of the (k-1) distribution, where is the desired level of significance.

As with most test statistics, the larger the difference between observed and expected, the larger the test statistic becomes.

The distribution of the test statistic under the null hypothesis is approximately the same as the theoretical chi-square distribution.

This means that once the chi-square value and the number of degrees of freedom is known, you the probability of getting that value of chi-square can be calculated using the chi-square distribution.

The number of degrees of freedom is the number of categories minus one.


Related Solutions

Which Hypotheses Tests use a Chi-squared distribution? Which Hypothesis Tests use an F distribution? What is...
Which Hypotheses Tests use a Chi-squared distribution? Which Hypothesis Tests use an F distribution? What is the rule for sample size calculations? What is the rule regarding the appropriate signs to be used in the null and alternative hypotheses?
Which Hypotheses Tests use a Chi-squared distribution? Which Hypothesis Tests use an F distribution? What is...
Which Hypotheses Tests use a Chi-squared distribution? Which Hypothesis Tests use an F distribution? What is the rule for sample size calculations? What is the rule regarding the appropriate signs to be used in the null and alternative hypotheses?
whaich of the following could be used to display a categorical distribution graddically? Explain your answers!...
whaich of the following could be used to display a categorical distribution graddically? Explain your answers! (a) histogram (b) pie chart (c) bar chart (d) stem and leaf diplay
Could you please explain to me when I should use the two proportion tests? A two...
Could you please explain to me when I should use the two proportion tests? A two proportion f test tests variance/standard deviation. So if it is testing how much the two proportions vary then isn't it doing what a linear regression t test does? Whats the difference between a linear regression t test and goodness of fit test statistic? Is there such thing as a two proportion chi squared test or is that just an f test? The only difference...
A two-variable model involving one quantitative explanatory variable and one categorical (binary) explanatory variable (and no...
A two-variable model involving one quantitative explanatory variable and one categorical (binary) explanatory variable (and no interaction), results in two regression lines that are: A.     Always parallel. B.     Could be parallel but, depending on the data, may not. C.      Never parallel. D.     Always horizontal. The two methods of including a binary categorical variable in a regression model are to use indicator coding or effect coding. For indicator coding in the two-variable model (with no interaction): A.     The binary variable is coded (-1,1) and the coefficient...
Hypothesis Testing: Use of Multiple Sample Hypothesis Tests In your chosen field, when might you want...
Hypothesis Testing: Use of Multiple Sample Hypothesis Tests In your chosen field, when might you want to know the mean differences between two or more groups? Please describe the situation, including how and why the hypothesis test would be used.
Explain confidence intervals and hypothesis tests in terms that a 4th grader could understand. Include 2–3...
Explain confidence intervals and hypothesis tests in terms that a 4th grader could understand. Include 2–3 tips and tricks for how you would determine which type of inference you are working with (confidence interval estimation or hypothesis testing). In addition, explain the concept of the p-value in hypothesis testing. Since the p-value is commonly output from data analysis software when conducting a hypothesis test and is reported in study results and articles, it is important to understand not only how...
Why is the use of OLS regression inappropriate when the dependent variable is dichotomous? explain one...
Why is the use of OLS regression inappropriate when the dependent variable is dichotomous? explain one technique you can use instead, clearly stating how to interpret the estimated regression coefficients.   
Nonparametric tests should not be used when ______. the associations being tested involve categorical variables the...
Nonparametric tests should not be used when ______. the associations being tested involve categorical variables the dependent variables are ordinal scales the assumptions of parametric tests are met the population distribution is heavily skewed The chi-square tests are used to analyze ______. Medians frequency of data continuous variables skewness The ______ tests are more powerful than the ______ tests, which means the ______ is higher for nonparametric tests. parametric; nonparametric; type II error nonparametric; parametric; type I error parametric; nonparametric;...
Use of Multiple Sample Hypothesis Tests In an hospital setting, when might you want to know...
Use of Multiple Sample Hypothesis Tests In an hospital setting, when might you want to know the mean differences between two or more groups? Please describe the situation, including how and why it would be used.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT