Question

Question 4
Using the dataset below, estimate whether there is a difference among the graduation rates - identified as percentages - of five high schools over a 10-year period. Explain your results.
Yr     HS 1   HS 2   HS 3   HS 4   HS 5
2003   67     82     94     65     88
2004   68     87     78     65     87
2005   65     83     81     45     86
2006   68     73     76     57     88
2007   67     77     75     68     89
2008   71     74     81     76     87
2009   78     76     79     77     81
2010   76     78     89     72     78
2011   72     76     76     69     89
2012   77     86     77     58     87

Solutions

Expert Solution

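This is a hypothesis-testing problem comparing the mean graduation rates of 5 groups (μ1, μ2, μ3, μ4, μ5), one per high school.

The hypotheses will be

H0: μ1 = μ2 = μ3 = μ4 = μ5 (all five schools have the same mean graduation rate)
H1: at least one of the means differs from the others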

We shall use Tukey's HSD test to find which schools differ.

But before that, let us conduct a one-way ANOVA to find whether the overall F statistic is significant.

This tells us whether to go further: if the F test is significant, we proceed to Tukey's test to compare the sample means pairwise.

This is done as follows.

The p-value corresponding to the F-statistic of the one-way ANOVA is lower than 0.01, which strongly suggests that one or more pairs of treatments are significantly different.
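The one-way ANOVA step can be reproduced from the dataset in the question. The following is a minimal sketch in plain Python, not the online calculator used in this solution; the names `rates` and `one_way_anova` are illustrative. The standard library has no F distribution, so the p-value itself is not computed here. Assuming SciPy is available, `scipy.stats.f.sf(f_stat, df_between, df_within)` would give it.

```python
# Graduation rates (%) for 2003-2012, one list per high school (from the question).
rates = {
    "HS 1": [67, 68, 65, 68, 67, 71, 78, 76, 72, 77],
    "HS 2": [82, 87, 83, 73, 77, 74, 76, 78, 76, 86],
    "HS 3": [94, 78, 81, 76, 75, 81, 79, 89, 76, 77],
    "HS 4": [65, 65, 45, 57, 68, 76, 77, 72, 69, 58],
    "HS 5": [88, 87, 86, 88, 89, 87, 81, 78, 89, 87],
}


def one_way_anova(groups):
    """Return (F, MS_between, MS_within, df_between, df_within)."""
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]

    # Between-group and within-group sums of squares.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    df_between = len(groups) - 1                # k - 1
    df_within = len(all_values) - len(groups)   # N - k
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within           # this is the Mean Square Error
    return ms_between / ms_within, ms_between, ms_within, df_between, df_within


f_stat, ms_between, ms_within, df_between, df_within = one_way_anova(list(rates.values()))
print(f"F({df_between}, {df_within}) = {f_stat:.3f}, MSE = {ms_within:.4f}")
```

The F statistic comes out at about 17.73 on (4, 45) degrees of freedom, with a Mean Square Error of about 38.5444.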

So the overall F test is significant, and we reject the hypothesis that all five means are equal.

We now delve into finer details: since there is a significant difference, which pairs contribute to it?

This is where Tukey's HSD test is used.

We shall apply Tukey's HSD test to each of the 10 pairs to identify which of them exhibit a statistically significant difference.

To do so, we first establish the critical value of the Tukey-Kramer HSD Q statistic, based on k=5 treatments and ν=45 degrees of freedom for the error term (N - k = 50 - 5), at significance levels α = 0.01 and 0.05.

I took help from online statistical calculators to get these two values.

We obtain these critical values for Q, for α of 0.01 and 0.05, as

Q(α=0.01,k=5,ν=45) = 4.8928 and

Q(α=0.05,k=5,ν=45) = 4.0186, respectively.

Next, we compute a Tukey test statistic from our sample columns to compare with the appropriate critical value.

For each pair of columns i and j being compared, we calculate a parameter which we loosely call the Tukey-Kramer HSD Q-statistic, or simply the Tukey HSD Q-statistic, given as:

$$Q_{i,j} = \frac{|\bar{x}_i - \bar{x}_j|}{s_{i,j}}, \qquad s_{i,j} = \frac{\hat{\sigma}_\epsilon}{\sqrt{H_{i,j}}}, \qquad i \neq j$$

where $\bar{x}_i$ is the sample mean of column i and $H_{i,j}$ is the harmonic mean of the sample sizes of columns i and j.

Note that here the sample sizes in the columns are equal, and hence their harmonic mean is simply the common sample size, 10.

Also note that the quantity $\hat{\sigma}_\epsilon$ = 6.2084 is the square root of the Mean Square Error = 38.5444 determined in the precursor one-way ANOVA procedure.
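As a worked instance of these quantities, here is the statistic for one pair, HS 4 and HS 5. This is a sketch assuming the standard Tukey-Kramer form: the absolute difference of the two column means, divided by the square root of the Mean Square Error over the square root of the common sample size. The column means 65.2 and 86.0 are computed from the dataset in the question and are not stated in the original solution.

```latex
s_{4,5} = \frac{6.2084}{\sqrt{10}} \approx 1.9633,
\qquad
Q_{4,5} = \frac{|65.2 - 86.0|}{1.9633} \approx 10.59
```

Since 10.59 exceeds both critical values (4.0186 and 4.8928), this pair differs significantly at either level.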

Now we are sorted !!!

We now only need to find the Tukey HSD Q-statistic for each of the 10 pairs and compare it to the critical Q values found above for the two significance levels.

We find the corresponding p-value for each pairwise Q statistic from a p-value table for the studentized range distribution.

I attach below color-coded results (red for insignificant, green for significant) of evaluating whether Qi,j > Qcritical for all relevant pairs of treatments. The corresponding p-values (observed vs. critical) are also attached.

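The results will be

Pair            |mean diff|   Q       α=0.05 (Q > 4.0186)   α=0.01 (Q > 4.8928)
HS 1 vs HS 2    8.3           4.23    significant           insignificant
HS 1 vs HS 3    9.7           4.94    significant           significant
HS 1 vs HS 4    5.7           2.90    insignificant         insignificant
HS 1 vs HS 5    15.1          7.69    significant           significant
HS 2 vs HS 3    1.4           0.71    insignificant         insignificant
HS 2 vs HS 4    14.0          7.13    significant           significant
HS 2 vs HS 5    6.8           3.46    insignificant         insignificant
HS 3 vs HS 4    15.4          7.84    significant           significant
HS 3 vs HS 5    5.4           2.75    insignificant         insignificant
HS 4 vs HS 5    20.8          10.59   significant           significant

At α = 0.05, six of the ten pairs differ significantly: HS 1 vs HS 2, HS 1 vs HS 3, HS 1 vs HS 5, HS 2 vs HS 4, HS 3 vs HS 4 and HS 4 vs HS 5. At α = 0.01, HS 1 vs HS 2 drops out and the other five remain significant.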
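The pairwise comparison described in this solution can be sketched in plain Python. This is an illustrative reconstruction, not the calculator output the author attached: the critical values are the two quoted in the solution, the Mean Square Error is recomputed from the dataset, and the names `rates`, `Q_CRIT`, `mse_within` and `tukey_q` are hypothetical helpers. The statistic is assumed to take the standard Tukey-Kramer form.

```python
from itertools import combinations
from math import sqrt

# Graduation rates (%) for 2003-2012, one list per high school (from the question).
rates = {
    "HS 1": [67, 68, 65, 68, 67, 71, 78, 76, 72, 77],
    "HS 2": [82, 87, 83, 73, 77, 74, 76, 78, 76, 86],
    "HS 3": [94, 78, 81, 76, 75, 81, 79, 89, 76, 77],
    "HS 4": [65, 65, 45, 57, 68, 76, 77, 72, 69, 58],
    "HS 5": [88, 87, 86, 88, 89, 87, 81, 78, 89, 87],
}

# Critical Q values quoted in the solution for k=5 treatments, nu=45.
Q_CRIT = {0.05: 4.0186, 0.01: 4.8928}


def mean(xs):
    return sum(xs) / len(xs)


def mse_within(groups):
    """Mean Square Error of the one-way ANOVA (within-group SS / (N - k))."""
    ss = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df = sum(len(g) for g in groups) - len(groups)
    return ss / df


def tukey_q(a, b, mse):
    """Tukey-Kramer Q: |mean difference| / (sqrt(MSE) / sqrt(harmonic mean of sizes))."""
    h = 2 / (1 / len(a) + 1 / len(b))
    return abs(mean(a) - mean(b)) / sqrt(mse / h)


mse = mse_within(list(rates.values()))
results = {}
for (name_a, a), (name_b, b) in combinations(rates.items(), 2):
    q = tukey_q(a, b, mse)
    results[(name_a, name_b)] = q
    flags = ", ".join(
        f"alpha={alpha}: {'significant' if q > crit else 'insignificant'}"
        for alpha, crit in Q_CRIT.items()
    )
    print(f"{name_a} vs {name_b}: Q = {q:.2f} ({flags})")
```

Because every school has 10 observations, the harmonic mean in `tukey_q` reduces to 10; the function is written for the general case so that it would also handle unequal sample sizes.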

