Question

In: Statistics and Probability

The data below show sport preference and age of participant from a random sample of members...

The data below show sport preference and age of participant from a random sample of members of a sports club. Test if sport preference is independent of age at the 0.02 significant level.

H0: Sport preference is independent of age
Ha: Sport preference is dependent on age

18-25 26-30 31-40 41 and over
Tennis 40 60 58 44
Swimming 59 76 50 66
Basketball 73 61 67 53

a. Complete the table: Give all answers as decimals rounded to 4 places.

Observed
Frequency
Expected
Frequency
(O−E)2E(O-E)2E  
40
60
58
44
59
76
50
66
73
61
67
53
Total

(b) What is the chi-square test-statistic for this data?
      Test Statistic: χ2=χ2=  


(d) The p-value is...

  • greater than αα
  • less than (or equal to) αα

(e) The p-value leads to a decision to...

  • reject the null
  • fail to reject the null

(f) What is the final conclusion?

  • There is not sufficient evidence to conclude sport preference is dependent on age.
  • There is sufficient evidence to conclude sport preference is dependent on age.

Solutions

Expert Solution

Given table data is as below
MATRIX col1 col2 col3 col4 TOTALS
row 1 40 60 58 44 202
row 2 59 76 50 66 251
row 3 73 61 67 53 254
TOTALS 172 197 175 163 N = 707
------------------------------------------------------------------

calculation formula for E table matrix
E-TABLE col1 col2 col3 col4
row 1 row1*col1/N row1*col2/N row1*col3/N row1*col4/N
row 2 row2*col1/N row2*col2/N row2*col3/N row2*col4/N
row 3 row3*col1/N row3*col2/N row3*col3/N row3*col4/N
------------------------------------------------------------------

expected frequencies calculated by applying E - table matrix formulae
E-TABLE col1 col2 col3 col4
row 1 49.143 56.286 50 46.571
row 2 61.064 69.939 62.129 57.868
row 3 61.793 70.775 62.871 58.56
------------------------------------------------------------------

calculate chisquare test statistic using given observed frequencies, calculated expected frequencies from above
Oi Ei Oi-Ei (Oi-Ei)^2 (Oi-Ei)^2/Ei
40 49.143 -9.143 83.594 1.701
60 56.286 3.714 13.794 0.245
58 50 8 64 1.28
44 46.571 -2.571 6.61 0.142
59 61.064 -2.064 4.26 0.07
76 69.939 6.061 36.736 0.525
50 62.129 -12.129 147.113 2.368
66 57.868 8.132 66.129 1.143
73 61.793 11.207 125.597 2.033
61 70.775 -9.775 95.551 1.35
67 62.871 4.129 17.049 0.271
53 58.56 -5.56 30.914 0.528
ᴪ^2 o = 11.656

------------------------------------------------------------------

set up null vs alternative as
null, Ho: no relation b/w X and Y OR X and Y are independent
alternative, H1: exists a relation b/w X and Y OR X and Y are dependent
level of significance, α = 0.02
from standard normal table, chi square value at right tailed, ᴪ^2 α/2 =15.033
since our test is right tailed,reject Ho when ᴪ^2 o > 15.033
we use test statistic ᴪ^2 o = Σ(Oi-Ei)^2/Ei
from the table , ᴪ^2 o = 11.656
critical value
the value of |ᴪ^2 α| at los 0.02 with d.f (r-1)(c-1)= ( 3 -1 ) * ( 4 - 1 ) = 2 * 3 = 6 is 15.033
we got | ᴪ^2| =11.656 & | ᴪ^2 α | =15.033
make decision
hence value of | ᴪ^2 o | < | ᴪ^2 α | and here we do not reject Ho
ᴪ^2 p_value =0.07


ANSWERS
---------------
a. null, Ho: no relation b/w X and Y OR X and Y are independent
alternative, H1: exists a relation b/w X and Y OR X and Y are dependent
b. test statistic: 11.656
c. critical value: 15.033
d. p-value:0.07

e. p value is greater than alpha value
decision: do not reject Ho

f. we do not have enough evidence to support the claim that   sport preference is dependent on age.


Related Solutions

The data below show sport preference and age of participant from a random sample of members...
The data below show sport preference and age of participant from a random sample of members of a sports club. Is there evidence to suggest that they are related? Frequencies of Sport Preference and Age Tennis Swimming Basketball 18-25 88 94 74 26-30 107 84 91 31-40 79 62 55 Over 40 76 71 43 What can be concluded at the α = 0.10 significance level? What is the correct statistical test to use? Paired t-test Homogeneity Goodness-of-Fit Independence What...
The data below show sport preference and age of participant from a random sample of members...
The data below show sport preference and age of participant from a random sample of members of a sports club. Test if sport preference is independent of age at the 0.02 significant level. H0: Sport preference is independent of age Ha: Sport preference is dependent on age 18-25 26-30 31-40 41 and over Tennis 43 60 56 44 Swimming 58 76 50 63 Basketball 74 61 65 49 a. Complete the table: Give all answers as decimals rounded to 4...
The data below show sport preference and age of participant from a random sample of members...
The data below show sport preference and age of participant from a random sample of members of a sports club. Test if sport preference is independent of age at the 0.05 significant level. H0: Sport preference is independent of age Ha: Sport preference is dependent on age 18-25 26-30 31-40 41 and over Tennis 44 59 59 47 Swimming 57 77 46 66 Basketball 70 58 66 53 a. Complete the table: Give all answers as decimals rounded to 4...
The following data show the body temperatures (in degrees Fahrenheit) from a random sample of 12...
The following data show the body temperatures (in degrees Fahrenheit) from a random sample of 12 independent healthy people in the United States. 98.5 98.2 99.1 96.6 98.1 98.7 97.5 99.0 97.4 98.3 97.8 98.2 What would be the estimate of the average body temperature of healthy people in the United States. Find a 95% confidence interval for the average body temperature of healthy people in the United States. Explain carefully the interval you found in part (b) means. can...
The sample data below have been collected based on a simple random sample from a normally...
The sample data below have been collected based on a simple random sample from a normally distributed population. Complete parts a and b. 5 4 0 5 7 6 9 0 8 4 a. Compute a 98% confidence interval estimate for the population mean. The 98% confidence interval for the population mean is from to . (Round to two decimal places as needed. Use ascending order.) b. Show what the impact would be if the confidence level is increased to...
A random sample from a normal population is obtained, and the data are given below. Find...
A random sample from a normal population is obtained, and the data are given below. Find a 90% confidence interval for . 114 157 203 257 284 299 305 344 378 410 421 450 478 480 512 533 545 What is the upper bound of the confidence interval (round off to the nearest integer)?
The data resulting from a random sample of 5 observations are shown below. Y is the...
The data resulting from a random sample of 5 observations are shown below. Y is the dependent variable, and X1 and X2 are the independent variables. Observation Y X1 X2 1 87 95 11 2 86 94 11 3 84 94 11 4 83 93 12 5 84 93 12 Use Excel's Regression tool to answer the following questions. To copy the data set, highlight the table, press Ctrl-c, click on the Excel cell to which you want to copy,...
The data below is a random sample of 3 observations drawn from the United States population....
The data below is a random sample of 3 observations drawn from the United States population. Use the data to answer the following questions i. Find 95% confidence intervals of the population mean of experience and wage. ii. Estimate ρe,w, the correlation between the variables experience and wage. iii. Find βˆ 1 and βˆ 0, the estimates of the parameters in the following regression equation wage = β0 + β1education + ϵ iv. Predict wages for a person with 15...
The data below is the mileage (thousands of miles) and age of your cars as sample....
The data below is the mileage (thousands of miles) and age of your cars as sample. Year Miles Age 2017    8.5    1 2009 100.3    9 2014   32.7    4 2004 125.0   14 2003 115.0   15 2011   85.5    7 2012   23.1    6 2012   45.0    6 2004 123.0   14 2013   51.2    5 2013 116.0    5 2009 110.0    9 2003 143.0   15 2017   12.0    1 2005 180.0   13 2008 270.0   10 Please include appropriate Minitab Results when important a. Identify terms in the simple...
Two random samples were drawn from members of the U.S. Congress. One sample was taken from...
Two random samples were drawn from members of the U.S. Congress. One sample was taken from members who are Democrats and the other from members who are Republicans. For each sample, the number of dollars spent on federal projects in each congressperson's home district was recorded. Dollars Spent on Federal Projects in Home Districts Party Less than 5 Billion 5 to 10 Billion More than 10 billion Row Total Democratic 6 16 23 45 Republican 11 17 19 47 Column...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT