Question

In: Statistics and Probability

Scenario 2: Car crashes ~ Randomly selected deaths from car crashed were obtained and the results...

Scenario 2: Car crashes ~ Randomly selected deaths from car crashed were obtained and the results are shown in the table below:

Day

Sun

Mon

Tue

Wed

Thu

Fri

Sat

Fatalities

132

108

95

99

115

132

138

State officials wonder if car crash fatalities occur with unequal frequency on different days of the week. They represent this research question in terms of two competing hypotheses:

H0​: Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.

Ha​: Car crashes do not occur with equal frequency throughout the week and that any uneven results in sample data are reflect true difference in crash rates across the days of the week.

1.If we assume the null hypothesis H_0H0​ to be true, what would be the expected number of fatalities each day of the week? ....the expected value of the test statistic?

2. To decide which of the hypotheses to continue believing, the researchers conducted a \chi^2χ2 Goodness of Fit test. The resulting test statistic representing departures from expected counts across the whole table was \chi^2 = 15.2479χ2=15.2479 with a corresponding p-value of 0.01840.0184. Which day of the week contributed the most toward this overall test statistic?

3. Based on the p-value given above, what would be your conclusion to this hypothesis test at the \alpha = 0.01α=0.01 level?

A)Reject the null hypothesis.

B)Fail to reject the null hypothesis.

Solutions

Expert Solution

1. If we assume the null hypothesis H0 to be true, what would be the expected number of fatalities each day of the week? What is the expected value of the test statistic?

Solution:

Here, we have to use chi square test of goodness of fit.

Null hypothesis: H0: Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.

Alternative hypothesis: Ha: Car crashes do not occur with equal frequency throughout the week and that any uneven results in sample data are reflect true difference in crash rates across the days of the week.

We are given level of significance = α = 0.01

Test statistic formula is given as below:

Chi square = ∑[(O – E)^2/E]

Where, O is observed frequencies and E is expected frequencies.

We are given

N = 7

Degrees of freedom = df = N – 1 = 7 – 1 = 6

α = 0.01

Critical value = 16.81189

(by using Chi square table or excel)

Calculation tables for test statistic are given as below:

Day

O

E

(O - E)^2

(O - E)^2/E

Sun

132

117

225

1.923076923

Mon

108

117

81

0.692307692

Tue

95

117

484

4.136752137

Wed

99

117

324

2.769230769

Thu

115

117

4

0.034188034

Fri

132

117

225

1.923076923

Sat

138

117

441

3.769230769

Total

819

819

15.24786325

Expected number of fatalities = ∑O/N = 819/7 = 117

Chi square = ∑[(O – E)^2/E] = 15.24786325

Expected value of test statistic = 15.24786325

2. Which day of the week contributed the most toward this overall test statistic?

Tuesday, because the chi square contribution for this day is most among all given days.

(See above table)

3. Based on the p-value given above, what would be your conclusion to this hypothesis test at the α = 0.01 level?

Answer: B)Fail to reject the null hypothesis.

Explanation:

We have

P-value = 0.018414068

(By using Chi square table or excel)

P-value > α = 0.01

So, we do not reject the null hypothesis

There is sufficient evidence to conclude that Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.


Related Solutions

In a statistic class, 11 scores were randomly selected with the following results were obtained: 68,...
In a statistic class, 11 scores were randomly selected with the following results were obtained: 68, 74, 66, 37, 52, 71, 90, 65, 76, 73, 22. What are the outer fences? A)-6.0, 140.0 B)-10.0, 92.0 C)2.0, 128.0 D)2.0, 162.0 E)37.0, 107.0
Randomly selected birth records were obtained and results arelisted below. Use a 0.01 significance level to...
Randomly selected birth records were obtained and results arelisted below. Use a 0.01 significance level to test the reasonableclaim that births occur with equal frequency on the different daysof the week. How might the apparent lower frequency on Saturday andSunday be explained? Day Sun Mon Tues Wed Thurs Fri Sat Births 43 63 59 58 56 57 45
Suppose IQ scores were obtained from randomly selected twinstwins. For 2020 such pairs of​ people, the...
Suppose IQ scores were obtained from randomly selected twinstwins. For 2020 such pairs of​ people, the linear correlation coefficient is 0.8670.867 and the equation of the regression line is ModifyingAbove y with caret equals 5.44 plus 0.93 xy=5.44+0.93x​, where x represents the IQ score of the twin born secondtwin born second. ​Also, the 2020 x values have a mean of 96.2696.26 and the 2020 y values have a mean of 94.9594.95. What is the best predicted IQ of the twin...
Suppose IQ scores were obtained from randomly selected twins . For 20 such pairs of​ people,...
Suppose IQ scores were obtained from randomly selected twins . For 20 such pairs of​ people, the linear correlation coefficient is 0.937 and the equation of the regression line is ModifyingAbove y with caret equals negative 16.5 plus 1.16 x ​, where x represents the IQ score of the twin born second . ​Also, the 20 x values have a mean of 101.81 and the 20 y values have a mean of 101.8 . What is the best predicted IQ...
Different hotels in a certain area are randomly​ selected, and their ratings and prices were obtained...
Different hotels in a certain area are randomly​ selected, and their ratings and prices were obtained online. Using​ technology, with x representing the ratings and y representing​ price, we find that the regression equation has a slope of 125 and a​ y-intercept of negative 374. Complete parts​ (a) and​ (b) below. a. What is the equation of the regression​ line? Select the correct choice below and fill in the answer boxes to complete your choice. A. yequals nothingplusleft parenthesis nothing...
Records of randomly selected births were obtained and categorised based on the day of the week...
Records of randomly selected births were obtained and categorised based on the day of the week the birth occurred, which is provided below. Since babies are unaware of the tradition work week, it would be reasonable to assume the distribution of births would occur equally on each of the days of the week. At 1% level of significance, test the claim. Can you think of an explanation for the result? Day Sunday Monday Tuesday Wednesday Thursday Friday Saturday Number of...
The age for race car drivers were selected at random. The following ages were obtained: 32...
The age for race car drivers were selected at random. The following ages were obtained: 32 32 33 33 41 29 38 32 33 23 27 45 52 29 and 25. (Give three decimals for all number answers.) 1.  What is the average for this sample? 2.  What is the standard deviation for this sample? Use a .05 significance level to test the claim that the mean age of all race drivers is greater than 30 years. 3.  What is the significance level?  ...
Scenario 2: In a city of 100,000 people, there were last year 1000 deaths and 1500...
Scenario 2: In a city of 100,000 people, there were last year 1000 deaths and 1500 live births (none were multiple births), of which 1200 infants survived to see their first birthday. Three mothers died in childbirth. Using the Week 3 PowerPoint for definitions, calculate the answers to the questions below. (http://adph.org/healthstats/assets/Formulas.pdf ) Infant mortality ratio (2 points): Crude death rate (2 points): Crude birth rate (2 points): What possible conclusions could you draw from these rates, about the overall...
A study was conducted on a sample randomly selected from city A. The results are presented...
A study was conducted on a sample randomly selected from city A. The results are presented in the table. Based on the information in the table answer the below questions: Age groups Number of participants Proportion of female Relative frequency of participants 15 years or younger 10 0.4   16-25 years 25 0.6   26-30 years 10 1.0   31 years or older 2 1.0   Are the age groups mutually exclusive? Based on the number of participants, can you argue for or against...
The following table summarizes results from 985 pedestrian deaths that were caused by accidents (based on...
The following table summarizes results from 985 pedestrian deaths that were caused by accidents (based on data from the National Highway Traffic Safety Administration). If the pedestrian was intoxicated, find the probability that the driver was intoxicated. Round to two decimals. pedestrian intoxicated yes no driver intoxicated yes 59 79 no 266 581
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT