Question

In: Statistics and Probability

Scenario 2: Car crashes ~ Randomly selected deaths from car crashed were obtained and the results...

Scenario 2: Car crashes ~ Randomly selected deaths from car crashed were obtained and the results are shown in the table below:

Day	Sun	Mon	Tue	Wed	Thu	Fri	Sat
Fatalities	132	108	95	99	115	132	138

State officials wonder if car crash fatalities occur with unequal frequency on different days of the week. They represent this research question in terms of two competing hypotheses:

H0: Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.

Ha: Car crashes do not occur with equal frequency throughout the week and that any uneven results in sample data are reflect true difference in crash rates across the days of the week.

1.If we assume the null hypothesis H_0H0 to be true, what would be the expected number of fatalities each day of the week? ....the expected value of the test statistic?

2. To decide which of the hypotheses to continue believing, the researchers conducted a \chi^2χ2 Goodness of Fit test. The resulting test statistic representing departures from expected counts across the whole table was \chi^2 = 15.2479χ2=15.2479 with a corresponding p-value of 0.01840.0184. Which day of the week contributed the most toward this overall test statistic?

3. Based on the p-value given above, what would be your conclusion to this hypothesis test at the \alpha = 0.01α=0.01 level?

A)Reject the null hypothesis.

B)Fail to reject the null hypothesis.

Expert Solution

1. If we assume the null hypothesis H₀ to be true, what would be the expected number of fatalities each day of the week? What is the expected value of the test statistic?

Solution:

Here, we have to use chi square test of goodness of fit.

Null hypothesis: H₀: Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.

Alternative hypothesis: H_a: Car crashes do not occur with equal frequency throughout the week and that any uneven results in sample data are reflect true difference in crash rates across the days of the week.

We are given level of significance = α = 0.01

Test statistic formula is given as below:

Chi square = ∑[(O – E)^2/E]

Where, O is observed frequencies and E is expected frequencies.

We are given

N = 7

Degrees of freedom = df = N – 1 = 7 – 1 = 6

α = 0.01

Critical value = 16.81189

(by using Chi square table or excel)

Calculation tables for test statistic are given as below:

Day	O	E	(O - E)^2	(O - E)^2/E
Sun	132	117	225	1.923076923
Mon	108	117	81	0.692307692
Tue	95	117	484	4.136752137
Wed	99	117	324	2.769230769
Thu	115	117	4	0.034188034
Fri	132	117	225	1.923076923
Sat	138	117	441	3.769230769
Total	819	819		15.24786325

Expected number of fatalities = ∑O/N = 819/7 = 117

Chi square = ∑[(O – E)^2/E] = 15.24786325

Expected value of test statistic = 15.24786325

2. Which day of the week contributed the most toward this overall test statistic?

Tuesday, because the chi square contribution for this day is most among all given days.

(See above table)

3. Based on the p-value given above, what would be your conclusion to this hypothesis test at the α = 0.01 level?

Answer: B)Fail to reject the null hypothesis.

Explanation:

We have

P-value = 0.018414068

(By using Chi square table or excel)

P-value > α = 0.01

So, we do not reject the null hypothesis

There is sufficient evidence to conclude that Car crashes occur with equal frequency throughout the week and that any uneven results in sample data are negligible and attributable to random chance.

orchestra answered 3 years ago

In a statistic class, 11 scores were randomly selected with the following results were obtained: 68,...

In a statistic class, 11 scores were randomly selected with the following results were obtained: 68, 74, 66, 37, 52, 71, 90, 65, 76, 73, 22. What are the outer fences? A)-6.0, 140.0 B)-10.0, 92.0 C)2.0, 128.0 D)2.0, 162.0 E)37.0, 107.0

Randomly selected birth records were obtained and results arelisted below. Use a 0.01 significance level to...

Randomly selected birth records were obtained and results arelisted below. Use a 0.01 significance level to test the reasonableclaim that births occur with equal frequency on the different daysof the week. How might the apparent lower frequency on Saturday andSunday be explained? Day Sun Mon Tues Wed Thurs Fri Sat Births 43 63 59 58 56 57 45

Suppose IQ scores were obtained from randomly selected twinstwins. For 2020 such pairs of people, the...

Suppose IQ scores were obtained from randomly selected twinstwins. For 2020 such pairs of people, the linear correlation coefficient is 0.8670.867 and the equation of the regression line is ModifyingAbove y with caret equals 5.44 plus 0.93 xy=5.44+0.93x, where x represents the IQ score of the twin born secondtwin born second. Also, the 2020 x values have a mean of 96.2696.26 and the 2020 y values have a mean of 94.9594.95. What is the best predicted IQ of the twin...

Suppose IQ scores were obtained from randomly selected twins . For 20 such pairs of people,...

Suppose IQ scores were obtained from randomly selected twins . For 20 such pairs of people, the linear correlation coefficient is 0.937 and the equation of the regression line is ModifyingAbove y with caret equals negative 16.5 plus 1.16 x , where x represents the IQ score of the twin born second . Also, the 20 x values have a mean of 101.81 and the 20 y values have a mean of 101.8 . What is the best predicted IQ...

Different hotels in a certain area are randomly selected, and their ratings and prices were obtained...

Different hotels in a certain area are randomly selected, and their ratings and prices were obtained online. Using technology, with x representing the ratings and y representing price, we find that the regression equation has a slope of 125 and a y-intercept of negative 374. Complete parts (a) and (b) below. a. What is the equation of the regression line? Select the correct choice below and fill in the answer boxes to complete your choice. A. yequals nothingplusleft parenthesis nothing...

Records of randomly selected births were obtained and categorised based on the day of the week...

Records of randomly selected births were obtained and categorised based on the day of the week the birth occurred, which is provided below. Since babies are unaware of the tradition work week, it would be reasonable to assume the distribution of births would occur equally on each of the days of the week. At 1% level of significance, test the claim. Can you think of an explanation for the result? Day Sunday Monday Tuesday Wednesday Thursday Friday Saturday Number of...

The age for race car drivers were selected at random. The following ages were obtained: 32...

The age for race car drivers were selected at random. The following ages were obtained: 32 32 33 33 41 29 38 32 33 23 27 45 52 29 and 25. (Give three decimals for all number answers.) 1. What is the average for this sample? 2. What is the standard deviation for this sample? Use a .05 significance level to test the claim that the mean age of all race drivers is greater than 30 years. 3. What is the significance level? ...

Scenario 2: In a city of 100,000 people, there were last year 1000 deaths and 1500...

Scenario 2: In a city of 100,000 people, there were last year 1000 deaths and 1500 live births (none were multiple births), of which 1200 infants survived to see their first birthday. Three mothers died in childbirth. Using the Week 3 PowerPoint for definitions, calculate the answers to the questions below. (http://adph.org/healthstats/assets/Formulas.pdf ) Infant mortality ratio (2 points): Crude death rate (2 points): Crude birth rate (2 points): What possible conclusions could you draw from these rates, about the overall...

A study was conducted on a sample randomly selected from city A. The results are presented...

A study was conducted on a sample randomly selected from city A. The results are presented in the table. Based on the information in the table answer the below questions: Age groups Number of participants Proportion of female Relative frequency of participants 15 years or younger 10 0.4 16-25 years 25 0.6 26-30 years 10 1.0 31 years or older 2 1.0 Are the age groups mutually exclusive? Based on the number of participants, can you argue for or against...

The following table summarizes results from 985 pedestrian deaths that were caused by accidents (based on...

The following table summarizes results from 985 pedestrian deaths that were caused by accidents (based on data from the National Highway Traffic Safety Administration). If the pedestrian was intoxicated, find the probability that the driver was intoxicated. Round to two decimals. pedestrian intoxicated yes no driver intoxicated yes 59 79 no 266 581