Question

In: Statistics and Probability

1. Seat Belt Use ~   The U.S. Department of Transportation collected seat belt use data by...

1. Seat Belt Use ~   The U.S. Department of Transportation collected seat belt use data by stationing observers at randomly selected roadway sites and recording the number of vehicle occupants who were wearing seat belts. A random sample of 1000 vehicle occupants in the Northeast shows that 909 were wearing seat belts, while a random sample of 1000 vehicle occupants in the Midwest showed that 855 were wearing seat belts.

CREDIT ONLY GIVEN IF WORK IS SHOWN. ANSWERS WITHOUT WRITTEN WORK WILL NOT BE GRADED.

1a. Let p1​ and p2​ denote the population proportion of occupants wearing seat belts in the Northeast and Midwest, respectively. Which of the following is the appropriate set of hypotheses to test if the rate at which Northeast drivers wear seat belts is higher than those in the Midwest

A. H0​:p1​>p2​ vs. Ha​:p1​≤p2​

B. H0​:p1​−p2​=−0.054 vs. Ha​:p1​−p2​<−0.054

C. H0​:p1​=p2​ vs.Ha​:p1​<p2​

D. H0​:p^​1​=p^​2​ vs. Ha​:p^​1​<p^​2​

E. None of these is the correct set of hypotheses.

1b. The null hypothesis H0​ believes that the rate at which drivers in the Midwest and Northeast wear seat belts is the same, and that any observed difference across the two samples is attributable to random chance alone. If this was, in fact, the case, what is the best estimate of the rate at which drivers wear seat belts, regardless of whether they live in the Northeast or Midwest? Submit your answer out to four decimal places.

1c. The study results give phat^​1​−phat^​2​ of -0.054. Approximately how far off, on average, would we expect this sample statistic to be from the parameter of interest, p1​−p2​? Submit your answer out to four decimals.

1d. Which of the following is a correct interpretation of the quantity you computed in Q3?

A In repeated samples, the quantity in Q3 tells us how often we would expect our estimate of \hat{p}_2-\hat{p}_2p^​2​−p^​2​ to be equal to 0 approximately.

B Since this quantity is less than 0.05, we will reject the null hypothesis.

C The quantity in Q3 tells us, over repeated samples, how much we would expect our estimate to be, on average, away from the true population difference in proportions.

D The quantity in Q3 tells us how often any estimate of the difference in the population proportions will be wrong.

1e. Compute the test statistic and p-value associated with these hypotheses. Submit your answers for both out to at least six decimal places.

1f. Using the p-value from Q5 and a significance level of α = 0.05, what do you conclude?

A i. There is insufficient evidence to reject the null hypothesis. It appears the rate is higher in the Northeast.

B ii. There is sufficient evidence to reject the null hypothesis. It appears the rate is higher in the Northeast.

C iii. There is insufficient evidence to reject the null hypothesis. It appears the rate is not higher in the Northeast.

D iv. There is sufficient evidence to reject the null hypothesis. It appears the rate is not higher in the Northeast.

Solutions

Expert Solution


Related Solutions

Scenario 1: Seat Belt Use ~   The U.S. Department of Transportation collected seat belt use data...
Scenario 1: Seat Belt Use ~   The U.S. Department of Transportation collected seat belt use data by stationing observers at randomly selected roadway sites and recording the number of vehicle occupants who were wearing seat belts. A random sample of 1000 vehicle occupants in the Northeast shows that 909 were wearing seat belts, while a random sample of 1000 vehicle occupants in the Midwest showed that 855 were wearing seat belts. The null hypothesis H_0H 0 ​ believes that the...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on the number of fatal accidents per 1000 licenses and the percentage of licensed drivers under the age of 21 in a sample of 42 cities. Data collected over a one-year period follow. These data are contained in the file named “Safety.csv”. 1- Find the sample mean and standard deviation for each variable. Round your answers to the nearest thousandth. 2- Use the function lm()...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on the number of fatal accidents per 1000 licenses and the percentage of licensed drivers under the age of 21 in a sample of 42 cities. Data collected over a one-year period follow. These data are contained in the file named “Safety.csv”. 1- Find the sample mean and standard deviation for each variable. Round your answers to the nearest thousandth. 2- Use the function lm()...
An insurance company collects data on​ seat-belt use among drivers in a country. Of 1300 drivers...
An insurance company collects data on​ seat-belt use among drivers in a country. Of 1300 drivers 30-39 years​ old, 22​% said that they buckle​ up, where as 384 of 1200 drivers 55-64 years old said that they did. At the 10% significance​ level, do the data suggest that there is a difference in​ seat-belt use between drivers 30-39 years old and those 55-64? Calculate the test statistic z=
An insurance company collects data on​ seat-belt use among drivers in a country. Of 2000 drivers...
An insurance company collects data on​ seat-belt use among drivers in a country. Of 2000 drivers 30 -39 years​ old, 23​% said that they buckle​ up, whereas 482 of 1800 drivers 55-64 years old said that they did. Find a 90​% confidence interval for the difference between the proportions of​ seat-belt users for drivers in the age groups 30-39 years and 55-64 years. Construct a 90​% confidence interval.
The U.S. Department of Transportation reported that in a large study of mobile phone use while...
The U.S. Department of Transportation reported that in a large study of mobile phone use while driving conducted in 2015, 4.4% of female drivers observed and 3.5% of male drivers observed were using a handheld mobile phone while driving.† Suppose that these percentages were based on observations from independent random samples of 1,300 male drivers and 1,300 female drivers (the actual sample sizes were much larger). Assume that it is reasonable to regard these samples as representative of male drivers...
Here's data on randomly sampled seat belt users in New York City where seat belts are...
Here's data on randomly sampled seat belt users in New York City where seat belts are required and in Boston where they are not required. Drivers Wearing Seat Belts NYC 220 183 Boston 117 68 Let p1 = proportion of NYC drivers wearing seat belts p2 = proportion of Boston drivers wearing seat belts Find
A recent study of seat belt use found that for a random sample of 117 female...
A recent study of seat belt use found that for a random sample of 117 female Hispanic drivers in Boston; 68 were wearing seat belts. Is this sufficient evidence to claim that the proportion of all female Hispanic drivers in Boston who wear seat belts is greater than 50%. Use a significance level of 5%. This problem is about (circle the correct one): One population proportion                      Two population proportions         One population mean                             Two populations means (Independent samples) One population standard...
A study of seat belt users and nonusers yielded the randomly selected sample data summarized in...
A study of seat belt users and nonusers yielded the randomly selected sample data summarized in the accompanying table. Use a 0.05 significance level to test the claim that the amount of smoking is independent of seat belt use. A plausible theory is that people who smoke are less concerned about their health and safety and are therefore less inclined to wear seat belts. Is this theory supported by the sample​ data? Number of Cigarettes Smoked per Day 0 1–14...
A study of seat belt users and nonusers yielded the randomly selected sample data summarized in...
A study of seat belt users and nonusers yielded the randomly selected sample data summarized in the given table. Number of cigarettes smoked per day 0 1-14 15 and over Wear seat belts 185 30 50 Don't wear seat belts 155 25 55 At the 0.01 significance level, test the claim that the amount of smoking is independent of seat belt use. Which of the following is true? Select one: a. The expected value of the test statistics is 13.50...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT