In the book Business Research Methods (5th ed.), Donald R.
Cooper and C. William Emory discuss studying the relationship
between on-the-job accidents and smoking. Cooper and Emory describe
the study as follows:
Suppose a manager implementing a smoke-free workplace policy is interested in whether smoking affects worker accidents. Since the company has complete reports of on-the-job accidents, she draws a sample of names of workers who were involved in accidents during the last year. A similar sample from among workers who had no reported accidents in the last year is drawn. She interviews members of both groups to determine if they are smokers or not.
The sample results are given in the following table.
On-the-Job Accident | ||||||
Smoker | Yes | No | Row Total | |||
Heavy | 12 | 8 | 20 | |||
Moderate | 9 | 10 | 19 | |||
Nonsmoker | 13 | 14 | 27 | |||
Column total | 34 | 32 | 66 | |||
Expected counts are below observed counts | ||||||
Accident | No Accident | Total | ||||
Heavy | 12 | 8 | 20 | |||
10.30 | 9.70 | |||||
Moderate | 9 | 10 | 19 | |||
9.79 | 9.21 | |||||
Nonsmoker | 13 | 14 | 27 | |||
13.91 | 13.09 | |||||
Total | 34 | 32 | 66 | |||
Chi-Sq = .83, DF = 2, P-Value = 0.660 | ||||||
(a) For each row and column total in the above
table, find the corresponding row/column percentage. (Round
your answers to 2 decimal places.)
Row 1 | % |
Row 2 | % |
Row 3 | % |
Column 1 | % |
Column 2 | % |
(b) For each cell in the above table, find the
corresponding cell, row, and column percentages. (Round
your answers to 2 decimal places.)
Accident | No Accident | ||
Heavy | Cell= % | Cell= % | |
Row= % | Row= % | ||
Column= % | Column= % | ||
Moderate | Cell= % | Cell= % | |
Row= % | Row= % | ||
Column= % | Column= % | ||
Nonsmoker | Cell= % | Cell= % | |
Row= % | Row= % | ||
Column= % | Column= % | ||
(c) Use the MINITAB output in the above to test
the hypothesis that the incidence of on-the-job accidents is
independent of smoking habits. Set α = .01.
(Click to select)Do not rejectReject
H0.
(d) Is there a difference in on-the-job
accident occurrences between smokers and nonsmokers?
Conclude there is (Click to select)no differencedifference between smokers and nonsmokers.
In: Statistics and Probability
Refer to the gasoline sales time series data in the given table.
Week | Sales (1,000s of gallons) |
1 | 17 |
2 | 21 |
3 | 19 |
4 | 23 |
5 | 18 |
6 | 16 |
7 | 20 |
8 | 18 |
9 | 22 |
10 | 20 |
11 | 15 |
12 | 22 |
(a) | Compute four-week and five-week moving averages for the time series. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
If required, round your answers to two decimal places. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||
(b) | Compute the MSE for the four-week and five-week moving average forecasts. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
If required, round your final answers to three decimal places. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
MSE for four-week moving average = | |||||||||||||||||||||||||||||||||||||||||||||||||||||
MSE for five-week moving average = | |||||||||||||||||||||||||||||||||||||||||||||||||||||
(c) | What appears to be the best number of weeks of past data (three, four, or five) to use in the moving average computation? Consider that the MSE for the three-week moving average is 10.222. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
- Select your answer -ThreeFourFive |
In: Statistics and Probability
The following frequency table summarizes a set of data. What is the five-number summary?
Value | Frequency |
---|---|
1 | 5 |
2 | 2 |
3 | 1 |
7 | 1 |
8 | 1 |
17 | 1 |
18 | 5 |
19 | 3 |
20 | 1 |
21 | 1 |
23 | 1 |
26 | 1 |
Select the correct answer below:
Min | Q1 | Median | Q3 | Max |
---|---|---|---|---|
1 | 7 | 8 | 21 | 26 |
Min | Q1 | Median | Q3 | Max |
---|---|---|---|---|
1 | 2 | 6 | 19 | 26 |
Min | Q1 | Median | Q3 | Max |
---|---|---|---|---|
1 | 3 | 22 | 21 | 26 |
Min | Q1 | Median | Q3 | Max |
---|---|---|---|---|
1 | 2 | 18 | 19 | 26 |
Min | Q1 | Median | Q3 | Max |
---|---|---|---|---|
1 | 4 | 5 | 18 | 26 |
In: Statistics and Probability
A company that produces computers claims that that the average life of one of their computers is 8 years. A sample of 12 computers shows a sample mean life of 7.8 years, with a population standard deviation of 0.2 years. Does the data suggest that the average life of one of the computers is not 8 years at 0.05 level significance? Assume computer lifetimes are normally distributed.
1. Give null and alternate hypothesis
2. Give test statistic and P value, state conclusion
3. State what type 1 and type 2 errors would be
In: Statistics and Probability
A sample of 30 commuters in the area of a certain city yielded the accompanying commute times, in minutes. Preliminary data analyses indicate that the t-interval procedure can reasonably be applied. Find and interpret a 95% confidence interval for the mean commute time of all commuters in the area of the city (note: xbar = 28.20 minutes and s = 9.39 minutes)
In: Statistics and Probability
Comparison/Contrast Outline
Attending a Historically Black College or University versus attending another type of institution of higher learning.
In: Statistics and Probability
In its 2018 State of the First Amendment survey, the Freedom Forum Institute found that 40 percent of respondents could not recall any of the freedoms protected by the First Amendment. You decide to build a distribution for how many respondents could not recall any of the First Amendments. You take a random sample of 10 Americans. 1. What are the assumptions of a binomial distribution? Does this example match those assumptions? 2. What is the probability that the sample has exactly n successes, for n=1,2,3…10? 3. Plot the probabilities that were calculated in problem 2. 4. Find the probability that the sample has at least 5 successes. 5. Find the probability that the sample has at most 3 successes.
In: Statistics and Probability
Many people believe that unusual behavior is more likely to occur during a full moon. As a test for empirical evidence to support this belief, suppose that you categorized the visits of new clients to a community health unit over a one-year period by lunar phases and found the following distribution of visits: Full moon 62, new moon 50, first quarter 60, and third quarter 56.
Answer the following questions:
1. What are the null and alternate hypotheses?
2. What are the expected values for each of the categories?
3. What is the chi-square obtained?
4. What is the critical value?
5. What is your statistical decision? Justify your answer.
6. What is your conclusion?
In: Statistics and Probability
A t statistic was used to conduct a test of the null hypothesis H0: µ = 11 against the alternative Ha: µ ≠ 11, with a p-value equal to 0.042. A two-sided confidence interval for µ is to be considered. Of the following, which is the largest level of confidence for which the confidence interval will NOT contain 11?
A 90% confidence level
A 92% confidence level
A 96% confidence level
A 97% confidence level
A 98% confidence level
In: Statistics and Probability
2) A company makes the housing for a mechanical component used in lawn mowers. The critical dimension, ?, of the housing is Beta distributed with the following parameters: ? = 12.7 inches; ? = 13.5 inches; ? = 5 and ? = 7. To be acceptable, however, the dimension should be between 12.8 and 13.3 inches.
Determine:
(a) The mean and standard deviation of the dimension.
(b) The probability that a randomly selected housing will be acceptable. (Hint: You can use Microsoft Excel or a similar software for the calculation step)
(c) The company mentioned assumes that the critical dimension is normally distributed with the mean and standard deviation matching the original distribution. What is the probability that a randomly selected housing will be acceptable under this different assumption? Comment on the discrepancy from the previous answer.
In: Statistics and Probability
In the Journal of Marketing Research (November 1996),
Gupta studied the extent to which the purchase behavior of
scanner panels is representative of overall brand
preferences. A scanner panel is a sample of households whose
purchase data are recorded when a magnetic identification card is
presented at a store checkout. The table below gives peanut butter
purchase data collected by the A. C. Nielson Company using a panel
of 2,500 households in Sioux Falls, South Dakota. The data were
collected over 102 weeks. The table also gives the market shares
obtained by recording all peanut butter purchases at the same
stores during the same period.
Brand | Size | Number of Purchases by Household Panel | Market Shares |
Jif | 18 oz. | 3,193 | 19.36% |
Jif | 28 | 1,876 | 7.84 |
Jif | 40 | 792 | 5.06 |
Peter Pan | 10 | 4,061 | 17.64 |
Skippy | 18 | 6,279 | 27.16 |
Skippy | 28 | 1,639 | 12.54 |
Skippy | 40 | 1,415 | 10.40 |
Total | 19,255 | ||
Goodness-of-Fit Test | |||||
obs | expected | O – E | (O – E)2/E | % of chisq | |
3,193 | 3,727.768 | -534.768 | 76.715 | 7.98 | |
1,876 | 1,509.592 | 366.408 | 88.935 | 9.25 | |
792 | 974.303 | -182.303 | 34.111 | 3.55 | |
4,061 | 3,396.582 | 664.418 | 129.969 | 13.51 | |
6,279 | 5,229.658 | 1,049.342 | 210.553 | 21.89 | |
1,639 | 2,414.577 | -775.577 | 249.120 | 25.90 | |
1,415 | 2,002.520 | -587.520 | 172.373 | 17.92 | |
19,255 | 19,255.000 | .000 | 961.776 | 100.00 | |
(a) Show that it is appropriate to carry out a
chi-square test.
Each expected value is ≥
(b) Test to determine whether the purchase
behavior of the panel of 2,500 households is consistent with the
purchase behavior of the population of all peanut butter
purchasers. Assume here that purchase decisions by panel members
are reasonably independent, and set α = .05.
(Round your answers
χ2to
2 decimal places and
χ2.05
to 3 decimal places.)
χ2χ2 | |
χ2.05χ.052 | |
In: Statistics and Probability
2. A researcher for Netflix wants to know if people have different preferences for two shows: Friends and The Office. The researcher recruits 7 people. Each person watches five minutes of Friends and rates the show on a scale of 0-10, where 0 means they hate it and 10 means they love it. Then each person watches five minutes of The Office rates the show on a scale of 0-10. Assume that you are working at the .05 level of significance. The researcher obtains the following data:
Subject |
Rating of Friends |
Rating of The Office |
1 |
7 |
10 |
2 |
2 |
1 |
3 |
4 |
6 |
4 |
9 |
7 |
5 |
5 |
4 |
6 |
1 |
5 |
7 |
6 |
8 |
In: Statistics and Probability
Let X1 and X2 be independent UNIF(0,1) random variables and consider the transformations Y1= X1X2 and Y2 =X1/X2. Find the joint pdf of Y1 and Y2 and indicate their joint support of Y1 and Y2. Show Work.
In: Statistics and Probability
A manufacturer makes ball bearings that are supposed to have a mean weight of 30 g. A retailer suspects that the mean weight is not 30g. The mean weight for a random sample of 16b ball bearings is 28.4g with a standard deviation of 4.5g. At the 0.05 significance level, test the claim that the sample comes from a population with a mean not equal to 30g. Find the critical value(s) and critical region. Identify the null and alternative hypotheses, test statistic, critical value(s) and critical region, as indicated, and state the final conclusion that addresses the problem. Show all seven steps.
In: Statistics and Probability
Consumer Reports provided extensive testing and ratings for more than 100 HDTVs. An overall score, based primarily on picture quality, was developed for each model. In general, a higher overall score indicates better performance. The following (hypothetical) data show the price and overall score for the ten 42-inch plasma televisions (Consumer Report data slightly changed here):
Brand |
Price (X) |
Score (Y) |
|||||
Dell |
2900 |
50 |
|||||
Hisense |
2800 |
52 |
|||||
Hitachi |
2700 |
45 |
|||||
JVC |
3500 |
60 |
|||||
LG |
3300 |
56 |
|||||
Maxent |
2000 |
30 |
|||||
Panasonic |
4200 |
68 |
|||||
Phillips |
3100 |
56 |
|||||
Proview |
2500 |
35 |
|||||
Samsung |
3000 |
48 |
|||||
Use the above data to develop and estimated regression equation and interpret the coefficients. Compute Coefficient of Determination and correlation coefficient and show their relation. Interpret the explanatory power of the model. Estimate the overall score for a 42-inch plasma television with a price of $3400. Finally, test the significance of the slope coefficient.
In: Statistics and Probability