Refer to the Baseball 2016 data, which reports information on the 2016 Major League Baseball season. Let attendance be the dependent variable and total team salary be the independent variable. Determine the regression equation and answer the following questions.
Draw a scatter diagram. From the diagram, does there seem to be a direct relationship between the two variables?
What is the expected attendance for a team with a salary of $100.0 million?
If the owners pay an additional $30 million, how many more people could they expect to attend?
At the .05 significance level, can we conclude that the slope of the regression line is positive? Conduct the appropriate test of hypothesis.
What percentage of the variation in attendance is accounted for by salary?
Determine the correlation between attendance and team batting average and between attendance and team ERA. Which is stronger? Conduct an appropriate test of hypothesis for each set of variables.
Show all work in Excel
| Team | League | Year Opened | Team Salary | Attendance | Wins | ERA | BA | HR | Year | Average salary | ||
| Arizona | National | 1998 | 65.80 | 2080145 | 79 | 4.04 | 0.264 | 154 | 2000 | 1988034 | ||
| Atlanta | National | 1996 | 89.60 | 2001392 | 67 | 4.41 | 0.251 | 100 | 2001 | 2264403 | ||
| Baltimore | American | 1992 | 118.90 | 2281202 | 81 | 4.05 | 0.250 | 217 | 2002 | 2383235 | ||
| Boston | American | 1912 | 168.70 | 2880694 | 78 | 4.31 | 0.265 | 161 | 2003 | 2555476 | ||
| Chicago Cubs | National | 1914 | 117.20 | 2959812 | 97 | 3.36 | 0.244 | 171 | 2004 | 2486609 | ||
| Chicago Sox | American | 1991 | 110.70 | 1755810 | 76 | 3.98 | 0.250 | 136 | 2005 | 2632655 | ||
| Cincinnati | National | 2003 | 117.70 | 2419506 | 64 | 4.33 | 0.248 | 167 | 2006 | 2866544 | ||
| Cleveland | American | 1994 | 87.70 | 1388905 | 81 | 3.67 | 0.256 | 141 | 2007 | 2944556 | ||
| Colorado | National | 1995 | 98.30 | 2506789 | 68 | 5.04 | 0.265 | 186 | 2008 | 3154845 | ||
| Detroit | American | 2000 | 172.80 | 2726048 | 74 | 4.64 | 0.270 | 151 | 2009 | 3240206 | ||
| Houston | American | 2000 | 69.10 | 2153585 | 86 | 3.57 | 0.250 | 230 | 2010 | 3297828 | ||
| Kansas City | American | 1973 | 112.90 | 2708549 | 95 | 3.73 | 0.269 | 139 | 2011 | 3305393 | ||
| LA Angels | American | 1966 | 146.40 | 3012765 | 85 | 3.94 | 0.246 | 176 | 2012 | 3440000 | ||
| LA Dodgers | National | 1962 | 230.40 | 3764815 | 92 | 3.44 | 0.250 | 187 | 2013 | 3650000 | ||
| Miami | National | 2012 | 84.60 | 1752235 | 71 | 4.02 | 0.260 | 120 | 2014 | 3950000 | ||
| Milwaukee | National | 2001 | 98.70 | 2542558 | 68 | 4.28 | 0.251 | 145 | 2015 | 4250000 | ||
| Minnesota | American | 2010 | 108.30 | 2220054 | 83 | 4.07 | 0.247 | 156 | ||||
| NY Mets | National | 2009 | 100.10 | 2569753 | 90 | 3.43 | 0.244 | 177 | ||||
| NY Yankees | American | 2009 | 213.50 | 3193795 | 87 | 4.05 | 0.251 | 212 | ||||
| Oakland | American | 1966 | 80.80 | 1768175 | 68 | 4.14 | 0.251 | 146 | ||||
| Philadelphia | National | 2004 | 133.00 | 1831080 | 63 | 4.69 | 0.249 | 130 | ||||
| Pittsburgh | National | 2001 | 85.90 | 2498596 | 98 | 3.21 | 0.260 | 140 | ||||
| San Diego | National | 2004 | 126.60 | 2459742 | 74 | 4.09 | 0.243 | 148 | ||||
| San Francisco | National | 2000 | 166.50 | 3375882 | 84 | 3.72 | 0.267 | 136 | ||||
| Seattle | American | 1999 | 123.20 | 2193581 | 76 | 4.16 | 0.249 | 198 | ||||
| St. Louis | National | 2006 | 120.30 | 3520889 | 100 | 2.94 | 0.253 | 137 | ||||
| Tampa Bay | American | 1990 | 74.80 | 1287054 | 80 | 3.74 | 0.252 | 167 | ||||
| Texas | American | 1994 | 144.80 | 2491875 | 88 | 4.24 | 0.257 | 172 | ||||
| Toronto | American | 1989 | 116.40 | 2794891 | 93 | 3.8 | 0.269 | 232 | ||||
| Washington | National | 2008 | 174.50 | 2619843 | 83 | 3.62 | 0.251 | 177 |
In: Math
Carleton Chemical claims that they can produce an average of
more than 800 tons of meladone
per week. A random sample of 36 weeks of production yielded a
sample mean of 823 tons, with
a standard deviation of 79.8 tons.
Does the sample data provide sufficient evidence to support the
claim
made by Carleton Chemical? Use a significance level of
α = .05.
In: Math
1. What is the z-score associated with the 75th percentile?
2. What z-scores bound the middle 50% of a normal distribution?
3. What z-score has 10% of the distribution above it?
4. What z-score has 20% of the distribution below it?
5. Reading comprehension scores for junior high students are
normally distributed with a mean of80.0 and a standard deviation of
5.0.
a. What percent of students have scores greater than 87.5?
b. What percent of students have scores between 75 and 85
In: Math
The New Jersey Department of Public Health offers psychological support programs for substance abuse patients with depression. It is suggested that the type of depression varies by the type of substance abuse. If so, such a relationship might help the department better target treatments. They random sample 75 medically declared substance abusers with depression. (C15PROB7.SAV) (χ2 = 5.12, p=.077; V=0.26; Lambdarow= 0.11, p > .05
| Substance Abuse | Clinical | Dysthymic | Manic |
| Alcohol | 20 | 10 | 8 |
| Drugs | 10 | 15 | 12 |
Select and justify the best test(s). The chi-square, Phi, Yates, or Lambda (or even a combination) might be best for a problem given the data and research question. Do not assume the independent is always on the row.
Provide the null and alternative hypotheses in formal and plain language for the appropriate test at the 0.05 significance level.
Do the math and reject/retain null at a=.05. State your critical value.
Explain the results in plain language.
In: Math
A researcher wishes to determine whether there is a significant relationship between the gender of psychology students and the refreshment drink they prefer. The results obtained from a survey of students are presented in the following table: Preferred Drink Gender Water Coffee Soda Total Male 46 29 40 115 Female 29 36 70 135 Total 75 65 110 250.
Perform an appropriate two-tailed hypothesis test at α = 0.05. If a significant result is obtained, determine the strength of the relationship. Show all four decision making steps. (20)
In: Math
Determine the sample size n needed to construct a 90% confidence interval to estimate the population proportion for the following sample proportions when the margin of error equals 8%.
a. p over bar equals 0.10
b. p over bar equals 0.20
c. p over bar equals 0.30
Click the icon to view a table of standard normal cumulative probabilities.
In: Math
At a university the historical mean of scholarship examination scores for freshman applications is 800. A historical population standard deviation σ = 150 is assumed known. Each year, the assistant dean uses a sample of applications to determine whether the mean examination score for the new freshman applications has changed.
(a) State the hypotheses.
(b) What is the 95% confidence interval estimate of the population mean examination score if a sample of 90 applications provided a sample mean x = 834?
(c) Use the confidence interval to conduct a hypothesis test. Using α = 0.05, what is your conclusion?
(d) What is the test statistic? What is the p-value?
In: Math
Refer to the Lincolnville School District bus data.
Conduct a test of hypothesis to reveal whether the mean maintenance cost is equal for each of the bus manufacturers. Use the .01 significance level.
Conduct a test of hypothesis to determine whether the mean miles traveled since the last maintenance is equal for each bus manufacturer. Use the .05 significance level.
Show work in Excel.
| ID | Manufacturer | Engine Type | Engine Type (0=diesel) | Capacity | Maintenance cost | Age | Odometer Miles | Miles |
| 122 | Bluebird | Gasoline | 1 | 55 | 9394 | 10 | 116580 | 11967 |
| 279 | Bluebird | Diesel | 0 | 55 | 1008 | 2 | 22672 | 11925 |
| 500 | Bluebird | Gasoline | 1 | 55 | 5329 | 5 | 50765 | 11922 |
| 520 | Bluebird | Diesel | 0 | 55 | 4794 | 10 | 119130 | 11896 |
| 714 | Bluebird | Diesel | 0 | 42 | 3742 | 7 | 73703 | 11837 |
| 875 | Bluebird | Diesel | 0 | 55 | 4376 | 9 | 97947 | 11814 |
| 600 | Bluebird | Diesel | 0 | 55 | 4832 | 10 | 119860 | 11800 |
| 953 | Bluebird | Diesel | 0 | 55 | 5160 | 10 | 117700 | 11798 |
| 101 | Bluebird | Diesel | 0 | 55 | 1955 | 4 | 41096 | 11789 |
| 358 | Bluebird | Diesel | 0 | 55 | 2775 | 6 | 70086 | 11782 |
| 29 | Bluebird | Gasoline | 1 | 55 | 5352 | 6 | 69438 | 11781 |
| 686 | Bluebird | Diesel | 0 | 55 | 1569 | 3 | 34674 | 11757 |
| 887 | Bluebird | Diesel | 0 | 55 | 3743 | 8 | 93672 | 11704 |
| 464 | Bluebird | Gasoline | 1 | 55 | 2540 | 3 | 34530 | 11698 |
| 43 | Bluebird | Gasoline | 1 | 55 | 8263 | 9 | 102969 | 11615 |
| 704 | Bluebird | Diesel | 0 | 55 | 4218 | 8 | 83424 | 11610 |
| 814 | Bluebird | Diesel | 0 | 55 | 2028 | 4 | 40824 | 11576 |
| 39 | Bluebird | Gasoline | 1 | 55 | 5821 | 6 | 69444 | 11533 |
| 699 | Bluebird | Gasoline | 1 | 55 | 9069 | 9 | 98307 | 11518 |
| 75 | Bluebird | Diesel | 0 | 55 | 3011 | 6 | 71970 | 11462 |
| 982 | Bluebird | Diesel | 0 | 55 | 505 | 1 | 10276 | 11359 |
| 321 | Bluebird | Diesel | 0 | 42 | 2732 | 6 | 70122 | 11358 |
| 884 | Bluebird | Diesel | 0 | 55 | 4364 | 9 | 92457 | 11231 |
| 57 | Bluebird | Diesel | 0 | 55 | 3190 | 7 | 79240 | 11222 |
| 731 | Bluebird | Diesel | 0 | 42 | 3213 | 6 | 68526 | 11168 |
| 135 | Bluebird | Diesel | 0 | 55 | 3560 | 7 | 76426 | 11127 |
| 692 | Bluebird | Diesel | 0 | 55 | 3770 | 8 | 93248 | 11048 |
| 200 | Bluebird | Diesel | 0 | 55 | 5168 | 10 | 103700 | 11018 |
| 540 | Bluebird | Gasoline | 1 | 55 | 3656 | 4 | 45284 | 10945 |
| 660 | Bluebird | Gasoline | 1 | 55 | 6213 | 6 | 64434 | 10911 |
| 482 | Bluebird | Gasoline | 1 | 55 | 10575 | 10 | 116534 | 10802 |
| 984 | Bluebird | Diesel | 0 | 55 | 3809 | 8 | 87664 | 10760 |
| 977 | Bluebird | Diesel | 0 | 55 | 3769 | 7 | 79422 | 10759 |
| 326 | Bluebird | Diesel | 0 | 55 | 4563 | 9 | 107343 | 10724 |
| 554 | Bluebird | Diesel | 0 | 42 | 1826 | 4 | 44604 | 10662 |
| 695 | Bluebird | Diesel | 0 | 55 | 1061 | 2 | 23152 | 10633 |
| 861 | Bluebird | Gasoline | 1 | 55 | 9669 | 10 | 106040 | 10551 |
| 883 | Bluebird | Gasoline | 1 | 55 | 1881 | 2 | 20742 | 10344 |
| 954 | Bluebird | Diesel | 0 | 42 | 5284 | 10 | 101000 | 10235 |
| 768 | Bluebird | Diesel | 0 | 42 | 3173 | 7 | 71778 | 10227 |
| 490 | Bluebird | Gasoline | 1 | 55 | 10133 | 10 | 106240 | 10210 |
| 725 | Bluebird | Diesel | 0 | 55 | 2356 | 5 | 57065 | 10209 |
| 507 | Bluebird | Diesel | 0 | 55 | 3690 | 7 | 72849 | 10095 |
| 40 | Bluebird | Gasoline | 1 | 55 | 9573 | 10 | 118470 | 10081 |
| 918 | Bluebird | Diesel | 0 | 55 | 2470 | 5 | 53620 | 10075 |
| 387 | Bluebird | Gasoline | 1 | 55 | 6863 | 8 | 89960 | 10055 |
| 418 | Bluebird | Diesel | 0 | 55 | 4513 | 9 | 104715 | 10000 |
| 10 | Keiser | Gasoline | 1 | 14 | 4646 | 5 | 54375 | 11973 |
| 751 | Keiser | Diesel | 0 | 14 | 1078 | 2 | 22444 | 11948 |
| 759 | Keiser | Diesel | 0 | 55 | 3952 | 8 | 87872 | 11883 |
| 365 | Keiser | Diesel | 0 | 55 | 3065 | 6 | 63384 | 11778 |
| 162 | Keiser | Gasoline | 1 | 55 | 3143 | 3 | 31266 | 11758 |
| 370 | Keiser | Gasoline | 1 | 55 | 7766 | 8 | 86528 | 11707 |
| 948 | Keiser | Diesel | 0 | 42 | 4342 | 9 | 97956 | 11691 |
| 678 | Keiser | Diesel | 0 | 55 | 3361 | 7 | 75229 | 11668 |
| 481 | Keiser | Gasoline | 1 | 6 | 3097 | 3 | 34362 | 11662 |
| 693 | Keiser | Gasoline | 1 | 55 | 9193 | 9 | 101889 | 11461 |
| 989 | Keiser | Diesel | 0 | 55 | 4795 | 9 | 106605 | 11418 |
| 724 | Keiser | Diesel | 0 | 42 | 3754 | 8 | 91968 | 11344 |
| 732 | Keiser | Diesel | 0 | 42 | 4640 | 9 | 101196 | 11342 |
| 880 | Keiser | Gasoline | 1 | 55 | 8410 | 9 | 97065 | 11336 |
| 61 | Keiser | Diesel | 0 | 55 | 4139 | 9 | 103536 | 11148 |
| 754 | Keiser | Diesel | 0 | 14 | 7380 | 14 | 146860 | 11003 |
| 353 | Keiser | Gasoline | 1 | 55 | 4279 | 4 | 45744 | 10902 |
| 705 | Keiser | Diesel | 0 | 42 | 2152 | 4 | 47596 | 10755 |
| 767 | Keiser | Diesel | 0 | 55 | 2985 | 6 | 71538 | 10726 |
| 120 | Keiser | Diesel | 0 | 42 | 4723 | 10 | 110320 | 10674 |
| 9 | Keiser | Gasoline | 1 | 55 | 3527 | 4 | 46848 | 10591 |
| 603 | Keiser | Diesel | 0 | 14 | 2116 | 4 | 44384 | 10518 |
| 427 | Keiser | Gasoline | 1 | 55 | 6927 | 7 | 73423 | 10355 |
| 45 | Keiser | Diesel | 0 | 55 | 3124 | 6 | 60102 | 10167 |
| 38 | Keiser | Gasoline | 1 | 14 | 5976 | 6 | 61662 | 10140 |
| 396 | Thompson | Diesel | 0 | 14 | 1072 | 2 | 21858 | 11969 |
| 193 | Thompson | Diesel | 0 | 14 | 5922 | 11 | 128711 | 11248 |
| 833 | Thompson | Diesel | 0 | 14 | 3920 | 8 | 90968 | 11112 |
| 671 | Thompson | Gasoline | 1 | 14 | 6733 | 8 | 89792 | 11100 |
| 398 | Thompson | Diesel | 0 | 6 | 4752 | 9 | 95922 | 10802 |
| 156 | Thompson | Diesel | 0 | 14 | 6212 | 12 | 140460 | 10473 |
| 168 | Thompson | Gasoline | 1 | 14 | 7004 | 7 | 83006 | 10315 |
| 314 | Thompson | Diesel | 0 | 6 | 5408 | 11 | 128117 | 10128 |
In: Math
The dataset TrafficFlow gives the delay time in seconds for 24 simulation runs in Dresden, Germany, comparing the current timed traffic light system on each run to a proposed flexible traffic light system in which lights communicate traffic flow information to neighboring lights. On average, public transportation was delayed 105 seconds under the timed system and 44 seconds under the flexible system. Since this is a matched pairs experiment, we are interested in the difference in times between the two methods for each of the 24 simulations. For the n=24 differences D, we were given that x¯D=61 seconds with sD=15.19 seconds. We wish to estimate the average time savings for public transportation on this stretch of road if the city of Dresden moves to the new system.
what parameter are we estimating? give correct notation
suppose that we write the 24 differences on 24 slips of paper. describe how to physically use the paper slips to create a bootstrap sample.
what statistic do we for this one bootstrap sample?
if we create a bootstrap distribution using many of these bootstrap statistics what shape do we expect it to be centered?
how can we use the values in the bootstrap distribution to find the standard error?
the standard error 3.1 for one set of 10,000 bootstrap samples. find and interpret a 95% confidence interval for the average time savings.
In: Math
You may need to use the appropriate appendix table or technology to answer this question.
Consider the following hypothesis test.
H0: μ = 30
Ha: μ ≠ 30
The population standard deviation is 14. Use α = 0.05. How large a sample should be taken if the researcher is willing to accept a 0.10 probability of making a type II error when the actual population mean is 34?
In: Math
A researcher selects a sample of 49 participants from a population with a mean of 12 and a standard deviation of 3.5. What is the probability of selecting a sample mean that is at least equal to the population mean? 0.50 equal to the probability of selecting a sample mean that is at most equal to the population mean all of the above none of the above
In: Math
Have you ever tried to get out of jury duty? About 25% of those called will find an excuse (work, poor health, travel out of town, etc.) to avoid jury duty.†
(a) If 11 people are called for jury duty, what is the
probability that all 11 will be available to serve on the jury?
(Round your answer to three decimal places.)
(b) If 11 people are called for jury duty, what is the probability
that 5 or more will not be available to serve on the jury?
(Round your answer to three decimal places.)
(c) Find the expected number of those available to serve on the
jury. What is the standard deviation? (Round your answers to two
decimal places.)
| μ = people |
| σ = people |
(d) How many people n must the jury commissioner contact
to be 95.9% sure of finding at least 12 people who are available to
serve? (Enter your answer as a whole number.)
people
In: Math
For patients who have been given a diabetes test, blood-glucose readings are approximately normally distributed with mean 128 mg/dl and a standard deviation 8 mg/dl. Suppose that a sample of 3 patients will be selected and the sample mean blood-glucose level will be computed. Enter answers rounded to three decimal places.
According to the empirical rule, in 95 percent of samples the SAMPLE MEAN blood-glucose level will be between the lower-bound of _________ and the upper-bound of ______
In: Math
Part 1 Binomial Distribution [Mark 20%/cancer type, 40% total mark]
Five year survival chance from any cancer depends on many factors like availability of treatment options, expertise of attending medical team and more. Five year survival rate is also an important measure and it is used by medical practitioners to report prognosis to patients and family. We will be analyzing five year survival rate of two types of cancer, very aggressive and very treatable cancer and to have comparative analysis of cancer in Norway.
(NOTE: due to limitation imposed by our available probability distribution table assume survival rate for breast cancer is 90% and for esophageal cancer is 20%)
To simplify our comparative analysis, we will assume 480 patients were admitted in January 2018. For each type of cancer:
|
Selected number of patient will survive 5 years |
Probability of breast cancer patient will survive 5 years |
Probability of esophageal cancer patient will survive 5 years |
|
0 |
||
|
1 |
||
|
2 |
||
|
3 |
||
|
4 |
||
|
5 |
||
|
6 |
||
|
7 |
||
|
8 |
||
|
9 |
||
|
10 |
||
|
11 |
||
|
12 |
||
|
13 |
||
|
14 |
||
|
15 |
Part 2 Normal distribution [Mark 30%]
Daily discharge from phosphate mine is normally distributed with a mean daily discharge of 38 mg/L and a standard deviation of 12 mg/L. What proportion of days will the daily discharge exceed 58 mg/L?
Part 3 Normal approximation of binomial Probability Distribution [Mark 30%]
Airlines and hotels often grant reservation in excess to their available capacity, to minimize loss and maximize profitability due to no shows. Suppose that the records of Air Georgian shows that on average, 10% of their prospective passengers will not show up at departure gates. If Air Georgian sells 215 tickets and their plane has capacity for 200 passengers.
In: Math
n 1998, the Nabisco Company launched a “1000 Chips Challenge” advertising campaign in which it was claimed that every 18-ounce bag of their Chips Ahoy cookies contains 1000 chips (on average). A curious statistics student purchased 8 randomly selected bags of cookies and counted the chocolate chips. The data is given below:
1200 1019 1214 1087 1214 900 1200 825
a) The student concluded that the data was not normally distributed and wanted to use a Wilcoxon Signed-Rank test to test the company’s claim. What assumption is needed in this case?
b) Assuming the assumption in part a. is met, at the 1% significance level, do the data provide sufficient evidence to conclude that the average number of chocolate chips in a bag of Chips Ahoy cookies differs from 1000? Carry out the Wilcoxon Signed-Rank Test by hand.
In: Math