The New Jersey Department of Public Health offers psychological support programs for substance abuse patients with depression. It is suggested that the type of depression varies by the type of substance abuse. If so, such a relationship might help the department better target treatments. They random sample 75 medically declared substance abusers with depression. (C15PROB7.SAV) (χ2 = 5.12, p=.077; V=0.26; Lambdarow= 0.11, p > .05
Substance Abuse | Clinical | Dysthymic | Manic |
Alcohol | 20 | 10 | 8 |
Drugs | 10 | 15 | 12 |
Select and justify the best test(s). The chi-square, Phi, Yates, or Lambda (or even a combination) might be best for a problem given the data and research question. Do not assume the independent is always on the row.
Provide the null and alternative hypotheses in formal and plain language for the appropriate test at the 0.05 significance level.
Do the math and reject/retain null at a=.05. State your critical value.
Explain the results in plain language.
In: Math
A researcher wishes to determine whether there is a significant relationship between the gender of psychology students and the refreshment drink they prefer. The results obtained from a survey of students are presented in the following table: Preferred Drink Gender Water Coffee Soda Total Male 46 29 40 115 Female 29 36 70 135 Total 75 65 110 250.
Perform an appropriate two-tailed hypothesis test at α = 0.05. If a significant result is obtained, determine the strength of the relationship. Show all four decision making steps. (20)
In: Math
Determine the sample size n needed to construct a 90% confidence interval to estimate the population proportion for the following sample proportions when the margin of error equals 8%.
a. p over bar equals 0.10
b. p over bar equals 0.20
c. p over bar equals 0.30
Click the icon to view a table of standard normal cumulative probabilities.
In: Math
At a university the historical mean of scholarship examination scores for freshman applications is 800. A historical population standard deviation σ = 150 is assumed known. Each year, the assistant dean uses a sample of applications to determine whether the mean examination score for the new freshman applications has changed.
(a) State the hypotheses.
(b) What is the 95% confidence interval estimate of the population mean examination score if a sample of 90 applications provided a sample mean x = 834?
(c) Use the confidence interval to conduct a hypothesis test. Using α = 0.05, what is your conclusion?
(d) What is the test statistic? What is the p-value?
In: Math
Refer to the Lincolnville School District bus data.
Conduct a test of hypothesis to reveal whether the mean maintenance cost is equal for each of the bus manufacturers. Use the .01 significance level.
Conduct a test of hypothesis to determine whether the mean miles traveled since the last maintenance is equal for each bus manufacturer. Use the .05 significance level.
Show work in Excel.
ID | Manufacturer | Engine Type | Engine Type (0=diesel) | Capacity | Maintenance cost | Age | Odometer Miles | Miles |
122 | Bluebird | Gasoline | 1 | 55 | 9394 | 10 | 116580 | 11967 |
279 | Bluebird | Diesel | 0 | 55 | 1008 | 2 | 22672 | 11925 |
500 | Bluebird | Gasoline | 1 | 55 | 5329 | 5 | 50765 | 11922 |
520 | Bluebird | Diesel | 0 | 55 | 4794 | 10 | 119130 | 11896 |
714 | Bluebird | Diesel | 0 | 42 | 3742 | 7 | 73703 | 11837 |
875 | Bluebird | Diesel | 0 | 55 | 4376 | 9 | 97947 | 11814 |
600 | Bluebird | Diesel | 0 | 55 | 4832 | 10 | 119860 | 11800 |
953 | Bluebird | Diesel | 0 | 55 | 5160 | 10 | 117700 | 11798 |
101 | Bluebird | Diesel | 0 | 55 | 1955 | 4 | 41096 | 11789 |
358 | Bluebird | Diesel | 0 | 55 | 2775 | 6 | 70086 | 11782 |
29 | Bluebird | Gasoline | 1 | 55 | 5352 | 6 | 69438 | 11781 |
686 | Bluebird | Diesel | 0 | 55 | 1569 | 3 | 34674 | 11757 |
887 | Bluebird | Diesel | 0 | 55 | 3743 | 8 | 93672 | 11704 |
464 | Bluebird | Gasoline | 1 | 55 | 2540 | 3 | 34530 | 11698 |
43 | Bluebird | Gasoline | 1 | 55 | 8263 | 9 | 102969 | 11615 |
704 | Bluebird | Diesel | 0 | 55 | 4218 | 8 | 83424 | 11610 |
814 | Bluebird | Diesel | 0 | 55 | 2028 | 4 | 40824 | 11576 |
39 | Bluebird | Gasoline | 1 | 55 | 5821 | 6 | 69444 | 11533 |
699 | Bluebird | Gasoline | 1 | 55 | 9069 | 9 | 98307 | 11518 |
75 | Bluebird | Diesel | 0 | 55 | 3011 | 6 | 71970 | 11462 |
982 | Bluebird | Diesel | 0 | 55 | 505 | 1 | 10276 | 11359 |
321 | Bluebird | Diesel | 0 | 42 | 2732 | 6 | 70122 | 11358 |
884 | Bluebird | Diesel | 0 | 55 | 4364 | 9 | 92457 | 11231 |
57 | Bluebird | Diesel | 0 | 55 | 3190 | 7 | 79240 | 11222 |
731 | Bluebird | Diesel | 0 | 42 | 3213 | 6 | 68526 | 11168 |
135 | Bluebird | Diesel | 0 | 55 | 3560 | 7 | 76426 | 11127 |
692 | Bluebird | Diesel | 0 | 55 | 3770 | 8 | 93248 | 11048 |
200 | Bluebird | Diesel | 0 | 55 | 5168 | 10 | 103700 | 11018 |
540 | Bluebird | Gasoline | 1 | 55 | 3656 | 4 | 45284 | 10945 |
660 | Bluebird | Gasoline | 1 | 55 | 6213 | 6 | 64434 | 10911 |
482 | Bluebird | Gasoline | 1 | 55 | 10575 | 10 | 116534 | 10802 |
984 | Bluebird | Diesel | 0 | 55 | 3809 | 8 | 87664 | 10760 |
977 | Bluebird | Diesel | 0 | 55 | 3769 | 7 | 79422 | 10759 |
326 | Bluebird | Diesel | 0 | 55 | 4563 | 9 | 107343 | 10724 |
554 | Bluebird | Diesel | 0 | 42 | 1826 | 4 | 44604 | 10662 |
695 | Bluebird | Diesel | 0 | 55 | 1061 | 2 | 23152 | 10633 |
861 | Bluebird | Gasoline | 1 | 55 | 9669 | 10 | 106040 | 10551 |
883 | Bluebird | Gasoline | 1 | 55 | 1881 | 2 | 20742 | 10344 |
954 | Bluebird | Diesel | 0 | 42 | 5284 | 10 | 101000 | 10235 |
768 | Bluebird | Diesel | 0 | 42 | 3173 | 7 | 71778 | 10227 |
490 | Bluebird | Gasoline | 1 | 55 | 10133 | 10 | 106240 | 10210 |
725 | Bluebird | Diesel | 0 | 55 | 2356 | 5 | 57065 | 10209 |
507 | Bluebird | Diesel | 0 | 55 | 3690 | 7 | 72849 | 10095 |
40 | Bluebird | Gasoline | 1 | 55 | 9573 | 10 | 118470 | 10081 |
918 | Bluebird | Diesel | 0 | 55 | 2470 | 5 | 53620 | 10075 |
387 | Bluebird | Gasoline | 1 | 55 | 6863 | 8 | 89960 | 10055 |
418 | Bluebird | Diesel | 0 | 55 | 4513 | 9 | 104715 | 10000 |
10 | Keiser | Gasoline | 1 | 14 | 4646 | 5 | 54375 | 11973 |
751 | Keiser | Diesel | 0 | 14 | 1078 | 2 | 22444 | 11948 |
759 | Keiser | Diesel | 0 | 55 | 3952 | 8 | 87872 | 11883 |
365 | Keiser | Diesel | 0 | 55 | 3065 | 6 | 63384 | 11778 |
162 | Keiser | Gasoline | 1 | 55 | 3143 | 3 | 31266 | 11758 |
370 | Keiser | Gasoline | 1 | 55 | 7766 | 8 | 86528 | 11707 |
948 | Keiser | Diesel | 0 | 42 | 4342 | 9 | 97956 | 11691 |
678 | Keiser | Diesel | 0 | 55 | 3361 | 7 | 75229 | 11668 |
481 | Keiser | Gasoline | 1 | 6 | 3097 | 3 | 34362 | 11662 |
693 | Keiser | Gasoline | 1 | 55 | 9193 | 9 | 101889 | 11461 |
989 | Keiser | Diesel | 0 | 55 | 4795 | 9 | 106605 | 11418 |
724 | Keiser | Diesel | 0 | 42 | 3754 | 8 | 91968 | 11344 |
732 | Keiser | Diesel | 0 | 42 | 4640 | 9 | 101196 | 11342 |
880 | Keiser | Gasoline | 1 | 55 | 8410 | 9 | 97065 | 11336 |
61 | Keiser | Diesel | 0 | 55 | 4139 | 9 | 103536 | 11148 |
754 | Keiser | Diesel | 0 | 14 | 7380 | 14 | 146860 | 11003 |
353 | Keiser | Gasoline | 1 | 55 | 4279 | 4 | 45744 | 10902 |
705 | Keiser | Diesel | 0 | 42 | 2152 | 4 | 47596 | 10755 |
767 | Keiser | Diesel | 0 | 55 | 2985 | 6 | 71538 | 10726 |
120 | Keiser | Diesel | 0 | 42 | 4723 | 10 | 110320 | 10674 |
9 | Keiser | Gasoline | 1 | 55 | 3527 | 4 | 46848 | 10591 |
603 | Keiser | Diesel | 0 | 14 | 2116 | 4 | 44384 | 10518 |
427 | Keiser | Gasoline | 1 | 55 | 6927 | 7 | 73423 | 10355 |
45 | Keiser | Diesel | 0 | 55 | 3124 | 6 | 60102 | 10167 |
38 | Keiser | Gasoline | 1 | 14 | 5976 | 6 | 61662 | 10140 |
396 | Thompson | Diesel | 0 | 14 | 1072 | 2 | 21858 | 11969 |
193 | Thompson | Diesel | 0 | 14 | 5922 | 11 | 128711 | 11248 |
833 | Thompson | Diesel | 0 | 14 | 3920 | 8 | 90968 | 11112 |
671 | Thompson | Gasoline | 1 | 14 | 6733 | 8 | 89792 | 11100 |
398 | Thompson | Diesel | 0 | 6 | 4752 | 9 | 95922 | 10802 |
156 | Thompson | Diesel | 0 | 14 | 6212 | 12 | 140460 | 10473 |
168 | Thompson | Gasoline | 1 | 14 | 7004 | 7 | 83006 | 10315 |
314 | Thompson | Diesel | 0 | 6 | 5408 | 11 | 128117 | 10128 |
In: Math
The dataset TrafficFlow gives the delay time in seconds for 24 simulation runs in Dresden, Germany, comparing the current timed traffic light system on each run to a proposed flexible traffic light system in which lights communicate traffic flow information to neighboring lights. On average, public transportation was delayed 105 seconds under the timed system and 44 seconds under the flexible system. Since this is a matched pairs experiment, we are interested in the difference in times between the two methods for each of the 24 simulations. For the n=24 differences D, we were given that x¯D=61 seconds with sD=15.19 seconds. We wish to estimate the average time savings for public transportation on this stretch of road if the city of Dresden moves to the new system.
what parameter are we estimating? give correct notation
suppose that we write the 24 differences on 24 slips of paper. describe how to physically use the paper slips to create a bootstrap sample.
what statistic do we for this one bootstrap sample?
if we create a bootstrap distribution using many of these bootstrap statistics what shape do we expect it to be centered?
how can we use the values in the bootstrap distribution to find the standard error?
the standard error 3.1 for one set of 10,000 bootstrap samples. find and interpret a 95% confidence interval for the average time savings.
In: Math
You may need to use the appropriate appendix table or technology to answer this question.
Consider the following hypothesis test.
H0: μ = 30
Ha: μ ≠ 30
The population standard deviation is 14. Use α = 0.05. How large a sample should be taken if the researcher is willing to accept a 0.10 probability of making a type II error when the actual population mean is 34?
In: Math
A researcher selects a sample of 49 participants from a population with a mean of 12 and a standard deviation of 3.5. What is the probability of selecting a sample mean that is at least equal to the population mean? 0.50 equal to the probability of selecting a sample mean that is at most equal to the population mean all of the above none of the above
In: Math
Have you ever tried to get out of jury duty? About 25% of those called will find an excuse (work, poor health, travel out of town, etc.) to avoid jury duty.†
(a) If 11 people are called for jury duty, what is the
probability that all 11 will be available to serve on the jury?
(Round your answer to three decimal places.)
(b) If 11 people are called for jury duty, what is the probability
that 5 or more will not be available to serve on the jury?
(Round your answer to three decimal places.)
(c) Find the expected number of those available to serve on the
jury. What is the standard deviation? (Round your answers to two
decimal places.)
μ = people |
σ = people |
(d) How many people n must the jury commissioner contact
to be 95.9% sure of finding at least 12 people who are available to
serve? (Enter your answer as a whole number.)
people
In: Math
For patients who have been given a diabetes test, blood-glucose readings are approximately normally distributed with mean 128 mg/dl and a standard deviation 8 mg/dl. Suppose that a sample of 3 patients will be selected and the sample mean blood-glucose level will be computed. Enter answers rounded to three decimal places.
According to the empirical rule, in 95 percent of samples the SAMPLE MEAN blood-glucose level will be between the lower-bound of _________ and the upper-bound of ______
In: Math
Part 1 Binomial Distribution [Mark 20%/cancer type, 40% total mark]
Five year survival chance from any cancer depends on many factors like availability of treatment options, expertise of attending medical team and more. Five year survival rate is also an important measure and it is used by medical practitioners to report prognosis to patients and family. We will be analyzing five year survival rate of two types of cancer, very aggressive and very treatable cancer and to have comparative analysis of cancer in Norway.
(NOTE: due to limitation imposed by our available probability distribution table assume survival rate for breast cancer is 90% and for esophageal cancer is 20%)
To simplify our comparative analysis, we will assume 480 patients were admitted in January 2018. For each type of cancer:
Selected number of patient will survive 5 years |
Probability of breast cancer patient will survive 5 years |
Probability of esophageal cancer patient will survive 5 years |
0 |
||
1 |
||
2 |
||
3 |
||
4 |
||
5 |
||
6 |
||
7 |
||
8 |
||
9 |
||
10 |
||
11 |
||
12 |
||
13 |
||
14 |
||
15 |
Part 2 Normal distribution [Mark 30%]
Daily discharge from phosphate mine is normally distributed with a mean daily discharge of 38 mg/L and a standard deviation of 12 mg/L. What proportion of days will the daily discharge exceed 58 mg/L?
Part 3 Normal approximation of binomial Probability Distribution [Mark 30%]
Airlines and hotels often grant reservation in excess to their available capacity, to minimize loss and maximize profitability due to no shows. Suppose that the records of Air Georgian shows that on average, 10% of their prospective passengers will not show up at departure gates. If Air Georgian sells 215 tickets and their plane has capacity for 200 passengers.
In: Math
n 1998, the Nabisco Company launched a “1000 Chips Challenge” advertising campaign in which it was claimed that every 18-ounce bag of their Chips Ahoy cookies contains 1000 chips (on average). A curious statistics student purchased 8 randomly selected bags of cookies and counted the chocolate chips. The data is given below:
1200 1019 1214 1087 1214 900 1200 825
a) The student concluded that the data was not normally distributed and wanted to use a Wilcoxon Signed-Rank test to test the company’s claim. What assumption is needed in this case?
b) Assuming the assumption in part a. is met, at the 1% significance level, do the data provide sufficient evidence to conclude that the average number of chocolate chips in a bag of Chips Ahoy cookies differs from 1000? Carry out the Wilcoxon Signed-Rank Test by hand.
In: Math
( PLEASE SHOW ALL YOUR WORK)
You will need your ticker code (company abbreviation) for stock prices for this question. Use your ticker code to obtain the closing prices for the following two time periods to obtain two data sets:
March 2, 2019 to March 16, 2019
Data set A
February 16, 2019 to February 28, 2019
Data set B
Take the closing prices from data set B and add 0.5 to each one of them. Treat data sets A and B as hypothetical sample level data on the weights of newborns whose parents smoke cigarettes (data set A), and those whose parents do not (data set B).
a) Conduct a hypothesis test to compare the variances between the two data sets.
b) Conduct a hypothesis to compare the means between the two data sets. Selecting the assumption of equal variance or unequal variance for the calculations should be based on the results of the previous test.
c) Calculate a 95% confidence interval for the difference between means
A | B | ||
84.09 | 83.74 | ||
83.22 | 84.45 | ||
82.35 | 84.37 | ||
83.39 | 83.77 | ||
82.65 | 84.66 | ||
82.09 | 85.5 | ||
82.49 | 85.35 | ||
82.19 | 86.29 | ||
82.32 | |||
82.4 | |||
83.06 | |||
Mean | 82.75 | 84.76625 | |
SD | 0.617770184 | 0.887612166 |
In: Math
8) What proportion of a normal distribution is located between each of the following z-score boundaries?
a. z = –0.25 and z = +0.25
b. z = –0.67 and z = +0.67
c. z = –1.20 and z = +1.20
13) A normal distribution has a mean of μ = 30 and a standard deviation of σ = 12. For each of the following scores, indicate whether the body is to the right or left of the score and find the proportion of the distribution located in the body.
a. X = 33
b. X = 18
c. X = 24
d. X = 39
19) A report in 2010 indicates that Americans between the ages of 8 and 18 spend an average of μ = 7.5 hours per day using some sort of electronic device such as smart phones, computers, or tablets. Assume that the distribution of times is normal with a standard deviation of σ = 2.5 hours and find the following values.
a. What is the probability of selecting an individual who uses electronic devices more than 12 hours a day?
b. What proportion of 8- to 18-year-old Americans spend between 5 and 10 hours per day using electronic devices? In symbols, p (5 < X < 10) = ?
In: Math
|
Please show Excel work:
A. Use α = .05. Test to determine whether the proportions of female and male voters who intend to vote for the Democrat candidate differ? Report the test statistic and the p-value.
B. Provide a 99% confidence interval for the difference in the proportion of female and male voters who intend to vote for the Democrat candidate
In: Math