An exercise science major wants to try to use body weight to predict how much someone can bench press. He collects the data shown below on 30 male students. Both quantities are measured in pounds.
Body Weight | Bench Press |
147 | 134 |
134 | 131 |
141 | 125 |
129 | 135 |
152 | 147 |
176 | 142 |
196 | 171 |
200 | 158 |
132 | 134 |
176 | 153 |
204 | 153 |
194 | 156 |
211 | 164 |
145 | 129 |
180 | 141 |
201 | 162 |
124 | 118 |
145 | 151 |
172 | 143 |
145 | 137 |
137 | 121 |
153 | 143 |
191 | 170 |
177 | 160 |
150 | 138 |
202 | 157 |
144 | 135 |
151 | 153 |
202 | 157 |
216 | 169 |
a) What type of association does there appear to be between
these two variables?
Solved Answer: positive association
b) Compute a 95% confidence interval for the average bench press of 150 pound males. What is the lower limit? Give your answer to two decimal places.
c) Compute a 95% confidence interval for the average bench press of 150 pound males. What is the upper limit? Give your answer to two decimal places.
d) Compute a 95% prediction interval for the bench press of a 150 pound male. What is the lower limit? Give your answer to two decimal places.
e) Compute a 95% prediction interval for the bench press of a 150 pound male. What is the upper limit? Give your answer to two decimal places.
Please provide formulas used for the confidence intervals and prediction intervals and standard error. Thank you!
In: Statistics and Probability
Although it is generally assumed that routine exercise will improve overall health, there is some question about how much exercise is necessary to produce the benefits. To address this question, a researcher obtained 20 pairs of participants, matched on age, gender, and weight. All participants exercised regularly but one participant within each pair exercised less than two hours per week, and the other spent more than five hours per week exercising. Each participant was subsequently evaluated by a physician and given an overall rating of their health. The researcher runs the statistics using SPSS and obtains the output below:
T-Test
PAIRED SAMPLE STATISTICS
mean | N | STD. Deviation | Std. error Mean | |
pair two_hrs | 13.40 | 20 | 2.563 | .573 |
1 five_hrs | 14.45 | 20 | 2.235 | .500 |
Paired Samples Test
Paired Differences | Paired Differences | Paired Differences |
PairedDifferences 95% confidence interval of the differences |
... | ||||
Mean | Std. Deviation | Std Error mean | Lower Upper | t | df | Sig.(2-tailed) | ||
Pair 1 two_hrs-five_hrs | -1.050 | 2.038 | .456 | -2.004 -.096 | -2.304 | 19 | .033 | |
1) using the hypothesis, what is the null hypothesis and the alternative hypothesis? and should it be rejected, why or why not?
2) what is the dependent and independent variable?
3) Is this a between-subjects or within-subjects design?
4) What is the effect size? Please explain your answer.
In: Statistics and Probability
. Suppose 2% of people have Syndrome X. We have a Syndrome X detecting test which gives which gives a positive result for 90% of people who do have the syndrome, but also gives a positive result for 10% of people who don’t actually have the syndrome. A patient comes in and gets a positive result. What are the chances they have Syndrome X? For full credit, you must show your work.
In: Statistics and Probability
10. Companies often have different mutual funds to serve different investment time horizons. Use the tab titled BESTFUNDS to determine if there is a difference between in the three-year annualized return for small cap growth, mid-cap growth, and large cap growth mutual funds. (a) Identify which type of test you plan to use and why. (b) Analyze and report your findings. (c) Put the findings into meaningful words (i.e., explain what the test allows you to conclude about the types of mutual funds). (d) Does your result from (b) give you statistical permission to probe group differences, yes or no? (4 points)
Small | Mid-Cap | Large |
20.19 | 25.30 | 29.03 |
27.81 | 26.98 | 23.17 |
21.47 | 28.75 | 22.52 |
22.00 | 24.88 | 21.87 |
21.10 | 28.01 | 25.98 |
25.73 | 22.78 | 22.08 |
28.41 | 30.91 | 31.46 |
27.00 | 25.40 | 26.80 |
25.41 | 30.30 | 23.83 |
21.59 | 32.51 | 28.72 |
In: Statistics and Probability
9. Mutual funds mix different types of investments which alters performance. Use the tab titled BESTFUNDS1 to determine if there is a difference between the one-year and three-year annualized return for the 20 mutual funds shown in the file. (a) Identify which type of test is most appropriate for you to use, justify your answer. (b) Determine whether or not the mean return differs for the two investment horizons (use α = .05). (c) Make sure to interpret the results for real-world use (i.e., explain what the test allows you to conclude about the mutual funds). (3 points)
Mutual Fund | One-Year return | Three-Year Return |
Mutual Fund 1 | 2.4 | 4.5 |
Mutual Fund 2 | 2.5 | 3.5 |
Mutual Fund 3 | 7.4 | 7.4 |
Mutual Fund 4 | 15.9 | 5.3 |
Mutual Fund 5 | 7.0 | 19.2 |
Mutual Fund 6 | 18.2 | 14.3 |
Mutual Fund 7 | 18.0 | 26.7 |
Mutual Fund 8 | 5.2 | 22.8 |
Mutual Fund 9 | 17.2 | 9.6 |
Mutual Fund 10 | 18.7 | 9.3 |
Mutual Fund 11 | 9.8 | 20.3 |
Mutual Fund 12 | 17.7 | 27.6 |
Mutual Fund 13 | 12.1 | 19.3 |
Mutual Fund 14 | 11.2 | 21.6 |
Mutual Fund 15 | 4.0 | 8.0 |
Mutual Fund 16 | 21.6 | 11.7 |
Mutual Fund 17 | 6.5 | 8.2 |
Mutual Fund 18 | 7.0 | 13.9 |
Mutual Fund 19 | 10.4 | 10.8 |
Mutual Fund 20 | 4.1 | 17.6 |
In: Statistics and Probability
A production line manager wants to determine how well the production line is running. He randomly selected 90 items off of the assembly line and found that 8 were defective. (Assume all conditions have been satisfied for building a confidence interval). Find the 99% confidence interval.
(0.0234, 0.1099) |
|
(0.0116, 0.1662) |
|
(0.0301, 0.1477) |
|
(0.0396, 0.1382) |
In: Statistics and Probability
Year # AIDS cases diagnosed # AIDS deaths
Year | # AIDS cases diagnosed | # AIDS deaths |
---|---|---|
Pre–1981 | 91 | 29 |
1981 | 319 | 121 |
1982 | 1,170 | 453 |
1983 | 3,076 | 1,482 |
1984 | 6,240 | 3,466 |
1985 | 11,776 | 6,878 |
1986 | 19,032 | 11,987 |
1987 | 28,564 | 16,162 |
1988 | 35,447 | 20,868 |
1989 | 42,674 | 27,591 |
1990 | 48,634 | 31,335 |
1991 | 59,660 | 36,560 |
1992 | 78,530 | 41,055 |
1993 | 78,834 | 44,730 |
1994 | 71,874 | 49,095 |
1995 | 68,505 | 49,456 |
1996 | 59,347 | 38,510 |
1997 | 47,149 | 20,736 |
1998 | 38,393 | 19,005 |
1999 | 25,174 | 18,454 |
2000 | 25,522 | 17,347 |
2001 | 25,643 | 17,402 |
2002 | 26,464 | 16,371 |
Total | 802,118 | 489,093 |
Graph "year" vs. "# AIDS deaths." Do not include pre-1981. Label both axes with words. Scale both axes. Calculate the following. (Round your answers to the nearest whole number. Round the correlation coefficient r to four decimal places.)
a =
b=
r=
n=
In: Statistics and Probability
A program was created to randomly choose customers at a clothing
store to receive a discount. The program claims 22% of the receipts
will get a discount in the long run. The owner of the clothing
store is skeptical and believes the program's calculations are
incorrect. He selects a random sample and finds that 17% received
the discount. The confidence interval is 0.17 ± 0.05 with all
conditions for inference met.
Part A: Using the given confidence interval, is it
statistically evident that the program is not working? Explain. (3
points)
Part B: Is it statistically evident from the
confidence interval that the program creates the discount with a
0.22 probability? Explain. (2 points)
Part C: Another random sample of receipts is
taken. This sample is five times the size of the original.
Seventeen percent of the receipts in the second sample received the
discount. What is the value of margin of error based on the second
sample with the same confidence level as the original interval? (2
points)
Part D: Using the margin of error from the second
sample in part C, is the program working as planned? Explain. (3
points)
In: Statistics and Probability
A thermocouple is being calibrated with a well designed ice bath. The measurements are given in the table to the right. Using any mathematical tool of your choice,the mean is .174 and standard deviation of the sample is .24228. I recommend excel for this problem and a simple example is included on the module for this class.
a) What is the bias error in this sample, assuming a perfect ice bath?
b) Assume you have corrected for the bias error and write an
expression for the precision error in this sample. Use the formula
T=X± ( 2.101)ST (95% ) where X is the sample mean (which will be 0
C after correction for the bias error in this case) ST is the
sample standard deviation, and the coefficient 2.101 is a
correction to the expression due to the limited amount of data.
(Google “t-test” if you are not familiar with this procedure – if
we had an infinite amount of data, this value would be 1.96.)
Test number Measurement C
1 0.220
2 0.280
3 0.190
4 0.510
5 -0.130
6 0.230
7 0.400
8 0.110
9 0.100
10 0.390
11 -0.090
12 0.490
13 0.250
14 0.550
15 0.100
16 -0.180
17 -0.250
18 -0.030
4) In the previous test you were given a formula: The coefficient
of the Standard Deviation, 2.101, is based on a test called a
t-test and depends on the confidence interval and the number of
measurements, which were 95% and 18 in that case. Assume that there
are only 14 measurements, but you have exactly the same average and
standard deviation that you calculated on the previous problem. Go
to section 3.6.12 of the book referenced in problem 1, read that
section if necessary, and determine what the coefficient should be
for this smaller data set. (In other words, what should “2.101” be
replaced with in this new data set with only 14 samples.)
In: Statistics and Probability
Word Problems with a Sample Data Set:
16 18 12 17 20 18 15 14
*Round all final answers to 2 decimal places if necessary
In: Statistics and Probability
Customer |
Response |
Distance to Jack |
Distance to Colleen |
Janet |
Yes |
1.7 |
1.3 |
Adam |
Yes |
1.6 |
1.3 |
Tom |
No |
3.8 |
0.9 |
Sarah |
Yes |
2.4 |
1.2 |
Nancy |
No |
3.7 |
0.9 |
Joseph |
No |
2.5 |
1.5 |
What will be the predicted responses of Jack and Colleen using k-NN when k is set to the following values? Justify your answer.
In: Statistics and Probability
A box in a certain supply room contains four 30W lightbulbs, five 60W lightbulbs, and six 70W lightbulbs. Suppose that four lightbulbs are randomly selected. 1) What is the probability that at least one lightbulb of each type is selected? 2)Find the expected number of lightbulbs (incorrect ones) that need to be selected before a 60W or 70W lightbulb is selected. Show this by building a probability table.
In: Statistics and Probability
González Industries requires its sales personnel to keep track of their weekly contacts with customers. A sample of 16 reports showed a mean of 32.4 customer contacts per week for the sales personnel, and a sample standard deviation of 5.7 contacts. Assuming customer contacts is a normally distributed variable, generate a 95% confidence interval estimate of the true mean number of customer contacts per week at González Industries. Begin by stating whether this estimation problem should use the student t distribution or the normal (Z) distribution. Should the t distribution be used?
If you indicated the t distribution should be used, give the value that should be used here. If you said, "no", then indicate which Z value should be used.
What is the lower limit of the confidence interval?
What is the upper limit of the confidence interval?
In: Statistics and Probability
Assume a binomial probability distribution has
p = 0.60
and
n = 300.
(a)
What are the mean and standard deviation? (Round your answers to two decimal places.)
mean standard deviation
(b)
Is this situation one in which binomial probabilities can be approximated by the normal probability distribution? Explain.
No, because np ≥ 5 and n(1 − p) ≥ 5. Yes, because n ≥ 30. Yes, because np ≥ 5 and n(1 − p) ≥ 5. No, because np < 5 and n(1 − p) < 5. Yes, because np < 5 and n(1 − p) < 5.
(c)
What is the probability of 160 to 170 successes? Use the normal approximation of the binomial distribution to answer this question. (Round your answer to four decimal places.)
(d)
What is the probability of 190 or more successes? Use the normal approximation of the binomial distribution to answer this question. (Round your answer to four decimal places.)
(e)
What is the advantage of using the normal probability distribution to approximate the binomial probabilities?
The advantage would be that using the normal probability distribution to approximate the binomial probabilities increases the number of calculations. The advantage would be that using the normal probability distribution to approximate the binomial probabilities makes the calculations less accurate. The advantage would be that using the normal probability distribution to approximate the binomial probabilities makes the calculations more accurate. The advantage would be that using the the normal probability distribution to approximate the binomial probabilities reduces the number of calculations.
How would you calculate the probability in part (d) using the binomial distribution. (Use f(x) to denote the binomial probability function.)
P(x ≥ 190) = f(191) + f(192) + f(193) + f(194) + + f(300)
P(x ≥ 190) = f(190) + f(191) + f(192) + f(193) + + f(300)
P(x ≥ 190) = f(0) + f(1) + + f(188) + f(189)
P(x ≥ 190) = f(0) + f(1) + + f(189) + f(190)
P(x ≥ 190) = 1 − f(189) − f(190) − f(191) − f(192) − − f(300)
In: Statistics and Probability
One pharmaceutical company claims their flu vaccine reduces the risk of illness by 40%. One political pundit believes that the real percentage is less. He does a study and the conclusion of the statistical test is to reject the null hypnosis. According to the CDC recent studies show that flu vaccinations reduce the risk of illness by between 40% and 60%. Given this information answer the following questions.
(a) State the null hypothesis as a mathematical statement.
(b) State the alternative hypothesis as a mathematical statement. (c)Was a correct decision made? Why or why not?
(d) If an error was made state what type of error. If no error was made state ”correct decision”.
In: Statistics and Probability