Questions
An exercise science major wants to try to use body weight to predict how much someone...

An exercise science major wants to try to use body weight to predict how much someone can bench press. He collects the data shown below on 30 male students. Both quantities are measured in pounds.

Body Weight Bench Press
147 134
134 131
141 125
129 135
152 147
176 142
196 171
200 158
132 134
176 153
204 153
194 156
211 164
145 129
180 141
201 162
124 118
145 151
172 143
145 137
137 121
153 143
191 170
177 160
150 138
202 157
144 135
151 153
202 157
216 169

a) What type of association does there appear to be between these two variables?
Solved Answer: positive association

b) Compute a 95% confidence interval for the average bench press of 150 pound males. What is the lower limit? Give your answer to two decimal places.

c) Compute a 95% confidence interval for the average bench press of 150 pound males. What is the upper limit? Give your answer to two decimal places.

d) Compute a 95% prediction interval for the bench press of a 150 pound male. What is the lower limit? Give your answer to two decimal places.

e) Compute a 95% prediction interval for the bench press of a 150 pound male. What is the upper limit? Give your answer to two decimal places.

Please provide formulas used for the confidence intervals and prediction intervals and standard error. Thank you!

In: Statistics and Probability

Although it is generally assumed that routine exercise will improve overall health, there is some question...

Although it is generally assumed that routine exercise will improve overall health, there is some question about how much exercise is necessary to produce the benefits. To address this question, a researcher obtained 20 pairs of participants, matched on age, gender, and weight. All participants exercised regularly but one participant within each pair exercised less than two hours per week, and the other spent more than five hours per week exercising. Each participant was subsequently evaluated by a physician and given an overall rating of their health. The researcher runs the statistics using SPSS and obtains the output below:

T-Test

PAIRED SAMPLE STATISTICS

mean N STD. Deviation Std. error Mean
pair two_hrs 13.40 20 2.563 .573
1 five_hrs 14.45 20 2.235 .500

Paired Samples Test

Paired Differences Paired Differences Paired Differences

PairedDifferences

95% confidence interval of the differences

...
Mean Std. Deviation Std Error mean Lower Upper t df Sig.(2-tailed)
Pair 1 two_hrs-five_hrs -1.050 2.038 .456 -2.004 -.096 -2.304 19 .033

1) using the hypothesis, what is the null hypothesis and the alternative hypothesis? and should it be rejected, why or why not?

2) what is the dependent and independent variable?

3) Is this a between-subjects or within-subjects design?

4) What is the effect size? Please explain your answer.

In: Statistics and Probability

. Suppose 2% of people have Syndrome X. We have a Syndrome X detecting test which...

. Suppose 2% of people have Syndrome X. We have a Syndrome X detecting test which gives which gives a positive result for 90% of people who do have the syndrome, but also gives a positive result for 10% of people who don’t actually have the syndrome. A patient comes in and gets a positive result. What are the chances they have Syndrome X? For full credit, you must show your work.

In: Statistics and Probability

10. Companies often have different mutual funds to serve different investment time horizons. Use the tab...

10. Companies often have different mutual funds to serve different investment time horizons. Use the tab titled BESTFUNDS to determine if there is a difference between in the three-year annualized return for small cap growth, mid-cap growth, and large cap growth mutual funds. (a) Identify which type of test you plan to use and why. (b) Analyze and report your findings. (c) Put the findings into meaningful words (i.e., explain what the test allows you to conclude about the types of mutual funds). (d) Does your result from (b) give you statistical permission to probe group differences, yes or no? (4 points)

Small Mid-Cap Large
20.19 25.30 29.03
27.81 26.98 23.17
21.47 28.75 22.52
22.00 24.88 21.87
21.10 28.01 25.98
25.73 22.78 22.08
28.41 30.91 31.46
27.00 25.40 26.80
25.41 30.30 23.83
21.59 32.51 28.72

In: Statistics and Probability

9. Mutual funds mix different types of investments which alters performance. Use the tab titled BESTFUNDS1...

9. Mutual funds mix different types of investments which alters performance. Use the tab titled BESTFUNDS1 to determine if there is a difference between the one-year and three-year annualized return for the 20 mutual funds shown in the file. (a) Identify which type of test is most appropriate for you to use, justify your answer. (b) Determine whether or not the mean return differs for the two investment horizons (use α = .05). (c) Make sure to interpret the results for real-world use (i.e., explain what the test allows you to conclude about the mutual funds). (3 points)

Mutual Fund One-Year return Three-Year Return
Mutual Fund 1 2.4 4.5
Mutual Fund 2 2.5 3.5
Mutual Fund 3 7.4 7.4
Mutual Fund 4 15.9 5.3
Mutual Fund 5 7.0 19.2
Mutual Fund 6 18.2 14.3
Mutual Fund 7 18.0 26.7
Mutual Fund 8 5.2 22.8
Mutual Fund 9 17.2 9.6
Mutual Fund 10 18.7 9.3
Mutual Fund 11 9.8 20.3
Mutual Fund 12 17.7 27.6
Mutual Fund 13 12.1 19.3
Mutual Fund 14 11.2 21.6
Mutual Fund 15 4.0 8.0
Mutual Fund 16 21.6 11.7
Mutual Fund 17 6.5 8.2
Mutual Fund 18 7.0 13.9
Mutual Fund 19 10.4 10.8
Mutual Fund 20 4.1 17.6

In: Statistics and Probability

A production line manager wants to determine how well the production line is running. He randomly...

A production line manager wants to determine how well the production line is running. He randomly selected 90 items off of the assembly line and found that 8 were defective. (Assume all conditions have been satisfied for building a confidence interval). Find the 99% confidence interval.

(0.0234, 0.1099)

(0.0116, 0.1662)

(0.0301, 0.1477)

(0.0396, 0.1382)

In: Statistics and Probability

Year # AIDS cases diagnosed # AIDS deaths Year # AIDS cases diagnosed # AIDS deaths...

Year # AIDS cases diagnosed # AIDS deaths

Year # AIDS cases diagnosed # AIDS deaths
Pre–1981 91 29
1981 319 121
1982 1,170 453
1983 3,076 1,482
1984 6,240 3,466
1985 11,776 6,878
1986 19,032 11,987
1987 28,564 16,162
1988 35,447 20,868
1989 42,674 27,591
1990 48,634 31,335
1991 59,660 36,560
1992 78,530 41,055
1993 78,834 44,730
1994 71,874 49,095
1995 68,505 49,456
1996 59,347 38,510
1997 47,149 20,736
1998 38,393 19,005
1999 25,174 18,454
2000 25,522 17,347
2001 25,643 17,402
2002 26,464 16,371
Total 802,118 489,093

Graph "year" vs. "# AIDS deaths." Do not include pre-1981. Label both axes with words. Scale both axes. Calculate the following. (Round your answers to the nearest whole number. Round the correlation coefficient r to four decimal places.)

a =

b=

r=

n=

In: Statistics and Probability

A program was created to randomly choose customers at a clothing store to receive a discount....

A program was created to randomly choose customers at a clothing store to receive a discount. The program claims 22% of the receipts will get a discount in the long run. The owner of the clothing store is skeptical and believes the program's calculations are incorrect. He selects a random sample and finds that 17% received the discount. The confidence interval is 0.17 ± 0.05 with all conditions for inference met.

Part A: Using the given confidence interval, is it statistically evident that the program is not working? Explain. (3 points)

Part B: Is it statistically evident from the confidence interval that the program creates the discount with a 0.22 probability? Explain. (2 points)

Part C: Another random sample of receipts is taken. This sample is five times the size of the original. Seventeen percent of the receipts in the second sample received the discount. What is the value of margin of error based on the second sample with the same confidence level as the original interval? (2 points)

Part D: Using the margin of error from the second sample in part C, is the program working as planned? Explain. (3 points)

In: Statistics and Probability

A thermocouple is being calibrated with a well designed ice bath. The measurements are given in...

A thermocouple is being calibrated with a well designed ice bath. The measurements are given in the table to the right. Using any mathematical tool of your choice,the mean is .174 and standard deviation of the sample is .24228. I recommend excel for this problem and a simple example is included on the module for this class.

a) What is the bias error in this sample, assuming a perfect ice bath?

b) Assume you have corrected for the bias error and write an expression for the precision error in this sample. Use the formula T=X± ( 2.101)ST (95% ) where X is the sample mean (which will be 0 C after correction for the bias error in this case) ST is the sample standard deviation, and the coefficient 2.101 is a correction to the expression due to the limited amount of data. (Google “t-test” if you are not familiar with this procedure – if we had an infinite amount of data, this value would be 1.96.)   

Test number Measurement C

1 0.220

2 0.280

3 0.190

4 0.510

5    -0.130

6 0.230

7 0.400

8 0.110

9 0.100

10    0.390

11 -0.090

12    0.490

13 0.250

14    0.550

15 0.100

16 -0.180

17 -0.250

18    -0.030


4) In the previous test you were given a formula: The coefficient of the Standard Deviation, 2.101, is based on a test called a t-test and depends on the confidence interval and the number of measurements, which were 95% and 18 in that case. Assume that there are only 14 measurements, but you have exactly the same average and standard deviation that you calculated on the previous problem. Go to section 3.6.12 of the book referenced in problem 1, read that section if necessary, and determine what the coefficient should be for this smaller data set. (In other words, what should “2.101” be replaced with in this new data set with only 14 samples.)

In: Statistics and Probability

Word Problems with a Sample Data Set: A sample of Full-time SUNY Poly students were asked...

Word Problems with a Sample Data Set:

  1. A sample of Full-time SUNY Poly students were asked how many credit hours they were taking this semester and the following information was obtained. Assume credit hours are approximately normally distributed.

16        18        12        17        20        18        15        14

*Round all final answers to 2 decimal places if necessary

  1. Construct a 90% confidence interval for the population mean
  1. Construct a 95% confidence interval for the population standard deviation
  1. SUNY Poly claims that all their students average less than 18 credit hours per semester. Test SUNY’s claim at α = 0.05, what can you conclude about the claim?

In: Statistics and Probability

A company wants to apply k-NN to find out whether new customers response to its marketing...

  1. A company wants to apply k-NN to find out whether new customers response to its marketing campaign or not. We have data of six existing customers and their response. In the table below, we list their response and their distances to two new customers (Jack and Colleen).

Customer

Response

Distance to Jack

Distance to Colleen

Janet

Yes

1.7

1.3

Adam

Yes

1.6

1.3

Tom

No

3.8

0.9

Sarah

Yes

2.4

1.2

Nancy

No

3.7

0.9

Joseph

No

2.5

1.5

What will be the predicted responses of Jack and Colleen using k-NN when k is set to the following values? Justify your answer.

  1. k = 1
  2. k = 3
  3. k = 5

In: Statistics and Probability

A box in a certain supply room contains four 30W lightbulbs, five 60W lightbulbs, and six...

A box in a certain supply room contains four 30W lightbulbs, five 60W lightbulbs, and six 70W lightbulbs. Suppose that four lightbulbs are randomly selected. 1) What is the probability that at least one lightbulb of each type is selected? 2)Find the expected number of lightbulbs (incorrect ones) that need to be selected before a 60W or 70W lightbulb is selected. Show this by building a probability table.

In: Statistics and Probability

González Industries requires its sales personnel to keep track of their weekly contacts with customers. A...

González Industries requires its sales personnel to keep track of their weekly contacts with customers. A sample of 16 reports showed a mean of 32.4 customer contacts per week for the sales personnel, and a sample standard deviation of 5.7 contacts. Assuming customer contacts is a normally distributed variable, generate a 95% confidence interval estimate of the true mean number of customer contacts per week at González Industries. Begin by stating whether this estimation problem should use the student t distribution or the normal (Z) distribution. Should the t distribution be used?

If you indicated the t distribution should be used, give the value that should be used here. If you said, "no", then indicate which Z value should be used.

What is the lower limit of the confidence interval?

What is the upper limit of the confidence interval?

In: Statistics and Probability

Assume a binomial probability distribution has p = 0.60 and n = 300. (a) What are...

Assume a binomial probability distribution has

p = 0.60

and

n = 300.

(a)

What are the mean and standard deviation? (Round your answers to two decimal places.)

mean standard deviation

(b)

Is this situation one in which binomial probabilities can be approximated by the normal probability distribution? Explain.

No, because np ≥ 5 and n(1 − p) ≥ 5. Yes, because n ≥ 30.     Yes, because np ≥ 5 and n(1 − p) ≥ 5. No, because np < 5 and n(1 − p) < 5. Yes, because np < 5 and n(1 − p) < 5.

(c)

What is the probability of 160 to 170 successes? Use the normal approximation of the binomial distribution to answer this question. (Round your answer to four decimal places.)

(d)

What is the probability of 190 or more successes? Use the normal approximation of the binomial distribution to answer this question. (Round your answer to four decimal places.)

(e)

What is the advantage of using the normal probability distribution to approximate the binomial probabilities?

The advantage would be that using the normal probability distribution to approximate the binomial probabilities increases the number of calculations. The advantage would be that using the normal probability distribution to approximate the binomial probabilities makes the calculations less accurate.     The advantage would be that using the normal probability distribution to approximate the binomial probabilities makes the calculations more accurate. The advantage would be that using the the normal probability distribution to approximate the binomial probabilities reduces the number of calculations.

How would you calculate the probability in part (d) using the binomial distribution. (Use f(x) to denote the binomial probability function.)

P(x ≥ 190) = f(191) + f(192) + f(193) + f(194) +    + f(300)

P(x ≥ 190) = f(190) + f(191) + f(192) + f(193) +    + f(300)

    

P(x ≥ 190) = f(0) + f(1) +    + f(188) + f(189)

P(x ≥ 190) = f(0) + f(1) +    + f(189) + f(190)

P(x ≥ 190) = 1 − f(189) − f(190) − f(191) − f(192) −    − f(300)

In: Statistics and Probability

One pharmaceutical company claims their flu vaccine reduces the risk of illness by 40%. One political...

One pharmaceutical company claims their flu vaccine reduces the risk of illness by 40%. One political pundit believes that the real percentage is less. He does a study and the conclusion of the statistical test is to reject the null hypnosis. According to the CDC recent studies show that flu vaccinations reduce the risk of illness by between 40% and 60%. Given this information answer the following questions.

(a) State the null hypothesis as a mathematical statement.

(b) State the alternative hypothesis as a mathematical statement. (c)Was a correct decision made? Why or why not?

(d) If an error was made state what type of error. If no error was made state ”correct decision”.

In: Statistics and Probability