A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=12 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05.
X1 X2 Y
34.4 | 26.4 | 59.4 |
53.7 | 38.3 | 90.4 |
72.8 | 43.2 | 71.3 |
25.4 | 21.2 | 64.5 |
75.9 | 46.5 | 71.1 |
60.4 | 27.9 | 72.6 |
28 | 56.4 | 29.9 |
40.1 | 43.6 | 53.7 |
27.2 | 64.5 | 61.5 |
48.2 | 60 | 43.8 |
78.6 | 53.6 | 53.5 |
69 | 43.9 | 85.5 |
R2=
F=
P-value for overall model =
t1=
for b1, P-value =
t2=
for b2, P-value =
What is your conclusion for the overall regression model (also
called the omnibus test)?
Which of the regression coefficients are statistically different
from zero?
In: Statistics and Probability
2. Given the following descriptive statistics,
N=18
X bar = 11.5
S sub x = 2
a. find the 95% confidence interval of the population mean u.
b. Suppose you know the population standard deviation = 2. What is the minimum sample size that makes the confidence interval length less than 1?
In: Statistics and Probability
Q4) A desk lamp produced by The Luminar Company was found to be defective (D). There are three factories (A, B, C) where such desk lamps are manufactured. A Quality Control Manager is responsible for investigating the source of found defects. They know that factory A supplies 50% of the total lamps, factory B supplies 30% of the total lamps, and factory C supplies 20% of the total lamps. They also know that on average, 2.0% or factory A’s lamps are defective, 1.0% of factory B’s lamps are defective, and 3.0% of factory C’s lamps are defective. If a randomly selected lamp is defective, what is the probability that it was manufactured in factory A? If a randomly selected lamp is defective, what is the probability that it was manufactured in factory B? If a randomly selected lamp is defective, what is the probability that it was manufactured in factory C? If a randomly selected lamp is not defective, what is the probability that it was manufactured in factory A?
In: Statistics and Probability
Annual high temperatures in a certain location have been tracked for several years. Let X represent the year and Y the high temperature. At the 0.01 significance level, does the data below show significant (linear) correlation between X and Y?
x | y |
---|---|
5 | 36.63 |
6 | 35.9 |
7 | 35.77 |
8 | 38.64 |
9 | 30.51 |
10 | 29.68 |
11 | 30.35 |
12 | 21.42 |
13 | 19.99 |
14 | 18.26 |
15 | 12.03 |
16 | 15.8 |
In: Statistics and Probability
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=16 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05.
X1 X2 Y
48 | 42.3 | 47.1 |
36.3 | 58.7 | 65.4 |
43.4 | 40.2 | 63.6 |
49.5 | 37.9 | 45.6 |
45.5 | 37.2 | 50.8 |
40.6 | 64.7 | 42.4 |
42.5 | 46.7 | 63.1 |
42.7 | 40 | 35.8 |
55.8 | 10.6 | 52.1 |
40.9 | 63 | 60.3 |
39.6 | 56.5 | 44 |
43.5 | 45.1 | 61.2 |
39 | 68.8 | 67.2 |
50.4 | 43.7 | 40.6 |
46.1 | 42.6 | 58 |
55.2 | 19.1 | 49.1 |
SSreg=
SSres=
R2=
F=
P-value =
What is your decision for the hypothesis test?
What is your final conclusion?
In: Statistics and Probability
Dr. Stambaugh hypothesizes that a novel anxiety drug will affect
memory. To examine this, the drug is administered to a random
sample of 23 trained rats. The rats were then placed in a maze and
timed on how long it took them to complete it. The average
completion time was 47 seconds. Normal rats have an average
completion time of 44 seconds with a standard deviation of 14
seconds. What can Dr. Stambaugh conclude with α = 0.01?
a) State the appropiate appropriate test statistic
(na, z-test, one sample t test, independent- samples t test,
related sample t test), population (normal rats, memory, trained
rats, time to complete maze, the maze), and sample (normal rats,
memory, trained rats, time to complete maze, the maze).
.
b) Compute the appropriate test statistic(s) to
make a decision about H0.
(Hint: Make sure to write down the null and alternative hypotheses
to help solve the problem.)
critical value = ; test statistic =
Decision: ---Select--- Reject H0 Fail to reject H0
c) If appropriate, compute the CI. If not
appropriate, input "na" for both spaces below.
[ , ]
d) Compute the corresponding effect size(s) and
indicate magnitude(s).
If not appropriate, input and select "na" below.
d = ; ---Select--- na, trivial effect ,small
effect ,medium effect , or large effect
r2 = ; ---Select--- na trivial
effect small effect medium effect large effect
Now, Make an interpretation based on the
results.
Rats on the anxiety drug were significantly slower completing the maze.
Rats on the anxiety drug were significantly quicker completing the maze. T
he anxiety drug did not affect rats in completing the maze.
In: Statistics and Probability
9.3.3
A study was conducted that measured the total brain volume (TBV) (in mm3) of patients that had schizophrenia and patients that are considered normal. Table #9.3.5 contains the TBV of the normal patients and table #9.3.6 contains the TBV of schizophrenia patients ("SOCR data oct2009," 2013). Is there enough evidence to show that the patients with schizophrenia have less TBV on average than a patient that is considered normal? Test at the 10% level.
Table #9.3.5: Total Brain Volume (in mm3) of Normal Patients
1663407 |
1583940 |
1299470 |
1535137 |
1431890 |
1578698 |
1453510 |
1650348 |
1288971 |
1366346 |
1326402 |
1503005 |
1474790 |
1317156 |
1441045 |
1463498 |
1650207 |
1523045 |
1441636 |
1432033 |
1420416 |
1480171 |
1360810 |
1410213 |
1574808 |
1502702 |
1203344 |
1319737 |
1688990 |
1292641 |
1512571 |
1635918 |
Table #9.3.6: Total Brain Volume (in mm3) of Schizophrenia Patients
1331777 |
1487886 |
1066075 |
1297327 |
1499983 |
1861991 |
1368378 |
1476891 |
1443775 |
1337827 |
1658258 |
1588132 |
1690182 |
1569413 |
1177002 |
1387893 |
1483763 |
1688950 |
1563593 |
1317885 |
1420249 |
1363859 |
1238979 |
1286638 |
1325525 |
1588573 |
1476254 |
1648209 |
1354054 |
1354649 |
1636119 |
9.3.4
A study was conducted that measured the total brain volume (TBV) (in mm3) of patients that had schizophrenia and patients that are considered normal. Table #9.3.5 contains the TBV of the normal patients and table #9.3.6 contains the TBV of schizophrenia patients ("SOCR data oct2009," 2013). Compute a 90% confidence interval for the difference in TBV of normal patients and patients with Schizophrenia.
9.3.8
The number of cell phones per 100 residents in countries in Europe is given in table #9.3.9 for the year 2010. The number of cell phones per 100 residents in countries of the Americas is given in table #9.3.10 also for the year 2010 ("Population reference bureau," 2013). Find the 98% confidence interval for the different in mean number of cell phones per 100 residents in Europe and the Americas.
Table #9.3.9: Number of Cell Phones per 100 Residents in Europe
100 |
76 |
100 |
130 |
75 |
84 |
112 |
84 |
138 |
133 |
118 |
134 |
126 |
188 |
129 |
93 |
64 |
128 |
124 |
122 |
109 |
121 |
127 |
152 |
96 |
63 |
99 |
95 |
151 |
147 |
123 |
95 |
67 |
67 |
118 |
125 |
110 |
115 |
140 |
115 |
141 |
77 |
98 |
102 |
102 |
112 |
118 |
118 |
54 |
23 |
121 |
126 |
47 |
Table #9.3.10: Number of Cell Phones per 100 Residents in the Americas
158 |
117 |
106 |
159 |
53 |
50 |
78 |
66 |
88 |
92 |
42 |
3 |
150 |
72 |
86 |
113 |
50 |
58 |
70 |
109 |
37 |
32 |
85 |
101 |
75 |
69 |
55 |
115 |
95 |
73 |
86 |
157 |
100 |
119 |
81 |
113 |
87 |
105 |
96 |
In: Statistics and Probability
In the following problem, check that it is appropriate to use
the normal approximation to the binomial. Then use the normal
distribution to estimate the requested probabilities.
Ocean fishing for billfish is very popular in the Cozumel region of
Mexico. In the Cozumel region about 46% of strikes (while trolling)
resulted in a catch. Suppose that on a given day a fleet of fishing
boats got a total of 26 strikes. Find the following probabilities.
(Round your answers to four decimal places.)
(a) 12 or fewer fish were caught
(b) 5 or more fish were caught
(c) between 5 and 12 fish were caught
In: Statistics and Probability
The following data show the brand, price, and the overall score for six stereo headphones that were tested by consumer reports. The overall score is based on sound quality and effectiveness of ambient noise reduction. Scores range from 0 to 100.
Price Score
Bose 180 76
Skullcandy 150 71
Koss 85 61
Phillips/O'Neill 70 56
Denon 70 40
JVC 35 26
a). Test for a significant relationship using the F test. What is your conclusion? Use α= 0.05
b). Develop a point estimate of the score for a headphone with a price of 100.
c). Develop a 95% confidence interval for the mean score for all headphones with a price of 100.
In: Statistics and Probability
Suppose the life of a particular brand of calculator battery is approximately normally distributed with a mean of 80 hours and a standard deviation of 9 hours. Complete parts a through c.
a. What is the probability that a single battery randomly selected from the population will have a life between 7575 and 85 hours?
b. What is the probability that 99 randomly sampled batteries from the population will have a sample mean life of between 75 and 85 hours?
c. If the manufacturer of the battery is able to reduce the standard deviation of battery life from 9 to 7 hours, what would be the probability that 9 batteries randomly sampled from the population will have a sample mean life of between 75 and 85 hours?
In: Statistics and Probability
Kilgore’s Deli is a small delicatessen located near a major university. Kilgore’s does a large walk-in carry-out lunch business. The deli offers two luncheon chili specials, Wimpy and Dial 911. At the beginning of the day, Kilgore needs to decide how much of each special to make (he always sells out of whatever he makes). The profit on one serving of Wimpy is $0.42, on one serving of Dial 911, $0.55. Each serving of Wimpy requires 0.22 pound of beef, 0.22 cup of onions, and 5 ounces of Kilgore’s special sauce. Each serving of Dial 911 requires 0.22 pound of beef, 0.37 cup of onions, 2 ounces of Kilgore’s special sauce, and 5 ounces of hot sauce. Today, Kilgore has 17 pounds of beef, 12 cups of onions, 86 ounces of Kilgore’s special sauce, and 57 ounces of hot sauce on hand.
A) Develop a linear programming model that will tell Kilgore how many servings of Wimpy and Dial 911 to make in order to maximize his profit today. If required, round your answers to two decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank. (Example: -300) Let W = number of servings of Wimpy to make D = number of servings of Dial 911 to make
Max | W | + | D | |||
s.t. | ||||||
W | + | D | ≤ | (Beef) | ||
W | + | D | ≤ | (Onions) | ||
W | + | D | ≤ | (Special Sauce) | ||
W | + | D | ≤ | (Hot Sauce) | ||
W, D | ≥ | 0 |
B) Find an optimal solution. If required, round your answers to two decimal places.
W = , D = , Profit = $
C) What is the shadow price for special sauce? If required, round your answers to two decimal places. $ Interpret the shadow price.
D) The input in the box below will not be graded, but may be reviewed and considered by your instructor. Increase the amount of special sauce available by 1 ounce and re-solve. If required, round your answers to two decimal places.
W = , D = , Profit = $
In: Statistics and Probability
At the .05 significance level, does the data below show significant correlation?
x | y |
---|---|
5 | 11.85 |
6 | 11.72 |
7 | 17.09 |
8 | 13.46 |
9 | 23.33 |
10 | 14.9 |
11 | 22.07 |
In: Statistics and Probability
1) TRUE OR FALSE: Parameters yield populations
2) TRUE OR FALSE: In a study entitled the relationship between study time and grades, study time is the predictor variable.
3) TRUE OR FALSE: Only experiments involve direct measurement by the researcher.
4) A response variable is also an outcome variable.
5) Statistical methods that use sample data to make statements about populations are called inferential statistics.
6) The IV is the variable that can be randomly assigned by the researcher.
In: Statistics and Probability
We wish to predict the salary for baseball players (y) using the variables RBI (x1) and HR (x2), then we use a regression equation of the form ˆy=b0+b1x1+b2x2.
The following is a chart of baseball players' salaries and statistics from 2016.
Player Name | RBI's | HR's | Salary (in millions) |
---|---|---|---|
Miquel Cabrera | 108 | 38 | 28.050 |
Yoenis Cespedes | 86 | 31 | 27.500 |
Ryan Howard | 59 | 25 | 25.000 |
Albert Pujols | 119 | 31 | 25.000 |
Robinson Cano | 103 | 39 | 24.050 |
Mark Teixeira | 44 | 15 | 23.125 |
Joe Mauer | 49 | 11 | 23.000 |
Hanley Ramirez | 111 | 30 | 22.750 |
Justin Upton | 87 | 31 | 22.125 |
Adrian Gonzalez | 90 | 18 | 21.857 |
Jason Heyward | 49 | 7 | 21.667 |
Jayson Werth | 70 | 21 | 21.571 |
Matt Kemp | 108 | 35 | 21.500 |
Jacoby Ellsbury | 56 | 9 | 21.143 |
Chris Davis | 84 | 38 | 21.119 |
Buster Posey | 80 | 14 | 20.802 |
Shin-Soo Choo | 17 | 7 | 20.000 |
Troy Tulowitzki | 79 | 24 | 20.000 |
Ryan Braun | 91 | 31 | 20.000 |
Joey Votto | 97 | 29 | 20.000 |
Hunter Pence | 57 | 13 | 18.500 |
Prince Fielder | 44 | 8 | 18.000 |
Adrian Beltre | 104 | 32 | 18.000 |
Victor Martinez | 86 | 27 | 18.000 |
Carlos Gonzalez | 100 | 25 | 17.454 |
Matt Holliday | 62 | 20 | 17.000 |
Brian McCann | 58 | 20 | 17.000 |
Mike Trout | 100 | 29 | 16.083 |
David Ortiz | 127 | 38 | 16.000 |
Adam Jones | 83 | 29 | 16.000 |
Curtis Granderson | 59 | 30 | 16.000 |
Colby Rasmus | 54 | 15 | 15.800 |
Matt Wieters | 66 | 17 | 15.800 |
J.D. Martinez | 68 | 22 | 6.750 |
Brandon Crawford | 84 | 12 | 6.000 |
Rajai Davis | 48 | 12 | 5.950 |
Aaron Hill | 38 | 10 | 12.000 |
Coco Crisp | 55 | 13 | 11.000 |
Ben Zobrist | 76 | 18 | 10.500 |
Justin Turner | 90 | 27 | 5.100 |
Denard Span | 53 | 11 | 5.000 |
Chris Iannetta | 24 | 7 | 4.550 |
Leonys Martin | 47 | 15 | 4.150 |
Justin Smoak | 34 | 14 | 3.900 |
Jorge Soler | 31 | 12 | 3.667 |
Evan Gattis | 72 | 32 | 3.300 |
Logan Forsythe | 52 | 20 | 2.750 |
Jean Segura | 64 | 20 | 2.600 |
a) Use software to find the multiple linear regression equation.
Enter the coefficients rounded to 4 decimal places.
ˆy= ______ + _____ x1 + ______ x2
b) Use the multiple linear regression equation to predict the
salary for a baseball player with an RBI of 31 and HR of 20. Round
your answer to 1 decimal place, do not convert numbers to
dollars.
________ millions of dollars
c) Holding all other variables constant, what is the correct interpretation of the coefficient b1=0.111 in the multiple linear regression equation?
d) Holding all other variables constant, what is the correct interpretation of the coefficient b2=0.0371 in the multiple linear regression equation?
In: Statistics and Probability
Problem 3-25 (Algorithmic) The College Board National Office recently reported that in 2011–2012, the 547,038 high school juniors who took the ACT achieved a mean score of 540 with a standard deviation of 122 on the mathematics portion of the test (http://media.collegeboard.com/digitalServices/pdf/research/2013/TotalGroup-2013.pdf). Assume these test scores are normally distributed.
What is the probability that a high school junior who takes the
test will score no higher than 470 on the mathematics portion of
the test? If required, round your answer to four decimal
places.
P (x ≤ 470) =
What is the probability that a high school junior who takes the
test will score between 470 and 550 on the mathematics portion of
the test? If required, round your answer to four decimal
places.
P (470 ≤ x ≤ 550) =
In: Statistics and Probability