Questions
A researcher would like to predict the dependent variable Y from the two independent variables X1...

A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=12 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05.

   X1          X2         Y

34.4 26.4 59.4
53.7 38.3 90.4
72.8 43.2 71.3
25.4 21.2 64.5
75.9 46.5 71.1
60.4 27.9 72.6
28 56.4 29.9
40.1 43.6 53.7
27.2 64.5 61.5
48.2 60 43.8
78.6 53.6 53.5
69 43.9 85.5

R2=
F=
P-value for overall model =

t1=
for b1, P-value =
t2=
for b2, P-value =

What is your conclusion for the overall regression model (also called the omnibus test)?

  • The overall regression model is statistically significant at α=0.05
  • The overall regression model is not statistically significant at α=0.05


Which of the regression coefficients are statistically different from zero?

  • neither regression coefficient is statistically significant
  • the slope for the first variable b1 is the only statistically significant coefficient
  • the slope for the second variable b2 is the only statistically significant coefficient
  • both regression coefficients are statistically significant

In: Statistics and Probability

2. Given the following descriptive statistics, N=18 X bar = 11.5 S sub x = 2...

2. Given the following descriptive statistics,

N=18

X bar = 11.5

S sub x = 2

a. find the 95% confidence interval of the population mean u.

b. Suppose you know the population standard deviation = 2. What is the minimum sample size that makes the confidence interval length less than 1?

In: Statistics and Probability

Q4) A desk lamp produced by The Luminar Company was found to be defective (D). There...

Q4) A desk lamp produced by The Luminar Company was found to be defective (D). There are three factories (A, B, C) where such desk lamps are manufactured. A Quality Control Manager is responsible for investigating the source of found defects. They know that factory A supplies 50% of the total lamps, factory B supplies 30% of the total lamps, and factory C supplies 20% of the total lamps. They also know that on average, 2.0% or factory A’s lamps are defective, 1.0% of factory B’s lamps are defective, and 3.0% of factory C’s lamps are defective. If a randomly selected lamp is defective, what is the probability that it was manufactured in factory A? If a randomly selected lamp is defective, what is the probability that it was manufactured in factory B? If a randomly selected lamp is defective, what is the probability that it was manufactured in factory C? If a randomly selected lamp is not defective, what is the probability that it was manufactured in factory A?

In: Statistics and Probability

Annual high temperatures in a certain location have been tracked for several years. Let X represent...

Annual high temperatures in a certain location have been tracked for several years. Let X represent the year and Y the high temperature. At the 0.01 significance level, does the data below show significant (linear) correlation between X and Y?

x y
5 36.63
6 35.9
7 35.77
8 38.64
9 30.51
10 29.68
11 30.35
12 21.42
13 19.99
14 18.26
15 12.03
16 15.8
  • Yes, significant correlation
  • No

In: Statistics and Probability

A researcher would like to predict the dependent variable Y from the two independent variables X1...

A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=16 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05.

X1             X2             Y

48 42.3 47.1
36.3 58.7 65.4
43.4 40.2 63.6
49.5 37.9 45.6
45.5 37.2 50.8
40.6 64.7 42.4
42.5 46.7 63.1
42.7 40 35.8
55.8 10.6 52.1
40.9 63 60.3
39.6 56.5 44
43.5 45.1 61.2
39 68.8 67.2
50.4 43.7 40.6
46.1 42.6 58
55.2 19.1 49.1

SSreg=
SSres=
R2=
F=
P-value =

What is your decision for the hypothesis test?

  • Reject the null hypothesis, H0:β1=β2=0
  • Fail to reject H0


What is your final conclusion?

  • The evidence supports the claim that one or more of the regression coefficients is non-zero
  • The evidence supports the claim that all of the regression coefficients are zero
  • There is insufficient evidence to support the claim that at least one of the regression coefficients is non-zero
  • There is insufficient evidence to support the claim that all of the regression coefficients are equal to zero

In: Statistics and Probability

Dr. Stambaugh hypothesizes that a novel anxiety drug will affect memory. To examine this, the drug...

Dr. Stambaugh hypothesizes that a novel anxiety drug will affect memory. To examine this, the drug is administered to a random sample of 23 trained rats. The rats were then placed in a maze and timed on how long it took them to complete it. The average completion time was 47 seconds. Normal rats have an average completion time of 44 seconds with a standard deviation of 14 seconds. What can Dr. Stambaugh conclude with α = 0.01?

a) State the appropiate appropriate test statistic (na, z-test, one sample t test, independent- samples t test, related sample t test), population (normal rats, memory, trained rats, time to complete maze, the maze), and sample (normal rats, memory, trained rats, time to complete maze, the maze).
.


b) Compute the appropriate test statistic(s) to make a decision about H0.
(Hint: Make sure to write down the null and alternative hypotheses to help solve the problem.)
critical value =  ; test statistic =  
Decision:  ---Select--- Reject H0 Fail to reject H0

c) If appropriate, compute the CI. If not appropriate, input "na" for both spaces below.
[  ,  ]

d) Compute the corresponding effect size(s) and indicate magnitude(s).
If not appropriate, input and select "na" below.
d =  ;   ---Select--- na, trivial effect ,small effect ,medium effect , or large effect
r2 =  ;   ---Select--- na trivial effect small effect medium effect large effect

Now, Make an interpretation based on the results.

Rats on the anxiety drug were significantly slower completing the maze.

Rats on the anxiety drug were significantly quicker completing the maze.    T

he anxiety drug did not affect rats in completing the maze.

In: Statistics and Probability

9.3.3 A study was conducted that measured the total brain volume (TBV) (in mm3) of patients...

9.3.3

A study was conducted that measured the total brain volume (TBV) (in mm3) of patients that had schizophrenia and patients that are considered normal. Table #9.3.5 contains the TBV of the normal patients and table #9.3.6 contains the TBV of schizophrenia patients ("SOCR data oct2009," 2013). Is there enough evidence to show that the patients with schizophrenia have less TBV on average than a patient that is considered normal? Test at the 10% level.

Table #9.3.5: Total Brain Volume (in mm3) of Normal Patients

1663407

1583940

1299470

1535137

1431890

1578698

1453510

1650348

1288971

1366346

1326402

1503005

1474790

1317156

1441045

1463498

1650207

1523045

1441636

1432033

1420416

1480171

1360810

1410213

1574808

1502702

1203344

1319737

1688990

1292641

1512571

1635918

Table #9.3.6: Total Brain Volume (in mm3) of Schizophrenia Patients

1331777

1487886

1066075

1297327

1499983

1861991

1368378

1476891

1443775

1337827

1658258

1588132

1690182

1569413

1177002

1387893

1483763

1688950

1563593

1317885

1420249

1363859

1238979

1286638

1325525

1588573

1476254

1648209

1354054

1354649

1636119

9.3.4

A study was conducted that measured the total brain volume (TBV) (in mm3) of patients that had schizophrenia and patients that are considered normal. Table #9.3.5 contains the TBV of the normal patients and table #9.3.6 contains the TBV of schizophrenia patients ("SOCR data oct2009," 2013). Compute a 90% confidence interval for the difference in TBV of normal patients and patients with Schizophrenia.

9.3.8

The number of cell phones per 100 residents in countries in Europe is given in table #9.3.9 for the year 2010. The number of cell phones per 100 residents in countries of the Americas is given in table #9.3.10 also for the year 2010 ("Population reference bureau," 2013). Find the 98% confidence interval for the different in mean number of cell phones per 100 residents in Europe and the Americas.

Table #9.3.9: Number of Cell Phones per 100 Residents in Europe

100

76

100

130

75

84

112

84

138

133

118

134

126

188

129

93

64

128

124

122

109

121

127

152

96

63

99

95

151

147

123

95

67

67

118

125

110

115

140

115

141

77

98

102

102

112

118

118

54

23

121

126

47

Table #9.3.10: Number of Cell Phones per 100 Residents in the Americas

158

117

106

159

53

50

78

66

88

92

42

3

150

72

86

113

50

58

70

109

37

32

85

101

75

69

55

115

95

73

86

157

100

119

81

113

87

105

96

In: Statistics and Probability

In the following problem, check that it is appropriate to use the normal approximation to the...

In the following problem, check that it is appropriate to use the normal approximation to the binomial. Then use the normal distribution to estimate the requested probabilities.

Ocean fishing for billfish is very popular in the Cozumel region of Mexico. In the Cozumel region about 46% of strikes (while trolling) resulted in a catch. Suppose that on a given day a fleet of fishing boats got a total of 26 strikes. Find the following probabilities. (Round your answers to four decimal places.)

(a) 12 or fewer fish were caught


(b) 5 or more fish were caught


(c) between 5 and 12 fish were caught

In: Statistics and Probability

The following data show the brand, price, and the overall score for six stereo headphones that...

The following data show the brand, price, and the overall score for six stereo headphones that were tested by consumer reports. The overall score is based on sound quality and effectiveness of ambient noise reduction. Scores range from 0 to 100.

Price Score

Bose 180 76

Skullcandy 150 71

Koss 85 61

Phillips/O'Neill 70 56

Denon 70 40

JVC 35 26

a). Test for a significant relationship using the F test. What is your conclusion? Use α= 0.05

b). Develop a point estimate of the score for a headphone with a price of 100.

c). Develop a 95% confidence interval for the mean score for all headphones with a price of 100.

In: Statistics and Probability

Suppose the life of a particular brand of calculator battery is approximately normally distributed with a...

Suppose the life of a particular brand of calculator battery is approximately normally distributed with a mean of 80 hours and a standard deviation of 9 hours. Complete parts a through c.

a. What is the probability that a single battery randomly selected from the population will have a life between 7575 and 85 ​hours?

b. What is the probability that 99 randomly sampled batteries from the population will have a sample mean life of between 75 and 85 ​hours?

c. If the manufacturer of the battery is able to reduce the standard deviation of battery life from 9 to 7 ​hours, what would be the probability that 9 batteries randomly sampled from the population will have a sample mean life of between 75 and 85 ​hours?

In: Statistics and Probability

Kilgore’s Deli is a small delicatessen located near a major university. Kilgore’s does a large walk-in...

Kilgore’s Deli is a small delicatessen located near a major university. Kilgore’s does a large walk-in carry-out lunch business. The deli offers two luncheon chili specials, Wimpy and Dial 911. At the beginning of the day, Kilgore needs to decide how much of each special to make (he always sells out of whatever he makes). The profit on one serving of Wimpy is $0.42, on one serving of Dial 911, $0.55. Each serving of Wimpy requires 0.22 pound of beef, 0.22 cup of onions, and 5 ounces of Kilgore’s special sauce. Each serving of Dial 911 requires 0.22 pound of beef, 0.37 cup of onions, 2 ounces of Kilgore’s special sauce, and 5 ounces of hot sauce. Today, Kilgore has 17 pounds of beef, 12 cups of onions, 86 ounces of Kilgore’s special sauce, and 57 ounces of hot sauce on hand.

A) Develop a linear programming model that will tell Kilgore how many servings of Wimpy and Dial 911 to make in order to maximize his profit today. If required, round your answers to two decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank. (Example: -300) Let W = number of servings of Wimpy to make D = number of servings of Dial 911 to make

Max W + D
s.t.
W + D (Beef)
W + D (Onions)
W + D (Special Sauce)
W + D (Hot Sauce)
W, D 0

B) Find an optimal solution. If required, round your answers to two decimal places.

W = , D = , Profit = $

C) What is the shadow price for special sauce? If required, round your answers to two decimal places. $ Interpret the shadow price.

D) The input in the box below will not be graded, but may be reviewed and considered by your instructor. Increase the amount of special sauce available by 1 ounce and re-solve. If required, round your answers to two decimal places.

W = , D = , Profit = $

In: Statistics and Probability

At the .05 significance level, does the data below show significant correlation? x y 5 11.85...

At the .05 significance level, does the data below show significant correlation?

x y
5 11.85
6 11.72
7 17.09
8 13.46
9 23.33
10 14.9
11 22.07
  • Yes, significant correlation
  • No

In: Statistics and Probability

1) TRUE OR FALSE: Parameters yield populations 2) TRUE OR FALSE: In a study entitled the...

1) TRUE OR FALSE: Parameters yield populations

2) TRUE OR FALSE: In a study entitled the relationship between study time and grades, study time is the predictor variable.

3) TRUE OR FALSE: Only experiments involve direct measurement by the researcher.

4) A response variable is also an outcome variable.

5) Statistical methods that use sample data to make statements about populations are called inferential statistics.

6) The IV is the variable that can be randomly assigned by the researcher.

In: Statistics and Probability

We wish to predict the salary for baseball players (y) using the variables RBI (x1) and...

We wish to predict the salary for baseball players (y) using the variables RBI (x1) and HR (x2), then we use a regression equation of the form ˆy=b0+b1x1+b2x2.

  • HR - Home runs - hits on which the batter successfully touched all four bases, without the contribution of a fielding error.
  • RBI - Run batted in - number of runners who scored due to a batters' action, except when batter grounded into double play or reached on an error
  • Salary is in millions of dollars.

The following is a chart of baseball players' salaries and statistics from 2016.

Player Name RBI's HR's Salary (in millions)
Miquel Cabrera 108 38 28.050
Yoenis Cespedes 86 31 27.500
Ryan Howard 59 25 25.000
Albert Pujols 119 31 25.000
Robinson Cano 103 39 24.050
Mark Teixeira 44 15 23.125
Joe Mauer 49 11 23.000
Hanley Ramirez 111 30 22.750
Justin Upton 87 31 22.125
Adrian Gonzalez 90 18 21.857
Jason Heyward 49 7 21.667
Jayson Werth 70 21 21.571
Matt Kemp 108 35 21.500
Jacoby Ellsbury 56 9 21.143
Chris Davis 84 38 21.119
Buster Posey 80 14 20.802
Shin-Soo Choo 17 7 20.000
Troy Tulowitzki 79 24 20.000
Ryan Braun 91 31 20.000
Joey Votto 97 29 20.000
Hunter Pence 57 13 18.500
Prince Fielder 44 8 18.000
Adrian Beltre 104 32 18.000
Victor Martinez 86 27 18.000
Carlos Gonzalez 100 25 17.454
Matt Holliday 62 20 17.000
Brian McCann 58 20 17.000
Mike Trout 100 29 16.083
David Ortiz 127 38 16.000
Adam Jones 83 29 16.000
Curtis Granderson 59 30 16.000
Colby Rasmus 54 15 15.800
Matt Wieters 66 17 15.800
J.D. Martinez 68 22 6.750
Brandon Crawford 84 12 6.000
Rajai Davis 48 12 5.950
Aaron Hill 38 10 12.000
Coco Crisp 55 13 11.000
Ben Zobrist 76 18 10.500
Justin Turner 90 27 5.100
Denard Span 53 11 5.000
Chris Iannetta 24 7 4.550
Leonys Martin 47 15 4.150
Justin Smoak 34 14 3.900
Jorge Soler 31 12 3.667
Evan Gattis 72 32 3.300
Logan Forsythe 52 20 2.750
Jean Segura 64 20 2.600

a) Use software to find the multiple linear regression equation. Enter the coefficients rounded to 4 decimal places.
ˆy= ______ + _____ x1 + ______ x2


b) Use the multiple linear regression equation to predict the salary for a baseball player with an RBI of 31 and HR of 20. Round your answer to 1 decimal place, do not convert numbers to dollars.
      ________ millions of dollars

c) Holding all other variables constant, what is the correct interpretation of the coefficient b1=0.111 in the multiple linear regression equation?

  • For each HR, a baseball player's predicted salary increases by 0.111 million dollars.
  • For each RBI, a baseball player's predicted salary increases by 0.111 million dollars.
  • If the baseball player's salary increases by 0.111 million dollars, then the predicted RBI will increase by one.
  • If the baseball player's salary increases by 0.111 million dollars, then the predicted RBI will increase by 0.0371.

d) Holding all other variables constant, what is the correct interpretation of the coefficient b2=0.0371 in the multiple linear regression equation?

  • If the baseball player's salary increases by 0.0371 million dollars, then the predicted HR will increase by one.
  • If the baseball player's salary increases by 0.0371 million dollars, then the predicted HR will increase by 0.111.
  • For each RBI, a baseball player's predicted salary increases by 0.0371 million dollars.
  • For each HR, a baseball player's predicted salary increases by 0.0371 million dollars.

In: Statistics and Probability

Problem 3-25 (Algorithmic) The College Board National Office recently reported that in 2011–2012, the 547,038 high...

Problem 3-25 (Algorithmic) The College Board National Office recently reported that in 2011–2012, the 547,038 high school juniors who took the ACT achieved a mean score of 540 with a standard deviation of 122 on the mathematics portion of the test (http://media.collegeboard.com/digitalServices/pdf/research/2013/TotalGroup-2013.pdf). Assume these test scores are normally distributed.

What is the probability that a high school junior who takes the test will score no higher than 470 on the mathematics portion of the test? If required, round your answer to four decimal places.

P (x ≤ 470) =

What is the probability that a high school junior who takes the test will score between 470 and 550 on the mathematics portion of the test? If required, round your answer to four decimal places.

P (470 ≤ x ≤ 550) =

In: Statistics and Probability