a.
Which of the following is true?
Group of answer choices
Additional variables can add noise to the model that slightly increases R-squared
All the options are true
You can use multiple independent variables to predict a dependent variable
The R-Squared value is a measure of how good the model is.
b.This term refers to when two predictor variables are highly correlated with each other and so the effect of the variables on the dependent response is questionable.
In: Statistics and Probability
1. Find the (a) mean, (b) median, (c) mode, and (d) midrange for the given sample data.
An experiment was conducted to determine whether a deficiency of carbon dioxide in the soil affects the phenotype of peas. Listed below are the phenotype codes where 1=smooth-yellow,2=smooth-green, 3=wrinkled-yellow, and 4=wrinkled-green.
Do the results make sense?
3 |
1 |
3 |
4 |
4 |
1 |
2 |
4 |
1 |
4 |
3 |
3 |
3 |
3 |
(a) The mean phenotype code is _____.
2. Statistics are sometimes used to compare or identify authors of different works. The lengths of the first 10 words in a book by Terry are listed with the first 10 words in a book by David. Find the mean and median for each of the two samples, then compare the two sets of results.
Terry: |
2 |
2 |
2 |
11 |
8 |
9 |
2 |
6 |
3 |
3 |
|
---|---|---|---|---|---|---|---|---|---|---|---|
David: |
3 |
4 |
3 |
2 |
3 |
1 |
3 |
1 |
4 |
3 |
The mean number of letters per word in Terry's book is _____.
3. Refer to the data set of times, in minutes, required for an airplane to taxi out for takeoff, listed below. Find the mean and median. How is it helpful to find the mean?
36 |
35 |
25 |
11 |
26 |
29 |
30 |
32 |
19 |
39 |
45 |
31 |
|
31 |
14 |
40 |
27 |
24 |
48 |
10 |
43 |
18 |
30 |
45 |
31 |
|
13 |
34 |
17 |
16 |
23 |
40 |
47 |
31 |
27 |
34 |
17 |
47 |
|
28 |
35 |
26 |
44 |
14 |
43 |
30 |
14 |
30 |
18 |
38 |
13 |
Click the icon for the taxi out takeoff data.
Find the mean and median of the data set using a calculator or similar data analysis technology.
The mean of the data set is _____ minutes.
4. Find the mean of the data summarized in the given frequency distribution. Compare the computed mean to the actual mean of 50.4 miles per hour.
Speed (miles per hour) |
42−45 |
46−49 |
50−53 |
54−57 |
58−61 |
|
---|---|---|---|---|---|---|
Frequency |
29 |
12 |
6 |
3 |
2 |
The mean of the frequency distribution is _____ miles per hour.
5.Six different second-year medical students at Bellevue Hospital measured the blood pressure of the same person. The systolic readings (in mmHg) are listed below. Find the range, variance, and standard deviation for the given sample data. If the subject's blood pressure remains constant and the medical students correctly apply the same measurement technique, what should be the value of the standard deviation?
126 126 138 125 137 134
Range= ______ mmHg
In: Statistics and Probability
A fair die is rolled twice. Let X be the maximum of the two rolls. Find the distribution of X.
Let Y be the minimum of the two rolls. Find the variance of Y.
In: Statistics and Probability
A researcher would like to examine how the chemical tryptophan, contained in foods such as turkey, can affect mental alertness. A sample of n = 8 college students is obtained, and each student’s performance on a familiar video game (total points earned in the game) is measured before and after eating a traditional Thanksgiving dinner including roast turkey. The following table are the scores for each participant before and after the meal:
Participants: 1,2,3,4,5,6,7,8,
Before Meal (X1): 220,245,215,260,300,280,250,310
After Meal (X2) 210,220,195,265,275,290,220,285
a.For a two-tailed test, what is the null hypothesis using statistical notation?
b. For a two-tailed test, what is the alternative hypothesis using statistical notation?
c. What is the sum of the difference scores (D)?
d. What is the value of MD?
e. What is the value for Sum of Squares for the difference scores (SSD)?
f. What is the sample variance for the difference scores (sD 2 )?
g. What is estimated standard error (sMD)?
h. For a two-tailed test with α = .05, what is the value/s for tcrit?
i. What is the value for tobt?
j. Do these data indicate a significant difference between the treatments at the .05 level of significance? NOTE: simply writing that the effect is significant (e.g., only writing “reject the null”) without showing any work/calculations in parts a-e will result in point zero points.
k. Compute estimated Cohen’s d to measure the size of the treatment effect.
In: Statistics and Probability
Find the critical value or values of
2 based on the given information.
H0: = 8.0
n = 10
= 0.01
In: Statistics and Probability
Here is the data for our experiment.
The data are the SMUT scores of the students in each group. Notice that we have a different number (n) for the lecture group. This is to show you that we can have uneven sets of data for ANOVA. Note: If we were doing a real study, we would have larger n’s. Enter the data into the Excel spread sheet, SPSS or your calculator
Independent Variable | Data |
Computer (n1= 8) | 85, 87, 91, 78, 90, 83, 86, 89 |
Lecture (n2= 7) | 79, 76, 75, 79, 75, 72, 81, |
Cooperative (n3= 8) | 80, 99, 87, 84, 83, 77, 76, 80 |
This assignment is part of my ANOVA Exercise, I will please need help in completing it.
Thanks
In: Statistics and Probability
log(salary)=β0+β1log(sales)+β2roe+β3ros+u.
salary | sales | roe | ros |
1095 | 27595 | 14.1 | 191 |
1001 | 9958 | 10.9 | 13 |
1122 | 6125.9 | 23.5 | 14 |
578 | 16246 | 5.9 | -21 |
1368 | 21783.2 | 13.8 | 56 |
1145 | 6021.4 | 20 | 55 |
1078 | 2266.7 | 16.4 | 62 |
1094 | 2966.8 | 16.3 | 44 |
1237 | 4570.2 | 10.5 | 37 |
833 | 2830 | 26.3 | 37 |
567 | 596.8 | 25.9 | 109 |
933 | 19773 | 26.8 | -10 |
1339 | 40047 | 14.8 | 41 |
937 | 2513.8 | 22.3 | 44 |
2011 | 1580.6 | 56.3 | 63 |
1585 | 6754 | 12.6 | 17 |
905 | 1066.3 | 20.4 | 141 |
1058 | 3199.9 | 1.9 | -15 |
922 | 1452.7 | 19.9 | 56 |
1220 | 8995 | 15.4 | 28 |
1022 | 1212.3 | 38.7 | 83 |
759 | 2824.2 | 16.4 | 21 |
1414 | 7621 | 24.4 | -10 |
1041 | 4418.3 | 15.6 | 74 |
1688 | 12343 | 14.4 | 15 |
In: Statistics and Probability
Consider the following data drawn independently from normally
distributed populations: (You may find it useful to
reference the appropriate table: z table
or t table)
x−1x−1 = 25.7 | x⎯⎯2x¯2 = 30.6 |
σ12 = 98.2 | σ22 = 87.4 |
n1 = 20 | n2 = 25 |
a. Construct the 95% confidence interval for the
difference between the population means. (Negative values
should be indicated by a minus sign. Round all intermediate
calculations to at least 4 decimal places and final answers to 2
decimal places.)
Confidence internval is _____ to ____.
Consider the following competing hypotheses and accompanying
sample data drawn independently from normally distributed
populations. (You may find it useful to reference the
appropriate table: z table or t
table)
H0: μ1 −
μ2 = 0
HA: μ1 −
μ2 ≠ 0
x−1x−1 = 51 | x−2x−2 = 60 |
σ1 = 13.00 | σ2 = 1.64 |
n1 = 25 | n2 = 25 |
a-1. Calculate the value of the test statistic.
(Negative values should be indicated by a minus sign. Round
all intermediate calculations to at least 4 decimal places and
final answer to 2 decimal places.)
Test statistic:
Researchers at The Wharton School of Business have found that men and women shop for different reasons. While women enjoy the shopping experience, men are on a mission to get the job done. Men do not shop as frequently, but when they do, they make big purchases like expensive electronics. The accompanying table shows the amount spent (in $) over the weekend by 40 men and 60 women at a local mall. The Excel file is also provided. (You may find it useful to reference the appropriate table: z table or t table)
Spending by Men | Spending by Women | Spending by Men | Spending by Women |
85 | 90 | 87 | 38 |
102 | 79 | 92 | 66 |
139 | 71 | 92 | 100 |
90 | 119 | 72 | 57 |
89 | 90 | 97 | 59 |
52 | 180 | 83 | 89 |
49 | 88 | 118 | 95 |
140 | 56 | 108 | 37 |
90 | 110 | 104 | 86 |
64 | 82 | 110 | 62 |
96 | 64 | 66 | |
132 | 129 | 129 | |
117 | 28 | 119 | |
88 | 13 | 76 | |
92 | 140 | 75 | |
105 | 62 | 101 | |
95 | 32 | 85 | |
119 | 220 | 68 | |
118 | 72 | 67 | |
124 | 90 | 36 | |
131 | 80 | 90 | |
113 | 56 | 99 | |
124 | 82 | 64 | |
71 | 56 | 54 | |
115 | 88 | 86 | |
95 | 104 | 79 | |
102 | 54 | 82 | |
94 | 108 | 65 | |
111 | 86 | 110 | |
85 | 88 | 69 | |
Click here for the Excel Data File
Let µ1 represent the population mean amount
spent by men and µ2 represent the population
mean amount spent by women.
a. Specify the competing hypotheses that determine
if the mean amount spent by men is more than that by
women.
H0: μ1 − μ2 = 0; HA: μ1 − μ2 ≠ 0
H0: μ1 − μ2 ≥ 0; HA: μ1 − μ2 < 0
H0: μ1 − μ2 ≤ 0; HA: μ1 − μ2 > 0
b. Calculate the value of the test statistic.
Assume that the population variances are unknown but equal.
(Round intermediate calculations to at least 4 decimal
places and final answer to 2 decimal places.)
test statistic:
In: Statistics and Probability
Consider the following excerpts from a New York Times
article:
Despite its early promise … Restoration has had trouble becoming a
mass-market
player … What went wrong? High on its own buzz, the company
expanded at breakneck
speed, more than doubling the number of stores, to 94, in the year
and a half after the
stock offering … Company managers agree, for example, that
Restoration’s original
inventory system, which called for all furniture to be kept at
stores instead of a central
warehouse, was a disaster.
Let’s look at one Restoration Hardware product, a leather chair.
Average weekly sales
of this chair in each store is normally distributed with mean 1.25
units and standard
deviation 0.5 units. The replenishment lead time is 12 weeks. There
is information
system in place.
If each store holds its own inventory, then what is the company’s
average
inventory if the company policy is to target a 99.25 percent
in-stock probability?
Suppose Restoration Hardware builds a central warehouse to serve
the 94
stores. The lead time from the supplier to the central warehouse is
12 weeks.
The lead time from the central warehouse to each store is one week.
Suppose
the warehouse operates with a 99 percent in-stock probability, but
the stores
maintain a 99.25 percent in-stock probability. If only inventory at
the retail
stores is considered, what is Restoration’s average inventory?
In: Statistics and Probability
A popular theory is that presidential candidates have an advantage if they are taller than their main opponents. Listed are heights (in centimeters) of randomly selected presidents along with the heights of their main opponents. Complete parts (a) and (b) below. Height left parenthesis cm right parenthesis of President Height (cm) of President 189 189 174 174 181 181 180 180 187 187 166 166 Height left parenthesis cm right parenthesis of Main Opponent Height (cm) of Main Opponent 177 177 189 189 170 170 166 166 180 180 180 180 a. Use the sample data with a 0.05 0.05 significance level to test the claim that for the population of heights for presidents and their main opponents, the differences have a mean greater than 0 cm.
In: Statistics and Probability
A data set about speed dating includes "like" ratings of male dates made by the female dates. The summary statistics are
nequals=195195,
x overbarxequals=7.537.53,
sequals=1.971.97.
Use a
0.050.05
significance level to test the claim that the population mean of such ratings is less than
8.008.00.
Assume that a simple random sample has been selected. Identify the null and alternative hypotheses, test statistic, P-value, and state the final conclusion that addresses the original claim.
In: Statistics and Probability
1. Suppose that personal daily water usage in California is normally distributed with a mean of 17 gallons and a standard deviation of 6 gallons.
(a) What proportion of California’s population uses between 10 and 20 gallons daily?
(b) The governor of California wants to give a tax rebate to the 20% of the population that uses the least amount of water. What should the governor use as the maximum daily water usage for a person to qualify for the tax rebate?
(c) The governor randomly samples 100 people from the California population. What is the probability that the average usage among these 100 people is more than 18 gallons/day?
In: Statistics and Probability
To study the effect of curing temperature on shear strength of a certain rubber compound, 80 specimens were cured at 150°C and 105 were cured at 130°C. The specimens cured at 150°C had an average shear strength of 620 psi, with a standard deviation of 20 psi. Those cured at 130°C had an average shear strength of 750 psi, with a standard deviation of 30 psi. Let μXμX represent the population mean strength for the specimens cured at 130°C and let μYμY represent the population mean strength for the specimens cured at 150°C. Find a 95% confidence interval for the difference μX−μYμX−μY . Round the answers to two decimal places.
In: Statistics and Probability
2. A 95% confidence interval for ?? − ?? is (-4.5, -0.15). Based upon the data from which the confidence interval was constructed, someone wants to test H0: ?? = ?? versus HA: ?? ≠ ?? at the ? = 5% significance level.
(a) Based upon the confidence interval, what is your conclusion of the hypothesis test? Explain.
(b) Can we use the above confidence interval without any additional information to conduct the hypothesis test at the ? = 10% level? Why or why not?
In: Statistics and Probability
A) Two arthroscopic surgeries were compared among patients suffering from osteoarthritis who had at least moderate knee pain. The lavage type has the joint flushed with fluid and no instrument is used to remove tissue; the debridement type has the joint flushed with fluid and an instrument is used to remove tissue. Knee pain scores, ranging from 0 to 100 with higher scores indicating severe pain, were obtained for patients randomly assigned to the two groups after the surgery and the results are given below:
------------Sameple Size--------Sample Mean-------Sample Standard Deviation
Lavage 11 59 24
Debridment 13 49 23
Determine a 95% confidence interval for the difference between the mean knee pain score for the lavage group and that for the debridement group
B) A test of abstract reasoning is given to a random sample of 10 students before and after completing a formal logic course. The results are shown below.
Before----------After
71 71
81 86
85 82
67 75
92 95
71 66
61 67
78 81
64 71
80 88
Determine a 95% confidence interval for the difference between the mean score after completing the course and the mean score before completing the course
C) The quality control officer collects a random sample of 100 yardsticks from the day's production run. The sample mean is 36 inches. The population standard deviation for that day’s production is σ =1 inches.
1)Find a 99% confidence interval for the mean length of all yardsticks made that day
2)The margin of error associated with the 99% confidence interval for the mean length of all yardsticks made that day is
3)You decide that margin of error in previous question is too large, and you want it to be at most 0.1 inches, what is the minimum sample size in order to obtain this margin of error with a confidence level of 99%?
In: Statistics and Probability