The Condé Nast Traveler Gold List provides ratings for the top 20 small cruise ships. The data shown below are the scores each ship received based upon the results from Condé Nast Traveler's Annual Readers' Choice Survey. Each score represents the percentage of respondents who rated a ship as excellent or very good on several criteria, including Shore Excursions and Food/Dining. An overall score was also reported and used to rank the ships. The highest ranked ship, the Seabourn Odyssey, has an overall score of 94.4, the highest component of which is 97.8 for Food/Dining.
Ship | Overall | Shore Excursions |
Food/Dining |
---|---|---|---|
Seabourn Odyssey | 94.4 | 90.9 | 97.8 |
Seabourn Pride | 93.0 | 84.2 | 96.7 |
National Geographic Endeavor | 92.9 | 100.0 | 88.5 |
Seabourn Sojourn | 91.3 | 94.8 | 97.1 |
Paul Gauguin | 90.5 | 87.9 | 91.2 |
Seabourn Legend | 90.3 | 82.1 | 98.8 |
Seabourn Spirit | 90.2 | 86.3 | 92.0 |
Silver Explorer | 89.9 | 92.6 | 88.9 |
Silver Spirit | 89.4 | 85.9 | 90.8 |
Seven Seas Navigator | 89.2 | 83.3 | 90.5 |
Silver Whisperer | 89.2 | 82.0 | 88.6 |
National Geographic Explorer | 89.1 | 93.1 | 89.7 |
Silver Cloud | 88.7 | 78.3 | 91.3 |
Celebrity Xpedition | 87.2 | 91.7 | 73.6 |
Silver Shadow | 87.2 | 75.0 | 89.7 |
Silver Wind | 86.6 | 78.1 | 91.6 |
SeaDream II | 86.2 | 77.4 | 90.9 |
Wind Star | 86.1 | 76.5 | 91.5 |
Wind Surf | 86.1 | 72.3 | 89.3 |
Wind Spirit | 85.2 | 77.4 | 91.9 |
(a)
Determine an estimated regression equation that can be used to predict the overall score given the score for Shore Excursions. (Round your numerical values to two decimal places. Let x1 represent the Shore Excursions score and y represent the overall score.)
ŷ =
(b)
Consider the addition of the independent variable Food/Dining. Develop the estimated regression equation that can be used to predict the overall score given the scores for Shore Excursions and Food/Dining. (Round your numerical values to two decimal places. Let x1 represent the Shore Excursions score, x2 represent the Food/Dining score, and yrepresent the overall score.)
ŷ =
(c)
Predict the overall score for a cruise ship with a Shore Excursions score of 82 and a Food/Dining Score of 91. (Round your answer to one decimal place.)
In: Statistics and Probability
to estimate the fuel efficiency of a new compact automobile, an engineer performed n=16 test runs and recorded the miles per gallon (mpg) for each run. the sample mean was 42.2 mpg with standard deviation 8.8.
a) construct a 99% confidence interval for the true mean mileage for this automobile.
- Lower limit= ?
-upper limit= ?
(please round your answers to 1 decimal place)
b) the manufacturer of this automobile claims that it gets 50mpg. Based on the confidence interval, do you think this is a reasonable claim? why or why not?
In: Statistics and Probability
2. Suppose a hurricane hits south Florida in any given year with probability 0.2. Describe the distribution of the following random variables, including both the name of the distribution and the parameters (such as X~Bernoulli(0.4)). (2 points each)
a.Let X be the number of years until the next hurricane hits South Florida.
b.Let X be the number of hurricanes that will hit south Florida in the next 10 years.
c.Let X indicate whether a hurricane will hit south Florida in 2020.
d.Let X indicate whether south Florida will avoid getting a hurricane in 2020.
In: Statistics and Probability
You have a company with 300 employees. Every year they have to
go through a mandatory training which requires them to watch
company made videos. You want to test if the employees will pay
more attention to the videos if they include popular music (top 50
hits) or classical music. You select 40 employees to take part in a
study where they watch the videos alone in a room and someone
watching from a webcam counts how many times the employee looked
away from the screen (you call this 'Distractions'). Half of the
employees watch the videos with popular music and the other half
watch with classical music. Use Popular music as group 1 and
Classical music as group 2. The first 5 rows of the data are shown
here.
a. What is the sample?
b. What is the population of interest?
c. What is the parameter you are interested in? ---
x1-x2 μ x μ1-μ2
There is also a summary of the data:
Group | Mean | Standard deviation |
Popular | 27.9 | 8.2 |
Classical | 36.2 | 9.5 |
(For the numerical answers below, use 2 decimal places.)
d. The point estimate is:
e. The standard error is:
f. The 95% confidence interval goes from
to
g. Which group had more distractions in the sample? ---
Classical They were equal Popular
h. Would you say that there is a difference between the groups at
the population level? Explain why or why not. If you say there is a
difference explain which group has more distractions?
In: Statistics and Probability
You are interested in gathering data from State Senators in Nebraska to find out their opinions on major social and political issues. You decide to go to the capitol building on a day the senators will be in session (gathering to do business) and stand outside the main doors to the legislative chamber (the room where the senators gather). You ask every 5th senator that enters whether they would be willing to fill out your survey and be part of your sample.
A. Is this a probability sampling method or a non-probability sampling method? Justify your answer.
B. Would the data you collect using this sampling method be representative of the population of state senators? Justify your answer.
Do not use outside interval calculators and show all work
In: Statistics and Probability
These data are observations collected using a completely randomized design.
Sample 1 | Sample 2 | Sample 3 |
4 | 4 | 2 |
2 | 3 | 0 |
4 | 5 | 2 |
3 | 2 | 1 |
2 | 6 |
(a) Calculate CM and Total SS. (Round your answer for CM to six decimal places and your Total SS to four decimal places.
(b) Calculate SST and MST. (Round your answers to four decimal places.)
(c) Calculate SSE and MSE. (Round your answers to four decimal places.)
(d) Construct an ANOVA table for the data. (Round your answer for F to two decimal places. Round your answers for SS and MS to four decimal places.)
Source | df | SS | MS | F |
Treatments | ||||
Error | ||||
Total |
In: Statistics and Probability
A manufacturing company produces part QV2Y for the aerospace industry. This particular part can be manufactured using 3 different production processes. The management wants to know if the quality of the units of part QV2Y is the same for all three processes. The production supervisor obtained the following data: Process 1 had 29 defective units in 240 items, Process 2 produced 12 defective units in 180 items, and Process 3 manufactured 9 defective units in 150 items. At a significance level of .05, we performed a chi-square test to determine whether the quality of the items produced appears to be the same for all three processes. What is the null hypothesis?
H0: The proportion of defective units produced by the three production processes is the same.
Both "H0: The number of defectives produced is independent of the production process used." and "H0: The proportion of defective units produced by the three production processes is the same." are correct or at least acceptable ways of stating the null hypothesis.
All of the other choices are acceptable ways of stating the null hypothesis.
H0: The row and column variables are associated with each other.
H0: The number of defectives produced is independent of the production process used.
In: Statistics and Probability
One of the primary advantages of a repeated-measures design, compared to an independent-measures design, is that it reduces the overall variability by removing variance caused by individual differences. The following data are from a research study comparing three treatment conditions.
treatment |
|||
A |
B |
C |
P |
6 |
9 |
12 |
27 |
8 |
8 |
8 |
24 |
5 |
7 |
9 |
21 |
0 |
4 |
8 |
12 |
2 |
3 |
4 |
9 |
3 |
5 |
7 |
15 |
N=18, G=108, SUM=108
Treatment A: -
M=4
T=24
SS=42
Treatment B:-
M=6
T=36
SS=28
Treatment c:-
M=8
T=48
SS=34
a) Assume that the data are from an independent-measures study using three separate samples, each with n = 6 participants. Ignore the column of P totals and use an independent-measures ANOVA with alpha = .05 to test the significance of the mean differences.
b) Now assume that the data are from a repeated-measures study using the same sample of n = 6 participants in all three treatment conditions. Use a repeated-measures ANOVA with alpha = .05 to test the significance of the mean differences.
c) Explain why the two analyses lead to different conclusions.
In: Statistics and Probability
The table below describes the smoking habits of a group of asthma sufferers. Use this table to answer questions 7-9.
Nonsmoker |
Light Smoker |
Heavy Smoker |
Total |
|
Men |
330 |
75 |
65 |
470 |
Women |
315 |
83 |
87 |
485 |
Total |
645 |
158 |
152 |
955 |
If one of the subjects is randomly selected, find the probability that she is a heavy smoker given that she is a woman.
In: Statistics and Probability
2. In terms of estimating population parameters using sample statistics, one would think that the highest level of confidence in an estimate (in other words, the 99% confidence level) would always be preferred by researchers.
A. What is/are the advantage(s) of reporting a confidence interval at the 99% confidence level?
B. Explain why researchers might rather choose a lower level of confidence (say the 95% or 90% confidence level) in their estimation of confidence intervals.
Show all work, do not use outside interval calculator.
In: Statistics and Probability
A dietician read in a survey that 82.04% of adults in the U.S.
do not eat breakfast at least 2 days a week. She believes that a
smaller proportion skip breakfast 2 days a week. To verify her
claim, she selects a random sample of 77 adults and asks them how
many days a week they skip breakfast. 49 of them report that they
skip breakfast at least 2 days a week. Test her claim at αα =
0.01.
The correct hypotheses would be:
Since the level of significance is 0.01 the critical value is
-2.326
The test statistic is: (round to 3 places)
The p-value is: (round to 3 places)
The decision can be made to:
The final conclusion is that:
In: Statistics and Probability
Question 1
After a number of complaints about its directory assistance, a telephone company examined samples of calls to determine the frequency of wrong numbers given to callers. Each sample consisted of 100 calls.
SAMPLE |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
Number of errors |
5 |
3 |
5 |
7 |
4 |
6 |
8 |
4 |
SAMPLE |
9 |
10 |
11 |
12 |
13 |
14 |
15 |
16 |
Number of errors |
5 |
9 |
3 |
4 |
5 |
6 |
6 |
7 |
a. Set the control limits to include 95% of random variation in
complaint rate.
b. Plot the values of the sample statistic on a chart. Comment on
the results.
c. Discuss THREE major management decisions related to the use of
the control charts.
In: Statistics and Probability
. As businesses try to survive in the current crisis, Route 22 Honda sought to estimate the satisfaction score from 915 who have purchased a car or truck in 2019. The company goal is to achieve an average satisfaction score of at least 80 on a scale of 0-100. A random sample of 80 customers had an average satisfaction score of 82.7 and a sample deviation of 6.7
a. Construct a 95% confidence interval to estimate the actual average satisfaction score form customers in 2019.
b. Based on this sample, can the manager conclude that Route 22 Honda reached its goal?
2. 2. The state of Florida wanted to estimate the proportion of voters who intend to vote in the presidential election. A pilot sample of 50 voters found that 36 of them intend to vote in the election. Determine the additional number of voters that need to be sampled to construct a 95% interval with a margin of error equal to 0.05 to estimate the proportion.
3. 3. Apple claims that the average wait time for a customer calling Apple Care support line is 175 seconds. A random sample of 40 customers had an average wait time of 187 seconds. Assume the population standard deviation for wait time is 50 seconds
a. Using a 95% confidence interval, does this sample support Apple’s claim?
In: Statistics and Probability
You have set a professional goal to become the head of your workplace’s Party Planning Committee. Your CEO has agreed that the role will be yours if you can inspire your team to create enough workplace goodwill that more than 60% of this year’s Holiday Party attendees participate in the ‘ugly sweater’ contest. When you see that 34 of the 47 attendees at the party have on some of the ugliest holiday sweaters you’ve ever seen, you think the new title is yours. However, the next day the CEO comes up to you and says she thinks it was just random chance that so many employees participated in the contest and she is going to hand your committee over to your arch-nemesis, Karen from finance. Claim your new title by proving her wrong!
2. Verify all assumptions required for this test
In: Statistics and Probability
1. A laboratory is interested in how many times they can use a specific brand of test tube on a Bunsen burner before it cracks. Assume that the population is normally distributed. They test 20 test tubes and find that on average, it took reheating them 1111 times until they cracked, with a standard deviation of 234. Compute the 98% confidence interval to estimate the average number of times they can use the tube. Interpret this confidence interval. You do NOT need to check CLT here.
In: Statistics and Probability