"A test to determine whether a certain antibody is present is 99.6% effective. This means that the test will accurately come back negative if the antibody is not present (in the test subject) 99.6% of the time. The probability of a test coming back positive when the antibody is not present (a false positive) is 0.004. Suppose the test is given to five randomly selected people who do not have the antibody. (a) What is the probability that the test comes back negative for all five people? (b) What is the probability that the test comes back positive for at least one of the five people?"
In: Statistics and Probability
Rosa Diaz is one of the best detectives at the 99th precinct in Brooklyn. She averages 4.89 felony arrests a week. Assume a 5-day workweek.
A) What is the probability that Rosa has 5 felony arrests in a week?
B) What is the probability that Rosa has at least 3 felony arrests in a week?
C) What is the probability that Rosa has between 5 and 10 felony arrests in a week exclusive?
D) What is the probability that Rosa has 3 felony arrests in a day?
In: Statistics and Probability
Problem 1
The time for an employee to complete a certain manufacturing task was was approximately Normally distributed with a mean of 138 seconds and a standard deviation of 39 seconds. Suppose you plan to take an SRS of 250 employees and look at the average manufacturing time.
a) Determine the mean and standard deviation of the distribution of sample means.
b) Calculate the probability of coming up with a sample that shows a manufacturing time of 114 seconds or less.
Problem 2
You want to research the cost of a certain trendy ergonomic chair. In your online search of the cost of that chair on 20 different sites, you note that the mean cost of the chair is $562. Assume that the SD of the chair among all sites is $74. Find the 90%, 95%, and 99% confidence intervals for the mean cost of this chair.
Look at the 95% confidence interval and say whether the following statement is true or false. “This interval describes the price of 95% for all chairs of this model that are being sold.” If you think this is incorrect, be sure to explain why.
Problem 3
A credit card company wants to estimate the average length of client churn (turning over their card for a different one). In a sample of 5000 customers obtained at random from the company’s database, the mean churn is 418. The the standard deviation of the calls from that sample is 93. Provide the mean and SD of the distribution of this sample.
Problem 4
A web-based travel search engine is designed to complete searches for cruise ships involving price retrieval and scheduling information. You need to analyze how long it takes to complete a typical search. The search time for 2500 randomly selected searches is recorded and detreemined as sample mean = 0.94 seconds and sample standard deviation = 0.36 seconds. Choose the most typical value for C and provide the confidence interval.
In: Statistics and Probability
15. To measure the relationship between anxiety level and test performance, a psychologist obtains a sample of n=7 college students from an introductory statistics course. The students sre asked to come to the laboratory 15 minutes before the final exam. In the lab, the psychologist record physiological measures of anxiety (heart rate, skin resistance, blood pressure, and so on) for each participant. In addition, the psychologist obtains the exam score for each participant.
Student Anxiety Rating Exam Score
A 3 81
B 4 83
C 7 80
D 6 81
E 2 87
F 7 82
G 5 79
a. Compute the Pearson correlation for the data. Be sure to show all formulas with symbols (and plug in numbers), steps, processes and calculations for all steps and all parts of all answers. b. Is there a significant relationship between anxiety and exam score? Use a two-tailed test with alpha =.05. Be sure to show all formulas with symbols (and plug in numbers), steps, processes and calculations for all steps and all parts of all answers.
In: Statistics and Probability
Age Browsing Time (min/wk)
34 372
48 198
24 469
33 387
43 262
58 85
20 469
49 192
64 86
55 119
46 233
36 345
35 357
29 415
37 327
d) Check the residuals to see if the conditions for inference are met.
The equal spread condition IS/IS NOT satisfied because the scatterplot of the residuals against the predicted values shows NO PATTERN/A THICKENING OF THE SPREAD/A THINNING OF THE SPREAD/A CURVATURE.
.The nearly normal condition IS/IS NOT satisfied because the Normal probability plot SHOWS/DOES NOT SHOW an approximately straight line. (This is also the case because a histogram of the residuals IS APPROXIMATELY/IS NOT unimodal andsymmetric.)
The outlier condition IS/IS NOT satisfied because the Normal probability plot SHOWS SOME/DOES NOT SHOW ANY extreme outliers.
In: Statistics and Probability
A magazine uses a survey of readers to obtain customer satisfaction ratings for the nation's largest retailers. Each survey respondent is asked to rate a specified retailer in terms of six factors: quality of products, selection, value, checkout efficiency, service, and store layout. An overall satisfaction score summarizes the rating for each respondent with 100 meaning the respondent is completely satisfied in terms of all six factors. Sample data representative of independent samples of Retailer A and Retailer B customers are shown below.
Retailer A | Retailer B |
---|---|
n1 = 25 |
n2 = 30 |
x1 = 80 |
x2 = 72 |
(a)
Formulate the null and alternative hypotheses to test whether there is a difference between the population mean customer satisfaction scores for the two retailers. (Let μ1 = population mean satisfaction score for Retailer A customers and μ2 = population mean satisfaction score for Retailer B customers.)
H0: μ1 − μ2 ≠ 0
Ha: μ1 − μ2 = 0
H0: μ1 − μ2 = 0
Ha: μ1 − μ2 ≠ 0
H0: μ1 − μ2 < 0
Ha: μ1 − μ2 = 0
H0: μ1 − μ2 ≤ 0
Ha: μ1 − μ2 > 0
H0: μ1 − μ2 ≥ 0
Ha: μ1 − μ2 < 0
(b)
Assume that experience with the satisfaction rating scale of the magazine indicates that a population standard deviation of 11 is a reasonable assumption for both retailers. Conduct the hypothesis test.
Find the value of the test statistic. (Round your answer to two decimal places.)
Find the p-value. (Round your answer to four decimal places.)
p-value =
At a 0.05 level of significance what is your conclusion?
Reject H0. There is insufficient evidence to conclude that the population mean satisfaction scores differ for the two retailers.Reject H0. There is sufficient evidence to conclude that the population mean satisfaction scores differ for the two retailers. Do not Reject H0. There is sufficient evidence to conclude that the population mean satisfaction scores differ for the two retailers.Do not reject H0. There is insufficient evidence to conclude that the population mean satisfaction scores differ for the two retailers.
(c)
Provide a 95% confidence interval for the difference between the population mean customer satisfaction scores for the two retailers. (Round your answers to two decimal places.)
to
Which retailer, if either, appears to have the greater customer satisfaction?
The 95% confidence interval ---Select--- is completely below is completely above contains zero. This suggests that the Retailer A has a ---Select--- higher lower population mean customer satisfaction score than Retailer B.
In: Statistics and Probability
An automatic filling machine is used to fill bottles with liquid detergent. A random sample of 20 bottles results in a sample variance of fill volume of s2 = 0.01532 (fluid ounce). If the variance of fill volume is too large, an unacceptable proportion of bottles will be under- or overfilled. We will assume that the fill volume is approximately normally distributed. Determine the 95% upper confidence bound of the variance.
In: Statistics and Probability
In: Statistics and Probability
Important: please show work for each question. Thank you!
In a previous section of PSY230, the second exam was worth 80 points. The scores from that class were normally distributed with a mean (μ ) of 65 and a standard deviation (σ) of 5. If the exam scores were converted to a Z distribution, the distribution would form a perfect bell shape. The following questions require locating individual exam scores on the Z distribution and examine the percentage (or proportion) of cases above or below a score.
Hints: It helps to draw a Z distribution (bell curve) and place John’s and Tom’s Z scores on the distribution for answering the questions. Use the Z table for converting between Z score and area (percentage) of the distribution.
In: Statistics and Probability
The organisation ‘Australian Hearing’, an agency of the federal government, published a report on ‘binge listening’. As part of the research for the report, they carried out a survey, which was described as follows:
'One thousand Australians aged between 18 and 35 years in metropolitan and regional locations in all states across all education levels participated in a 15- minute online survey. The questions related to their exposure to noise during leisure activities, their perception of noise, perceived causes of hearing loss and attitudes towards hearing protection. The research was conducted by Inside Story.'
Which one of the following propositions is reasonable, in the circumstances and given only the information provided in the paragraph?
a) Although there is not much detail in the description here, it is reasonable to assume that a survey conducted by a federal agency was well-conducted and that the results will be reliable.
b) The large sample size of 1000 means that the results will be unbiased.
c) The coverage of all areas and states in Australia and all education levels means that the result will be reliable.
d) The online mechanism used to provide responses was a reasonable way for the survey participants to provide their views.
In: Statistics and Probability
For 2018, Charlie’s would like to discontinue one of the appetizers. Which one would you suggest they discontinue, and why do you make that recommendation? Find two different ways to do an analysis that leads you to two different conclusions. (Using regression, correlation or descriptive stats)
Appetizer | Month | Sales (Orders) |
French Fries | April | 8897 |
French Fries | August | 8424 |
French Fries | October | 8203 |
French Fries | July | 8446 |
French Fries | May | 8059 |
French Fries | June | 8290 |
French Fries | September | 8966 |
Cheese Curds | July | 1842 |
Cheese Curds | June | 1920 |
Cheese Curds | October | 1697 |
Cheese Curds | April | 1609 |
Cheese Curds | September | 1755 |
Cheese Curds | May | 1706 |
Cheese Curds | August | 2071 |
Chicken Strips | July | 749 |
Chicken Strips | September | 796 |
Chicken Strips | October | 619 |
Chicken Strips | August | 802 |
Chicken Strips | May | 634 |
Chicken Strips | June | 766 |
Chicken Strips | April | 869 |
Onion Rings | July | 6541 |
Onion Rings | August | 6365 |
Onion Rings | May | 6699 |
Onion Rings | June | 6626 |
Onion Rings | April | 7070 |
Onion Rings | October | 6123 |
Onion Rings | September | 6360 |
Corn Dogs | July | 5973 |
Corn Dogs | June | 6103 |
Corn Dogs | May | 6051 |
Corn Dogs | August | 6054 |
Corn Dogs | October | 6431 |
Corn Dogs | September | 6087 |
Corn Dogs | April | 5995 |
In: Statistics and Probability
Note: Optical Character Recognition, or OCR, is a technology that enables you to convert different types of documents, such as scanned paper documents, PDF files or images captured by a digital camera into editable and searchable data.
Source: Montgomery, D. & Runger, G. (2014). Applied Statistics and Probability for Engineers. 6th edition, Hoboken, NJ: Wiley.
Using RStudio, construct and interpret a 95% confidence interval for the true proportion of correct digits that can be automatically read by OCR technology in USPS.
Note: Report CI (3 decimal places):
Note: Attach your R codes in the following space:
In: Statistics and Probability
The Director of Acquisition marketing for L.L. Bean is testing a series of new email lists and comparing it to a cross section of her usual lists to serve as a control. To do this she set up a spread sheet and computed the test statistic and pvalue of all of the test lists as compared to the control. The test statistics are representative to a two tailed hypothesis test where Ho: u1= u2 vs Ha: u1 ≠ u2.The results are shown below:
n |
opens |
open rate |
Test Statistic for Open Rate |
Pvalue |
clicks |
click rate |
Test Statistic for Click Rate |
Pvalue |
|
control |
15,000 |
4,733 |
31.6% |
3,219 |
68.0% |
||||
Ionic email list 3 month recent |
15,000 |
5,355 |
35.7% |
-4.41 |
<.0001 |
2,566 |
47.9% |
15.65 |
<.0001 |
Lucid email list new moms |
15,000 |
5,721 |
38.1% |
-7.07 |
<.0001 |
3,901 |
68.2% |
-0.16 |
0.8728 |
Collegiate student list |
15,000 |
7,332 |
48.9% |
-19.41 |
<.0001 |
5,007 |
68.3% |
-0.26 |
0.7948 |
Experian select Women 24-35 |
15,000 |
6,021 |
40.1% |
-9.28 |
<.0001 |
4,022 |
66.8% |
1.09 |
0.2758 |
Apparel shoppers opt in list |
15,000 |
4,899 |
32.7% |
-1.16 |
3,390 |
69.2% |
-1.04 |
||
a. Compute the pvalue for the Apparel shoppers opt in list. Note that the pvalue for the open rate and the click rate are both missing.
b. All comparisons are made relative to the control. Answer what is the best test list for click rate. Is it the best directionally or with statistical significance?
c. All comparisons are made relative to the control. Answer what is the worst test list for open rate. Is it the worst directionally or with statistical significance?
In: Statistics and Probability
A service process has three serial stages. The defect percentage at stage one is 7%. The defect percentage at stage two is 13%. And, the defect percentage at stage three is 15%. Use 5 decimals for probabilities and 2 decimals for sigma levels in the following situations. (a)[2] Situation A: the connection logic of the three stages is that a good overall outcome only happens if all three stages individually have good outcomes. Draw the event tree for this situation. Calculate the probability of defective overall outcomes and the probability of good overall outcomes. Using Excel, calculate the corresponding sigma level and make a statement. (b)[2] Situation B: the connection logic of the three stages is that a good overall outcome happens when at least one stage has good outcome. Draw the event tree for this situation. Calculate the probability of defective overall outcomes and the probability of good overall outcomes. Calculate the corresponding sigma level and make a statement. (c)[3] Situation C: the connection logic of the three stages is that a good overall outcome happens when at least two stages individually have good outcomes. Draw the event tree for this situation. Calculate the probability of defective overall outcomes and the probability of good overall outcomes. Calculate the corresponding sigma level and make a statement. (d)[3] Give some possible real‐life processes for the three situations above.
In: Statistics and Probability
Below are the number of hours spent exercising:
2 |
3 |
4 |
4 |
4 |
5 |
1 |
1 |
4 |
4 |
4 |
1 |
2 |
3 |
3 |
2 |
Hint: You shouldn’t have to recalculate from the raw data to answer questions 7 & 8.
In: Statistics and Probability