The data for a random sample of 10 paired observations are shown in the following table.
Pair | Population1 | Population 2 |
1 | 19 | 24 |
2 | 25 | 27 |
3 | 31 | 36 |
4 | 52 | 53 |
5 | 49 | 55 |
6 | 34 | 34 |
7 | 59 | 66 |
8 | 47 | 51 |
9 | 17 | 20 |
10 | 51 | 55 |
If you wish to test whether these data are sufficient to indicate that the mean for population 2 is larger than that for population 1, what are the appropriate null and alternative hypotheses? Define any symbols you use.
Conduct the test from part a, using α=.05. What is your decision?
Find a 95% confidence interval for μd. Interpret this interval.
What assumptions are necessary to ensure the validity of the preceding analysis?
In: Math
In 2005, 0.76% of all airline flights were on-time. If we choose a simple random sample of 2000 flights, find the probability that... (to four decimal places, using normal chart, no continuity correction)
(a) at least 79% of the sample's flights were on time
(b) at most 1580 of the sample's flights were on time
(c) the sample proportion of on-time flights (p-hat) differs from the truth by more than three percent
In: Math
Please double-check, provide work, and show explanation
Assume I toss a fair coin exactly 17 times. Find the probability of the following outcomes:
A) 4 tails
B) 17 tails
C) 5 heads
D) 17 heads
E) 13 or more tails
F) 5 or fewer tails
In: Math
Eric wants to estimate the percentage of elementary school children who have a social media account. He surveys 450 elementary school children and finds that 280 have a social media account. Identify the values needed to calculate a confidence interval at the 99% confidence level. Then find the confidence interval. z0.10 z0.05 z0.025 z0.01 z0.005 1.282 1.645 1.960 2.326 2.576
In: Math
The director of the IRS has been flooded with complaints that people must wait more than 35 minutes before seeing an IRS representative. To determine the validity of these complaints, the IRS randomly selects 400 people entering IRS offices across the country and records the times which they must wait before seeing an IRS representative. The average waiting time for the sample is 50 minutes with a standard deviation of 23 minutes. Is there overwhelming evidence to support the claim that the wait time to see an IRS representative is more than 35 minutes at a 0.025 significance level?
Step 1 of 3 :
Find the value of the test statistic. Round your answer to three decimal places, if necessary.
In: Math
A college's data about the incoming freshmen indicates that the mean of their high school GPAs was
3.5,
with a standard deviation of
.20;
the distribution was roughly mound-shaped and only slightly skewed. The students are randomly assigned to freshman writing seminars in groups of
25.
What might the mean GPA of one of these seminar groups be? Describe the appropriate sampling distribution model, including shape, center, and spread, with attention to assumptions and conditions. Make a sketch using the 68-95-99.7 Rule.
In: Math
PLEASE USE R PROGRAMMING TO SOLVE THIS AND GIVE A COPYABLE CODE
Professor David teach Math 1054. In that course, there are 5 exams, 4 midterm exams, plus final exam. You are asked to help him to check whether there is significant difference among the difficulty level in these 5 exams. The data is stored in Math1054Exam.txt.
a). read in data (copy your R code)
b). Find averages for these exams. You can use mean(V) function, or summary(data) function.
c). Are these averages differ significantly? We need to use ANOVA. For this purpose, you need to use stack function to change the data format as two column, one is score, the
other is exam index.
d). Produce a boxplot for the five exams.
e). Conduct ANOVA test.
f). Copy your ANOVA table.
g). Make your conclusion.
------------------------------------------------------------------------------4
Math1054Exam.txt.
T1 T2
T3 T4 Final
90 69 62 65
92
67 59 78 63
43
76 73 66 41
71
80 92 46 92
70
98 79 60 84
99
82 80 69 83
94
69 45 47 70
37
83 90 54 66
50
93 94 90 94
99
66 67 50 52
32
45 71 55 58
83
76 92 65 72
95
73 92 84 70
77
81 68 30 53
34
60 58 43 50
36
92 90 86 89
95
62 54 52 60
88
85 80 65 82
88
69 80 81 67
75
71 88 43 61
89
In: Math
You are a social worker in an OBGYN department of a hospital. The CEO of the hospital has asked you to write a report about the length of time that women spend in the hospital after giving birth. The CEO has told you that in the thinks that insured women spend longer on average in the hospital than uninsured women after childbirth. She wants you to test this claim with data from your hospital. Two samples of 16 women were taken. Test the claim that insured women spend longer in the hospital than uninsured women using an α = .01.
Insured:
Mean = 2.3 days
Standard Deviation = 0.77
Sample Size = 16
Uninsured:
Mean = 1.9 days
Standard Deviation = 0.77
Sample Size = 16
To complete this lab exercise, you should:
Identify whether you will test this claim using a 1-tailed hypothesis or a 2-tailed hypothesis.
State the Null and Research Hypotheses
Find the Critical Value using the T-Table and interpret what you will do with the null hypothesis given that Critical Value
Identify the Correct Degrees of Freedom you’ll use
HAND CALCULATE the t-value. Show your work. Start with the formula and then plug in the correct values from there.
Make a decision regarding the null. Interpret your decision with regard to this question.
Find the p-value range for the hypothesis test using your test statistic and the t-table.
BY HAND, construct a 98% Confidence Interval for the difference between insured and uninsured women. What does this confidence interval tell you?
What are your overall conclusions? Is the confidence interval interpretation consistent with your interpretation from the t-test?
In: Math
The data shown below contains the monthly sales (in thousands of dollars) at a local department store for each of the past 24 months. Copy the data to Excel, perform a runs test, and compute a few autocorrelations to determine whether this time series is random. Give your conclusion by referring to the results of your runs test and autocorrelations (no need to try to attach the Excel output in this text box, just mention the specific values you refer to).
Month |
Sales |
Month |
Sales |
1 |
987 |
13 |
1080 |
2 |
1080 |
14 |
1002 |
3 |
975 |
15 |
968 |
4 |
1060 |
16 |
984 |
5 |
1030 |
17 |
1045 |
6 |
895 |
18 |
945 |
7 |
908 |
19 |
1025 |
8 |
1059 |
20 |
950 |
9 |
940 |
21 |
1004 |
10 |
1038 |
22 |
1075 |
11 |
1050 |
23 |
969 |
12 |
1030 |
24 |
1029 |
In: Math
Please be sure to show all work to receive full credit. No credit will be given for an answer that is correct but has no work where it is necessary, or if the work showed does not logically equate to the given answer. No credit will be given for an incorrect answer with no work. Please try to answer all at one time.. please
1. The World Health Organization estimates that 5% of all adults in sub-Saharan Africa are living with HIV/AIDS. A survey takes a random sample of 1600 adults from all over sub-Saharan Africa and finds that 72 have HIV/AIDS.
a) What is the sample proportion and what is the population proportion?
b) What is the mean of the sampling distribution?
c) What is the standard deviation of the sampling distribution?
d) What is the probability that a random sample of 1600 adults in sub-Saharan Africa would have less than 4.5% living with HIV/AIDS?
2. In a random sample of 36 students at a college, the average IQ was x = 107. Assume that the IQ of a student in college follows a Normal distribution with unknown mean and standard deviation σ = 16.
a) Give a 90% confidence interval for μ.
b)Give a 99% confidence interval for µ.
c) Which confidence interval has a larger margin of error? Why does this make sense?
3) A researcher collects a random sample of size n from
a population with standard deviation σ and, from the data
collected, computes a 95% confidence interval for the mean of the
population. Which of the following would produce a new confidence
interval with a smaller width (smaller margin of error) based
on these same data?
A. Increase σ.
B. Use a lower confidence level.
C. Use a smaller sample size.
Explain why you chose your answer.
B) The IQ scores for adults in the entire population have an approximately normal distribution with mean 100 and standard deviation 15. The researcher’s question in “Do college students aged 20 to 25 years old have a higher mean IQ than the rest of the population?” What are the appropriate null and alternative hypothesis a researcher would have to use if they chose to do an experiment to try to answer this question? Explain.
C) A survey on the number of hours spent working out was given to professors from the US. Assume the standard deviation of the number of hours spent by all professors in the US was found to be 1.94 hours. At a 98% confidence level, how many professors would have to be surveyed in order to bring the margin of error down to 0.1?
In: Math
Sociologists say that 85% of married women claim that their husband's mother is the biggest bone of contention in their marriages (sex and money are lower-rated areas of contention). Suppose that ten married women are having coffee together one morning. Find the following probabilities. (For each answer, enter a number. Round your answers to three decimal places.)
(a) All of them dislike their mother-in-law. = 0.197
(b) None of them dislike their mother-in-law. = 0.00
(c) At least eight of them dislike their mother-in-law. =
(d) No more than seven of them dislike their mother-in-law. =
In: Math
n a survey of a group of men, the heights in the 20-29 age group were normally distributed, with a mean of 68.6 inches and a standard deviation of 4.0 inches. A study participant is randomly selected. Complete parts (a) through (d) below. (a) Find the probability that a study participant has a height that is less than 66 inches. The probability that the study participant selected at random is less than 66 inches tall is (b) Find the probability that a study participant has a height that is between 66 and 71 inches. The probability that the study participant selected at random is between 66 and 71 inches tall is (c) Find the probability that a study participant has a height that is more than 71 inches. The probability that the study participant selected at random is more than 71 inches tall
In: Math
1) Use worksheet “baseball stats” to perform a multiple regression analysis on the dataset found in BB2011 tab, using Wins as the dependent variable, and League, ERA, Runs Scored, Hits Allowed, Walks Allowed, Saves, and Errors as candidates for the independent variables. Perform the analysis at the 5% significance level.
Team | League | Wins | E.R.A. | Runs Scored | Hits Allowed | Walks Allowed | Saves | Errors |
Baltimore | 0 | 69 | 4.89 | 708 | 1568 | 535 | 32 | 110 |
Boston | 0 | 90 | 4.20 | 875 | 1366 | 540 | 36 | 92 |
Chicago White Sox | 0 | 79 | 4.10 | 654 | 1463 | 439 | 42 | 79 |
Cleveland | 0 | 80 | 4.23 | 704 | 1482 | 463 | 38 | 110 |
Detroit | 0 | 95 | 4.04 | 787 | 1406 | 492 | 52 | 103 |
Kansas City | 0 | 71 | 4.44 | 730 | 1487 | 557 | 37 | 95 |
Los Angeles Angels | 0 | 86 | 3.57 | 667 | 1388 | 476 | 39 | 93 |
Minnesota | 0 | 63 | 4.58 | 619 | 1564 | 480 | 32 | 119 |
New York Yankees | 0 | 97 | 3.73 | 867 | 1423 | 507 | 47 | 102 |
Oakland | 0 | 74 | 3.71 | 645 | 1380 | 519 | 39 | 124 |
Seattle | 0 | 67 | 3.90 | 556 | 1369 | 436 | 39 | 108 |
Tampa Bay | 0 | 91 | 3.58 | 707 | 1263 | 504 | 32 | 73 |
Texas | 0 | 96 | 3.79 | 855 | 1327 | 461 | 38 | 114 |
Toronto | 0 | 81 | 4.32 | 743 | 1433 | 540 | 33 | 110 |
Arizona | 1 | 94 | 3.80 | 731 | 1414 | 442 | 58 | 90 |
Atlanta | 1 | 89 | 3.48 | 641 | 1332 | 521 | 52 | 83 |
Chicago Cubs | 1 | 71 | 4.33 | 654 | 1439 | 580 | 40 | 134 |
Cincinnati | 1 | 79 | 4.16 | 735 | 1414 | 539 | 39 | 91 |
Colorado | 1 | 73 | 4.43 | 735 | 1471 | 522 | 41 | 98 |
Houston | 1 | 56 | 4.51 | 615 | 1477 | 560 | 25 | 116 |
Los Angeles Dodgers | 1 | 82 | 3.54 | 644 | 1287 | 507 | 40 | 85 |
Miami | 1 | 72 | 3.95 | 625 | 1403 | 500 | 40 | 93 |
Milwaukee | 1 | 96 | 3.63 | 721 | 1348 | 440 | 47 | 111 |
New York Mets | 1 | 77 | 4.19 | 718 | 1482 | 514 | 43 | 116 |
Philadelphia | 1 | 102 | 3.02 | 713 | 1320 | 404 | 47 | 74 |
Pittsburgh | 1 | 72 | 4.04 | 610 | 1513 | 535 | 43 | 112 |
St. Louis | 1 | 90 | 3.74 | 762 | 1461 | 448 | 47 | 116 |
San Diego | 1 | 71 | 3.42 | 593 | 1324 | 521 | 44 | 94 |
San Francisco | 1 | 86 | 3.20 | 570 | 1260 | 559 | 52 | 104 |
Washington | 1 | 80 | 3.58 | 624 | 1403 | 477 | 49 | 104 |
a) Create a full write up, where you write your statistical analysis step by step. In your write up, make sure you address the following points.
· Methodology and steps that you took to get to your final regression equation.
· Final regression equation output.
· What is the final regression equation?
· Interpret all the coefficients in the equation.
· Speak to whether the signs on the coefficients make sense.
· Interpret R squared.
· Include a full residual analysis.
· What is the residual of the Tampa Bay observation?
b) Now, use tab BB2012 to make predictions of wins in 2012, using the model you created with the 2011 stats.
· How many games are the Giants (SFG) expected to win in 2012?
· Which team is predicted by the model to have the worst record in 2012?
· Which team is predicted by the model to have the best record in 2012?
Team | WINS | E.R.A. | Runs Scored | Hits Allowed | Walks Allowed | Saves | Errors |
ARI | 81 | 3.93 | 734 | 1432 | 417 | 39 | 91 |
ATL | 94 | 3.42 | 700 | 1310 | 464 | 47 | 89 |
BAL | 93 | 3.9 | 712 | 1433 | 481 | 55 | 109 |
BOS | 69 | 4.7 | 734 | 1449 | 529 | 35 | 100 |
CHC | 61 | 4.51 | 613 | 1399 | 573 | 28 | 105 |
CHW | 85 | 4.02 | 748 | 1365 | 503 | 37 | 70 |
CIN | 97 | 3.34 | 669 | 1356 | 427 | 56 | 89 |
CLE | 68 | 4.78 | 667 | 1503 | 543 | 43 | 97 |
COL | 64 | 5.22 | 758 | 1637 | 566 | 36 | 122 |
DET | 88 | 3.75 | 726 | 1409 | 438 | 40 | 97 |
HOU | 55 | 4.56 | 583 | 1493 | 540 | 31 | 118 |
KCR | 72 | 4.3 | 676 | 1504 | 542 | 44 | 113 |
LAA | 89 | 4.02 | 767 | 1339 | 483 | 38 | 97 |
LAD | 86 | 3.34 | 637 | 1277 | 539 | 40 | 97 |
MIA | 69 | 4.09 | 609 | 1448 | 495 | 38 | 104 |
MIL | 83 | 4.22 | 776 | 1458 | 525 | 44 | 99 |
MIN | 66 | 4.77 | 701 | 1536 | 465 | 35 | 107 |
NYM | 74 | 4.09 | 650 | 1368 | 488 | 36 | 102 |
NYY | 95 | 3.84 | 804 | 1401 | 431 | 51 | 76 |
OAK | 94 | 3.48 | 713 | 1360 | 462 | 47 | 110 |
PHI | 81 | 3.83 | 684 | 1387 | 409 | 42 | 100 |
PIT | 79 | 3.86 | 651 | 1357 | 490 | 45 | 112 |
SDP | 76 | 4.01 | 651 | 1356 | 539 | 43 | 123 |
SEA | 75 | 3.76 | 619 | 1359 | 449 | 43 | 71 |
SFG | 94 | 3.68 | 718 | 1361 | 489 | 53 | 112 |
STL | 88 | 3.71 | 765 | 1420 | 436 | 42 | 107 |
TBR | 90 | 3.19 | 697 | 1233 | 469 | 50 | 113 |
TEX | 93 | 3.99 | 808 | 1378 | 446 | 43 | 86 |
TOR | 73 | 4.64 | 716 | 1439 | 574 | 29 | 102 |
WSN | 98 | 3.33 | 731 | 1296 | 497 | 51 | 96 |
In: Math
Take a guess: if 1000 balls are dropped from the top, how will they be distributed in those slots in the bottom? Are they going to be distributed evenly at the bottom and form a horizontal line, or unevenly and form a roof-top shape, or something else? Jot down your guesses and your rationales. Go back to finish the video. 1. Share your initial guesses and final observations here. 2. If you agree it is a binomial distribution, what is the parameter n, and pi, in this case? (Hint: n is NOT the number of balls dropped.)
In: Math
Below, n is the sample size, p is the population proportion of successes, and X is the number of successes in the sample. Use the normal approximation and the TI-84 Plus calculator to find the probability. Round the answer to at least four decimal places.
n=76, p=0.41
P(28<X<38) = ____?
In: Math