A college's data about the incoming freshmen indicates that the mean of their high school GPAs was
3.5,
with a standard deviation of
.20;
the distribution was roughly mound-shaped and only slightly skewed. The students are randomly assigned to freshman writing seminars in groups of
25.
What might the mean GPA of one of these seminar groups be? Describe the appropriate sampling distribution model, including shape, center, and spread, with attention to assumptions and conditions. Make a sketch using the 68-95-99.7 Rule.
In: Math
PLEASE USE R PROGRAMMING TO SOLVE THIS AND GIVE A COPYABLE CODE
Professor David teach Math 1054. In that course, there are 5 exams, 4 midterm exams, plus final exam. You are asked to help him to check whether there is significant difference among the difficulty level in these 5 exams. The data is stored in Math1054Exam.txt.
a). read in data (copy your R code)
b). Find averages for these exams. You can use mean(V) function, or summary(data) function.
c). Are these averages differ significantly? We need to use ANOVA. For this purpose, you need to use stack function to change the data format as two column, one is score, the
other is exam index.
d). Produce a boxplot for the five exams.
e). Conduct ANOVA test.
f). Copy your ANOVA table.
g). Make your conclusion.
------------------------------------------------------------------------------4
Math1054Exam.txt.
T1 T2
T3 T4 Final
90 69 62 65
92
67 59 78 63
43
76 73 66 41
71
80 92 46 92
70
98 79 60 84
99
82 80 69 83
94
69 45 47 70
37
83 90 54 66
50
93 94 90 94
99
66 67 50 52
32
45 71 55 58
83
76 92 65 72
95
73 92 84 70
77
81 68 30 53
34
60 58 43 50
36
92 90 86 89
95
62 54 52 60
88
85 80 65 82
88
69 80 81 67
75
71 88 43 61
89
In: Math
You are a social worker in an OBGYN department of a hospital. The CEO of the hospital has asked you to write a report about the length of time that women spend in the hospital after giving birth. The CEO has told you that in the thinks that insured women spend longer on average in the hospital than uninsured women after childbirth. She wants you to test this claim with data from your hospital. Two samples of 16 women were taken. Test the claim that insured women spend longer in the hospital than uninsured women using an α = .01.
Insured:
Mean = 2.3 days
Standard Deviation = 0.77
Sample Size = 16
Uninsured:
Mean = 1.9 days
Standard Deviation = 0.77
Sample Size = 16
To complete this lab exercise, you should:
Identify whether you will test this claim using a 1-tailed hypothesis or a 2-tailed hypothesis.
State the Null and Research Hypotheses
Find the Critical Value using the T-Table and interpret what you will do with the null hypothesis given that Critical Value
Identify the Correct Degrees of Freedom you’ll use
HAND CALCULATE the t-value. Show your work. Start with the formula and then plug in the correct values from there.
Make a decision regarding the null. Interpret your decision with regard to this question.
Find the p-value range for the hypothesis test using your test statistic and the t-table.
BY HAND, construct a 98% Confidence Interval for the difference between insured and uninsured women. What does this confidence interval tell you?
What are your overall conclusions? Is the confidence interval interpretation consistent with your interpretation from the t-test?
In: Math
The data shown below contains the monthly sales (in thousands of dollars) at a local department store for each of the past 24 months. Copy the data to Excel, perform a runs test, and compute a few autocorrelations to determine whether this time series is random. Give your conclusion by referring to the results of your runs test and autocorrelations (no need to try to attach the Excel output in this text box, just mention the specific values you refer to).
|
Month |
Sales |
Month |
Sales |
|
1 |
987 |
13 |
1080 |
|
2 |
1080 |
14 |
1002 |
|
3 |
975 |
15 |
968 |
|
4 |
1060 |
16 |
984 |
|
5 |
1030 |
17 |
1045 |
|
6 |
895 |
18 |
945 |
|
7 |
908 |
19 |
1025 |
|
8 |
1059 |
20 |
950 |
|
9 |
940 |
21 |
1004 |
|
10 |
1038 |
22 |
1075 |
|
11 |
1050 |
23 |
969 |
|
12 |
1030 |
24 |
1029 |
In: Math
Please be sure to show all work to receive full credit. No credit will be given for an answer that is correct but has no work where it is necessary, or if the work showed does not logically equate to the given answer. No credit will be given for an incorrect answer with no work. Please try to answer all at one time.. please
1. The World Health Organization estimates that 5% of all adults in sub-Saharan Africa are living with HIV/AIDS. A survey takes a random sample of 1600 adults from all over sub-Saharan Africa and finds that 72 have HIV/AIDS.
a) What is the sample proportion and what is the population proportion?
b) What is the mean of the sampling distribution?
c) What is the standard deviation of the sampling distribution?
d) What is the probability that a random sample of 1600 adults in sub-Saharan Africa would have less than 4.5% living with HIV/AIDS?
2. In a random sample of 36 students at a college, the average IQ was x = 107. Assume that the IQ of a student in college follows a Normal distribution with unknown mean and standard deviation σ = 16.
a) Give a 90% confidence interval for μ.
b)Give a 99% confidence interval for µ.
c) Which confidence interval has a larger margin of error? Why does this make sense?
3) A researcher collects a random sample of size n from
a population with standard deviation σ and, from the data
collected, computes a 95% confidence interval for the mean of the
population. Which of the following would produce a new confidence
interval with a smaller width (smaller margin of error) based
on these same data?
A. Increase σ.
B. Use a lower confidence level.
C. Use a smaller sample size.
Explain why you chose your answer.
B) The IQ scores for adults in the entire population have an approximately normal distribution with mean 100 and standard deviation 15. The researcher’s question in “Do college students aged 20 to 25 years old have a higher mean IQ than the rest of the population?” What are the appropriate null and alternative hypothesis a researcher would have to use if they chose to do an experiment to try to answer this question? Explain.
C) A survey on the number of hours spent working out was given to professors from the US. Assume the standard deviation of the number of hours spent by all professors in the US was found to be 1.94 hours. At a 98% confidence level, how many professors would have to be surveyed in order to bring the margin of error down to 0.1?
In: Math
Sociologists say that 85% of married women claim that their husband's mother is the biggest bone of contention in their marriages (sex and money are lower-rated areas of contention). Suppose that ten married women are having coffee together one morning. Find the following probabilities. (For each answer, enter a number. Round your answers to three decimal places.)
(a) All of them dislike their mother-in-law. = 0.197
(b) None of them dislike their mother-in-law. = 0.00
(c) At least eight of them dislike their mother-in-law. =
(d) No more than seven of them dislike their mother-in-law. =
In: Math
n a survey of a group of men, the heights in the 20-29 age group were normally distributed, with a mean of 68.6 inches and a standard deviation of 4.0 inches. A study participant is randomly selected. Complete parts (a) through (d) below. (a) Find the probability that a study participant has a height that is less than 66 inches. The probability that the study participant selected at random is less than 66 inches tall is (b) Find the probability that a study participant has a height that is between 66 and 71 inches. The probability that the study participant selected at random is between 66 and 71 inches tall is (c) Find the probability that a study participant has a height that is more than 71 inches. The probability that the study participant selected at random is more than 71 inches tall
In: Math
1) Use worksheet “baseball stats” to perform a multiple regression analysis on the dataset found in BB2011 tab, using Wins as the dependent variable, and League, ERA, Runs Scored, Hits Allowed, Walks Allowed, Saves, and Errors as candidates for the independent variables. Perform the analysis at the 5% significance level.
| Team | League | Wins | E.R.A. | Runs Scored | Hits Allowed | Walks Allowed | Saves | Errors |
| Baltimore | 0 | 69 | 4.89 | 708 | 1568 | 535 | 32 | 110 |
| Boston | 0 | 90 | 4.20 | 875 | 1366 | 540 | 36 | 92 |
| Chicago White Sox | 0 | 79 | 4.10 | 654 | 1463 | 439 | 42 | 79 |
| Cleveland | 0 | 80 | 4.23 | 704 | 1482 | 463 | 38 | 110 |
| Detroit | 0 | 95 | 4.04 | 787 | 1406 | 492 | 52 | 103 |
| Kansas City | 0 | 71 | 4.44 | 730 | 1487 | 557 | 37 | 95 |
| Los Angeles Angels | 0 | 86 | 3.57 | 667 | 1388 | 476 | 39 | 93 |
| Minnesota | 0 | 63 | 4.58 | 619 | 1564 | 480 | 32 | 119 |
| New York Yankees | 0 | 97 | 3.73 | 867 | 1423 | 507 | 47 | 102 |
| Oakland | 0 | 74 | 3.71 | 645 | 1380 | 519 | 39 | 124 |
| Seattle | 0 | 67 | 3.90 | 556 | 1369 | 436 | 39 | 108 |
| Tampa Bay | 0 | 91 | 3.58 | 707 | 1263 | 504 | 32 | 73 |
| Texas | 0 | 96 | 3.79 | 855 | 1327 | 461 | 38 | 114 |
| Toronto | 0 | 81 | 4.32 | 743 | 1433 | 540 | 33 | 110 |
| Arizona | 1 | 94 | 3.80 | 731 | 1414 | 442 | 58 | 90 |
| Atlanta | 1 | 89 | 3.48 | 641 | 1332 | 521 | 52 | 83 |
| Chicago Cubs | 1 | 71 | 4.33 | 654 | 1439 | 580 | 40 | 134 |
| Cincinnati | 1 | 79 | 4.16 | 735 | 1414 | 539 | 39 | 91 |
| Colorado | 1 | 73 | 4.43 | 735 | 1471 | 522 | 41 | 98 |
| Houston | 1 | 56 | 4.51 | 615 | 1477 | 560 | 25 | 116 |
| Los Angeles Dodgers | 1 | 82 | 3.54 | 644 | 1287 | 507 | 40 | 85 |
| Miami | 1 | 72 | 3.95 | 625 | 1403 | 500 | 40 | 93 |
| Milwaukee | 1 | 96 | 3.63 | 721 | 1348 | 440 | 47 | 111 |
| New York Mets | 1 | 77 | 4.19 | 718 | 1482 | 514 | 43 | 116 |
| Philadelphia | 1 | 102 | 3.02 | 713 | 1320 | 404 | 47 | 74 |
| Pittsburgh | 1 | 72 | 4.04 | 610 | 1513 | 535 | 43 | 112 |
| St. Louis | 1 | 90 | 3.74 | 762 | 1461 | 448 | 47 | 116 |
| San Diego | 1 | 71 | 3.42 | 593 | 1324 | 521 | 44 | 94 |
| San Francisco | 1 | 86 | 3.20 | 570 | 1260 | 559 | 52 | 104 |
| Washington | 1 | 80 | 3.58 | 624 | 1403 | 477 | 49 | 104 |
a) Create a full write up, where you write your statistical analysis step by step. In your write up, make sure you address the following points.
· Methodology and steps that you took to get to your final regression equation.
· Final regression equation output.
· What is the final regression equation?
· Interpret all the coefficients in the equation.
· Speak to whether the signs on the coefficients make sense.
· Interpret R squared.
· Include a full residual analysis.
· What is the residual of the Tampa Bay observation?
b) Now, use tab BB2012 to make predictions of wins in 2012, using the model you created with the 2011 stats.
· How many games are the Giants (SFG) expected to win in 2012?
· Which team is predicted by the model to have the worst record in 2012?
· Which team is predicted by the model to have the best record in 2012?
| Team | WINS | E.R.A. | Runs Scored | Hits Allowed | Walks Allowed | Saves | Errors |
| ARI | 81 | 3.93 | 734 | 1432 | 417 | 39 | 91 |
| ATL | 94 | 3.42 | 700 | 1310 | 464 | 47 | 89 |
| BAL | 93 | 3.9 | 712 | 1433 | 481 | 55 | 109 |
| BOS | 69 | 4.7 | 734 | 1449 | 529 | 35 | 100 |
| CHC | 61 | 4.51 | 613 | 1399 | 573 | 28 | 105 |
| CHW | 85 | 4.02 | 748 | 1365 | 503 | 37 | 70 |
| CIN | 97 | 3.34 | 669 | 1356 | 427 | 56 | 89 |
| CLE | 68 | 4.78 | 667 | 1503 | 543 | 43 | 97 |
| COL | 64 | 5.22 | 758 | 1637 | 566 | 36 | 122 |
| DET | 88 | 3.75 | 726 | 1409 | 438 | 40 | 97 |
| HOU | 55 | 4.56 | 583 | 1493 | 540 | 31 | 118 |
| KCR | 72 | 4.3 | 676 | 1504 | 542 | 44 | 113 |
| LAA | 89 | 4.02 | 767 | 1339 | 483 | 38 | 97 |
| LAD | 86 | 3.34 | 637 | 1277 | 539 | 40 | 97 |
| MIA | 69 | 4.09 | 609 | 1448 | 495 | 38 | 104 |
| MIL | 83 | 4.22 | 776 | 1458 | 525 | 44 | 99 |
| MIN | 66 | 4.77 | 701 | 1536 | 465 | 35 | 107 |
| NYM | 74 | 4.09 | 650 | 1368 | 488 | 36 | 102 |
| NYY | 95 | 3.84 | 804 | 1401 | 431 | 51 | 76 |
| OAK | 94 | 3.48 | 713 | 1360 | 462 | 47 | 110 |
| PHI | 81 | 3.83 | 684 | 1387 | 409 | 42 | 100 |
| PIT | 79 | 3.86 | 651 | 1357 | 490 | 45 | 112 |
| SDP | 76 | 4.01 | 651 | 1356 | 539 | 43 | 123 |
| SEA | 75 | 3.76 | 619 | 1359 | 449 | 43 | 71 |
| SFG | 94 | 3.68 | 718 | 1361 | 489 | 53 | 112 |
| STL | 88 | 3.71 | 765 | 1420 | 436 | 42 | 107 |
| TBR | 90 | 3.19 | 697 | 1233 | 469 | 50 | 113 |
| TEX | 93 | 3.99 | 808 | 1378 | 446 | 43 | 86 |
| TOR | 73 | 4.64 | 716 | 1439 | 574 | 29 | 102 |
| WSN | 98 | 3.33 | 731 | 1296 | 497 | 51 | 96 |
In: Math
Take a guess: if 1000 balls are dropped from the top, how will they be distributed in those slots in the bottom? Are they going to be distributed evenly at the bottom and form a horizontal line, or unevenly and form a roof-top shape, or something else? Jot down your guesses and your rationales. Go back to finish the video. 1. Share your initial guesses and final observations here. 2. If you agree it is a binomial distribution, what is the parameter n, and pi, in this case? (Hint: n is NOT the number of balls dropped.)
In: Math
Below, n is the sample size, p is the population proportion of successes, and X is the number of successes in the sample. Use the normal approximation and the TI-84 Plus calculator to find the probability. Round the answer to at least four decimal places.
n=76, p=0.41
P(28<X<38) = ____?
In: Math
For this problem, carry at least four digits after the decimal
in your calculations. Answers may vary slightly due to
rounding.
A random sample of 5300 permanent dwellings on an entire
reservation showed that 1690 were traditional hogans.
(a) Let p be the proportion of all permanent dwellings
on the entire reservation that are traditional hogans. Find a point
estimate for p. (Round your answer to four decimal
places.)
(b) Find a 99% confidence interval for p. (Round your
answer to three decimal places.)
| lower limit = | |
| upper limit = |
Give a brief interpretation of the confidence interval.
1% of the confidence intervals created using this method would
include the true proportion of traditional hogans.1
% of all confidence intervals would include the true proportion of
traditional hogans.
99% of all confidence intervals would include the true proportion
of traditional hogans.
99% of the confidence intervals created using this method would
include the true proportion of traditional hogans.
(c) Do you think that np > 5 and nq > 5 are
satisfied for this problem? Explain why this would be an important
consideration.
No, the conditions are not satisfied. This is important because
it allows us to say that p̂ is approximately
binomial.
Yes, the conditions are satisfied. This is important because it
allows us to say that p̂ is approximately
normal.
Yes, the conditions are satisfied. This is important because it
allows us to say that p̂ is approximately binomial.
No, the conditions are not satisfied. This is important because it
allows us to say that p̂ is approximately normal.
In: Math
A group of 51 college students from a certain liberal arts college were randomly sampled and asked about the number of alcoholic drinks they have in a typical week. The purpose of this study was to compare the drinking habits of the students at the college to the drinking habits of college students in general. In particular, the dean of students, who initiated this study, would like to check whether the mean number of alcoholic drinks that students at his college in a typical week differs from the mean of U.S. college students in general, which is estimated to be 4.73.
The group of 51 students in the study reported an average of 4.35 drinks per with a standard deviation of 3.88 drinks.
Find the p-value for the hypothesis test.
The p-value should be rounded to 4-decimal places.
In: Math
A company received a shipment from a supplier. The supplier company said the mean height of a tire is 21 inches and a standard deviation is 3 inches. The company is planning to choose 55 tires at random. What is the sample mean between 20 and 22? How many tires are likely to be taller than 25 iches?
In: Math
Gru's schemes have a 9% chance of succeeding. An agent of the Anti-Villain League obtains access to a simple random sample of 1100 of Gru's upcoming schemes. Find the probability that... (Answers should be to four places after the decimal, using chart method, do NOT use the continuity correction):
...less than 101 schemes will succeed:
...more than 95 schemes will succeed:
...between 95 and 101 schemes will succeed:
...less than 8.5% of schemes will succeed:
...more than 9.5% of schemes will succeed:
...between 8.5% and 9.5% of schemes will succeed:
In: Math
How are the population mean and the mean of the sampling distribution of the mean related?
-What is the standard error of the mean? What does it measure?
-What effect does increasing sample size have on the size of the standard error?
-What does the central limit theorem state?
-What are the units of a z-score?
-How are z-scores related to standard deviation and the mean?How are they interpreted?
-Is the α always a certain value, or is it up to the investigator?
-Is the p-value of a calculated statistic a designated number, or is it up to the investigator?
-How can we reduce Type I error?
-In hypothesis testing, how can you reach a decision about the null hypothesis by comparing yourcalculated statistic to the appropriate critical value?
-In hypothesis testing, how can you reach a decision about the null hypothesis by comparing the p-value from your calculated statistic to your alpha level?
-What is the interpretation of a confidence interval?
-What determines the width of a confidence interval? How is it affected by sample size and α?
-What do we use hypothesis testing to determine? What do we use confidence intervals to determine?
-When can we use z-tests and when do we have to, instead, use t-tests?
-How does the t-distribution compare to the z-distribution?
-What does it mean when two samples are “related”?
-Given two related samples, what are “repeated measures”? What are “matched pairs”?
In: Math