Paired-samples t test and graduate admissions: Is it harder to get into graduate programs in psychology or in history? We randomly selected five institutions from among all U.S. institutions with graduate programs. The first number for each is the minimum grade point average (GPA) for applicants to the psychology doctoral program, and the second is for applicants to the history doctoral program. These GPAs were posted on the Web site of the well-known college guide company Peterson’s. Wayne State University: 3.0, 2.75 University of Iowa: 3.0, 3.0 University of Nevada, Reno: 3.0, 2.75 George Washington University: 3.0, 3.0 University of Wyoming: 3.0, 3.0
The participants are not people; explain why it is appropriate to use a paired-samples t test for this situation.
Conduct all six steps of a paired-samples t test. Be sure to label all six steps.
Calculate the effect size and explain what this adds to your analysis.
Report the statistics as you would in a journal article.
In: Statistics and Probability
Determine whether the BEST, most common interpretation of the given statement is:
TRUE - Select 1
FALSE - Select 2
Question 1 options:
|
|
Question 2 (20 points)
In: Statistics and Probability
3.
(3 pt) A certain data scientist is testing his predictor (essentially a tool given historical
data and would make predictions using said data) that would make predictions about
Philadelphia’s temperatures. The predictor produces a correct temperature if the
prediction falls within three standard deviations from the target value. Assume that the
expected value of each prediction equals the target value. What is the accuracy of the
predictor if the distribution of measurements is uniform?
In: Statistics and Probability
Two plots at Rothamsted Experimental Station were studied for production of wheat straw. For a random sample of years, the annual wheat straw production (in pounds) from one plot was as follows.
6.61 | 6.89 | 6.75 | 5.91 | 7.31 | 7.18 |
7.06 | 5.79 | 6.24 | 5.91 | 6.14 |
Use a calculator to verify that, for this plot, the sample
variance is s2 ≈ 0.305.
Another random sample of years for a second plot gave the following
annual wheat production (in pounds).
7.38 | 7.31 | 7.10 | 7.03 | 7.22 | 5.58 | 5.47 | 5.86 |
Use a calculator to verify that the sample variance for this
plot is s2 ≈ 0.685.
Test the claim that there is a difference (either way) in the
population variance of wheat straw production for these two plots.
Use a 5% level of signifcance.
(a) What is the level of significance?
State the null and alternate hypotheses.
Ho: σ12 =
σ22; H1:
σ12 >
σ22
Ho: σ12 >
σ22; H1:
σ12 =
σ22
Ho: σ22 =
σ12; H1:
σ22 >
σ12
Ho: σ12 =
σ22; H1:
σ12 ≠
σ22
(b) Find the value of the sample F statistic. (Use 2
decimal places.)
What are the degrees of freedom?
dfN | |
dfD |
What assumptions are you making about the original distribution?
The populations follow independent normal distributions.
The populations follow independent normal distributions. We have
random samples from each population.
The populations follow independent chi-square distributions. We
have random samples from each population.
The populations follow dependent normal distributions. We have
random samples from each population.
(c) Find or estimate the P-value of the sample test
statistic. (Use 4 decimal places.)
p-value > 0.200
0.100 < p-value <
0.200
0.050 < p-value < 0.100
0.020 < p-value < 0.050
0.002 < p-value < 0.020
p-value < 0.002
(d) Based on your answers in parts (a) to (c), will you reject or
fail to reject the null hypothesis?
At the α = 0.05 level, we reject the null hypothesis
and conclude the data are not statistically significant.
At the α = 0.05 level, we reject the null hypothesis and
conclude the data are statistically significant.
At the α = 0.05 level, we fail to reject the null
hypothesis and conclude the data are not statistically
significant.
At the α = 0.05 level, we fail to reject the null
hypothesis and conclude the data are statistically significant.
(e) Interpret your conclusion in the context of the
application.
Fail to reject the null hypothesis, there is sufficient evidence
that the variance in annual wheat production differs between the
two plots.
Reject the null hypothesis, there is insufficient evidence that the
variance in annual wheat production differs between the two
plots.
Reject the null hypothesis, there is sufficient evidence that the
variance in annual wheat production differs between the two
plots.
Fail to reject the null hypothesis, there is insufficient evidence
that the variance in annual wheat production differs between the
two plots.
In: Statistics and Probability
Suppose that the weight of an newborn fawn is Uniformly distributed between 2.5 and 3.2 kg. Suppose that a newborn fawn is randomly selected. Round answers to 4 decimal places when possible.
a. The mean of this distribution is
b. The standard deviation is
c. The probability that fawn will weigh exactly 3.1 kg is P(x = 3.1) =
d. The probability that a newborn fawn will be weigh between 2.7 and 3 is P(2.7 < x < 3) =
e. The probability that a newborn fawn will be weigh more than 2.74 is P(x > 2.74) = f. P(x > 2.6 | x < 3) =
g. Find the 50th percentile.
In: Statistics and Probability
A) Use the results of Groth (1992), where researchers conducted a large observational study where n=3647 crossbills were observed in the wild, 1752 of which were right-billed and 1895 of which were left-billed to conduct a z-test. Report your test statistic, p-value, and estimated effect size, each rounded to four decimal places:
z-score:
p-value:
est. effect size:
B) Create a 95% confidence interval for the true proportion of European cross-bills that are right-billed. Submit your lower and upper bounds rounded out to four decimal places.
CI( , )
In: Statistics and Probability
64, 66, 64, 66, 59, 62, 67, 61, 64, 58, 54, 66
I think this claim is (Valid / Invalid). Reasons:
In: Statistics and Probability
For each description below, identify each underlined number as a parameter or statistic. Use appropriate notation to describe each number, e.g., p^ = 0.96.
A) The National Center for Health Statistics reports that the mean systolic blood pressure for males 35 to 44 years of age is 128 and the standard deviation is 15. The medical director of a large company looks at the medical records of 72 executives in this age group and finds that the mean systolic blood pressure for these executives is 126.07.
B) A 1993 survey conducted by the Richmond Times-Dispatch one week before Election Day asked voters which candidate for the state’s attorney general they would vote for. 37% of the respondents said they would vote for the Democratic candidate. On Election Day, 41% actually voted for the Democratic candidate.
In: Statistics and Probability
Mr. Mario Mendoza is a famous baseball player (Shortstop) in the Major League Baseball. Usually, he can successfully hit 2 out of 10 at bats. Therefore, the Mendoza line is named after him.
In: Statistics and Probability
(15.12) The gypsy moth is a serious threat to oak and aspen trees. A state agriculture department places traps throughout the state to detect the moths. When traps are checked periodically, the mean number of moths trapped is only 0.6 , but some traps have several moths. The distribution of moth counts is discrete and strongly skewed, with standard deviation 0.8 .
|
|
(15.28) Almost all medical schools in the United States require students to take the Medical College Admission Test (MCAT). To estimate the mean score μμ of those who took the MCAT on your campus, you will obtain the scores of an SRS of students. The scores follow a Normal distribution, and from published information you know that the standard deviation is 6.5. Suppose that (unknown to you) the mean score of those taking the MCAT on your campus is 26. In answering the following, use z-scores rounded to two decimal places. If you choose one student at random, what is the probability (±±0.0001) that the student's score is between 20 and 30? ___ You sample 21 students. What is the standard deviation (±±0.01) of sampling distribution of their average score x¯¯¯x¯? ____ What is the probability (±±0.0001) that the mean score of your sample is between 20 and 30? ____ |
In: Statistics and Probability
Let that across the two semesters of this statistics course, you have been shown both Bayesian and Classical approaches to Inference. A fellow student in a biological sciences course is working with statistics, but does not understand the two approaches. Provide them with a short explanation of the differences and similarities between the approaches, to help them with their use of statistics. {approx. 500 words}.
In: Statistics and Probability
We play a game with a deck of 52 regular playing cards, of which 26 arered and 26 are black. They’re randomly shuffled and placed face down ona table. You have the option of “taking” or “skipping” the top card. Ifyou skip the top card, then that card is revealed and we continue playingwith the remaining deck. If you take the top card, then the game ends;you win if the card you took was revealed to be black, and you lose if itwas red. If we get to a point where there is only one card left in the deck,you must take it. Prove that you have no better strategy than to take thetop card – which means your probability of winning is 1/2.
Hint: Prove by induction the more general claim that for a randomlyshuffled deck ofncards that are red or black – not necessarily with thesame number of red cards and black cards – there is no better strategy than taking the top card
In: Statistics and Probability
Geo stats question:
2)Drivers wishing to turn left at a particular intersection arrive at an average rate of five per minute. (i) if the left-turn arrow is red for 30 seconds, and there is room in the left-turn lane for five cars, what is the probability that the capacity of the lane will be exceeded for a given cycle of the signal? (ii) Given that transportation planners and traffic engineers wish to reduce the probability to less then 0.05 by shortening the length of the red signal, how would you determine the maximal time the signal could remain red?
In: Statistics and Probability
popgro~h lexp gnppc safewa~r
-------------+------------------------------------
popgrowth | 1.0000
lexp | -0.4750 1.0000
gnppc | -0.4902 0.6948 1.0000
safewater | -0.3935 0.8177 0.7063 1.000
Given this table,
In: Statistics and Probability
The FBI wants to determine the effectiveness of their 10 Most Wanted list. To do so, they need to find out the fraction of people who appear on the list that are actually caught.
Suppose a sample of 879 suspected criminals is drawn. Of these people, 677 were not captured. Using the data, construct the 95% confidence interval for the population proportion of people who are captured after appearing on the 10 Most Wanted list. Round your answers to three decimal places. Answer
In: Statistics and Probability