A university wants to study the experience of students enrolled in its big classes, defined asclasses with enrollments of 500 or more. There are 20 such classes. From each of these classes,one enrolled student is chosen uniformly at random to take part in the university’s survey. Youcan assume that the selection from each class is performed independently of the selections inthe other classes. In this scenario: (T / F)
1. The method of sampling produces a probability sample of students enrolled in the big classes. (T / F)
2. The method of sampling produces a simple random sample of students enrolled in the big classes. (T / F)
3. Because a student is chosen from each class, all students in the big classes have the same chance of being selected. (T / F)
4. Because a student is chosen from each of 20 big classes, there will be 20 students in the sample. (T / F)
In: Statistics and Probability
Consider the following hypothesis test:
H 0: 50
H a: > 50
A sample of 65 is used and the population standard deviation is 7. Use the critical value approach to state your conclusion for each of the following sample results. Use = .05.
a. With = 52.5, what is the value of the test
statistic (to 2 decimals)?
Can it be concluded that the population mean is greater than
50?
SelectYesNoItem 2
b. With = 51, what is the value of the test statistic
(to 2 decimals)?
Can it be concluded that the population mean is greater than
50?
SelectYesNoItem 4
c. With = 51.8, what is the value of the test
statistic (to 2 decimals)?
Can it be concluded that the population mean is greater than
50?
SelectYesNoItem 6
In: Statistics and Probability
You want to estimate the difference between the average grades on a certain math exam before the students take the associated math class and after they take the associated math class. You take the random samples of 5 students who have taken the class and 5 students who have not taken the class. You get the following results:
Not Taken | Taken |
54 | 82 |
25 | 76 |
73 | 98 |
23 | 43 |
42 | 38 |
A) Determine the population(s) and parameter(s) being discussed.
B) Determine which tool will help us find what we need (one sample z test, one sample t test, two sample t test, one sample z interval, one sample t interval, two sample t interval).
C) Check if the conditions for this tool hold.
D) Whether or not the conditions hold, use the tool you choose in part B. Use C=95% for all confidence intervals and alpha=5% for all significance tests.
* Be sure that all methods end with a sentence describing the results *
In: Statistics and Probability
You want to know if, on average, households have more cats or more dogs. You take an SRS of 8 households and find the data below:
# of cats | # of dogs |
2 | 1 |
0 | 1 |
3 | 3 |
2 | 4 |
0 | 0 |
4 | 2 |
0 | 2 |
2 | 1 |
A) Determine the population(s) and parameter(s) being discussed.
B) Determine which tool will help us find what we need (one sample z test, one sample t test, two sample t test, one sample z interval, one sample t interval, two sample t interval).
C) Check if the conditions for this tool hold.
D) Whether or not the conditions hold, use the tool you choose in part B. Use C=95% for all confidence intervals and alpha=5% for all significance tests.
* Be sure that all methods end with a sentence describing the results *
In: Statistics and Probability
Independent samples t-test: An experimenter is interested in how the “foot-in-the-door” tactic could increase compliance in college students. The “foot-in-the-door” tactic involves asking first for a small request to butter someone up for a larger (originally intended) request. Typically, the “foot-in-the-door” tactic increases compliance because people do not like to appear inconsistent and since they originally agreed to the small request, they feel compelled to also agree with the large request.
Our experimenter was seeking to increase the number of students who volunteer for research in the department of psychological sciences. One group of students were subject to the “foot-in- the-door” tactic and these students were first asked to wear a sticker promoting research participation (small request) around campus. Later, these same students were asked to participate in a research project. The second group of students were simply asked to participate in research with no initial, small request.
This study was conducted across multiple semesters, and the value below represents how many students agreed to participate in research each semester.
“foot-in-the-door” (FITD): 11 8 10 6 5
Large request only (Large): 6 6 9 10 8
Now we’ll use this information for hypothesis testing....
What are your degrees of freedom for this independent samples t-test? 2pts.
What is your critical t-value (t-crit) for this hypothesis test? 2pts.
Next we calculate the t-value for our sample to compare to this critical value (see table below for reminders)......
What is the pooled variance for our sample? 4 pts. Pooled Variance: ??1+ ??2 =
?? 1+ ?? 2
What is the standard error of the mean difference (??1−?2)for our sample? 4 pts.
What is the value for t-observed? 4 pts.
Is there a statistically significant difference between these groups? 3pts.
In: Statistics and Probability
Which variable (lending interest rate or investment in physical capital) do you think should be the dependent variable in this regression model? Please justify your answer. [2 points]
In: Statistics and Probability
5 Two different fish attractors were compared during 16 time periods spanning 4 years. (Wilbur, R. L. (1978). Two types of fish attractors compared in Lake Tohopekaliga, Florida. Transactions of the American Fisheries Society, 107(5), 689-695.) I present the data as a set of ordered pairs where in each case the first entry is the “pipe attractor” and the second entry is the “brush attractor”
{(6.64,9.73), (7.89,8.21),(1.83,2.17),(.42,.75),(.85,1.61),(.29,.75),(.57,.83),(.63,.56),(.32,.76),(.37,.32),(.00,.48),(.11.,.52),(4.86,5.38),(1.80,2.33),(.23,.91,),(.58,.79)}
(a)(4 pts) Perform the appropriate paired parameteric t-based test to compare the means.
(b)(6 pts) What happens if the two independent samples t-test is used? Make sure to perform all appropriate tests.
Is it possible to answer this questions with the code in Rstudio codes? If not the regular answer will be appreciated.
In: Statistics and Probability
Case Study 1 - Data Visualization and Descriptive Statistics The data file Home_Values.xlsx contains median home values (Home Value), median household income (HH Inc), median per capita (Per Cap Inc) and percent of homes that are owner occupied (Pct Owner Occ) for each state and the District of Columbia. Prior to a more detailed analysis of the data, a company wants to get a good understanding of the 4 variables (e.g. central tendency, variability, shape of the distribution, pattern of relationship between the variables). A company representative contracts with you to help with this process. To help the company get a better understanding of the data, you are asked to perform the following analysis steps:
State | Home Value | HH Inc | Per Cap Inc | Pct Owner Occ |
New York | 303900 | 55603 | 30948 | 55.2 |
North Carolina | 149100 | 45570 | 24745 | 68.1 |
North Dakota | 111300 | 46781 | 25803 | 66.6 |
Ohio | 136400 | 47358 | 25113 | 69.2 |
Oklahoma | 104300 | 42979 | 23094 | 68.2 |
Oregon | 252600 | 49260 | 26171 | 63.8 |
Pennsylvania | 159300 | 50398 | 27049 | 71 |
Rhode Island | 279300 | 54902 | 28707 | 62.5 |
South Carolina | 134100 | 43939 | 23443 | 69.9 |
South Dakota | 122200 | 46369 | 24110 | 68.9 |
Tennessee | 134100 | 43314 | 23722 | 69.6 |
Texas | 123500 | 49646 | 24870 | 64.8 |
Utah | 218100 | 56330 | 23139 | 71.2 |
Vermont | 208400 | 51841 | 27478 | 71.4 |
Virginia | 255100 | 61406 | 32145 | 68.9 |
Washington | 285400 | 57244 | 29733 | 64.8 |
West Virginia | 94500 | 38380 | 21232 | 74.6 |
Wisconsin | 169000 | 51598 | 26624 | 69.5 |
Wyoming | 174000 | 53802 | 27860 | 70.2 |
In: Statistics and Probability
From public records, individuals were identified as having been charged with drunken driving not less than 6 months or more than 12 months from the starting date of the study. Two random samples from this group were studied. In the first sample of 35 individuals, the respondents were asked in a face-to-face interview if they had been charged with drunken driving in the last 12 months. Of these 35 people interviewed face to face, 15 answered the question accurately. The second random sample consisted of 50 people who had been charged with drunken driving. During a telephone interview, 28 of these responded accurately to the question asking if they had been charged with drunken driving during the past 12 months. Assume the samples are representative of all people recently charged with drunken driving.
(a) Categorize the problem below according to parameter being estimated, proportion p, mean μ, difference of means μ1 – μ2, or difference of proportions p1 – p2. Then solve the problem.
μp p1 – p2μ1 – μ2
(b) Let p1 represent the population proportion
of all people with recent charges of drunken driving who respond
accurately to a face-to-face interview asking if they have been
charged with drunken driving during the past 12 months. Let
p2 represent the population proportion of all
people who respond accurately to the question when it is asked in a
telephone interview. Find a 95% confidence interval for
p1 – p2. (Use 3 decimal
places.)
lower limit | |
upper limit |
(c) Does the interval found in part (a) contain numbers that are all positive? all negative? mixed? Comment on the meaning of the confidence interval in the context of this problem. At the 95% level, do you detect any differences in the proportion of accurate responses to the question from face-to- face interviews as compared with the proportion of accurate responses from telephone interviews?
Because the interval contains only positive numbers, we can say that there is a higher proportion of accurate responses in face-to-face interviews.Because the interval contains both positive and negative numbers, we can not say that there is a higher proportion of accurate responses in face-to-face interviews. We can not make any conclusions using this confidence interval.Because the interval contains only negative numbers, we can say that there is a higher proportion of accurate responses in telephone interviews.
In: Statistics and Probability
How profitable are different sectors of the stock market? One way to answer such a question is to examine profit as a percentage of stockholder equity. A random sample of 25 retail stocks such as Toys 'R' Us, Best Buy, and Gap was studied for x1, profit as a percentage of stockholder equity. The result was x1 = 14.8. A random sample of 30 utility (gas and electric) stocks such as Boston Edison, Wisconsin Energy, and Texas Utilities was studied for x2, profit as a percentage of stockholder equity. The result was x2 = 10.0. Assume that σ1 = 4.9 and σ2 = 2.3.
Categorize the problem below according to parameter being estimated, proportion p, mean μ, difference of means μ1 – μ2, or difference of proportions p1 – p2. Then solve the problem.
pμ1 – μ2 p1 – p2μ
(b) Let μ1 represent the population mean profit
as a percentage of stockholder equity for retail stocks, and let
μ2 represent the population mean profit as a
percentage of stockholder equity for utility stocks. Find a 95%
confidence interval for μ1 –
μ2. (Use 1 decimal place.)
lower limit | |
upper limit |
(c) Examine the confidence interval and explain what it means in the context of this problem. Does the interval consist of numbers that are all positive? all negative? of different signs? At the 95% level of confidence, does it appear that the profit as a percentage of stockholder equity for retail stocks is higher than that for utility stocks?
In: Statistics and Probability
In: Statistics and Probability
A researcher wanted to test the theory that people who live in urban environments are more liberal compared to people who live in suburban environments. To test this, they gave a group of people from the suburbs and a group of people from the city a measure of liberalness (the higher the score, the more liberal-minded you are). The data are given below.
Suburban: ?=15, n=60, ?$ = 3.81 $# Urban: ?=20, n=60, ?$ = 3.39
Run the appropriate hypothesis test (all 4 steps) to test the hypothesis people in urban and suburban environments have significantly different levels of liberalness. Use an alpha of α=0.01, two-tailed test. (Use the back of your sheet to complete this problem). Are you surprised at all with your results? Why/Why not?
In: Statistics and Probability
What is the probability that you would get heads-up when flipping a coin ONE time?
What is the probability that you would get heads-up when flipping a coin TEN times?
What is the probability that you would get heads-up on two coins flipped at the same time?
What is the probability that you would roll a 5 on a single dice if you rolled it five times?
What is the probability that you would roll a 3 on a single dice if you rolled it five times?
In: Statistics and Probability
One population proportion Two population proportions
One population mean Two populations means (Independent samples)
One population standard deviation Two populations means (paired samples)
Two population standard deviations
Proportion Stats – One sample Proportion Stats-Two samples
Variance Stats – One sample Variance Stats – Two samples
T Stats - One sample T Stats-Two samples (Independent)
T Stats - Paired
In: Statistics and Probability
You wish to test the following claim (HaHa) at a significance
level of α=0.05α=0.05.
Ho:μ=87.9Ho:μ=87.9
Ha:μ≠87.9Ha:μ≠87.9
You believe the population is normally distributed, but you do not
know the standard deviation. You obtain the following sample of
data:
Column A | Column B | Column C | Column D | Column E |
---|---|---|---|---|
94.5 | 85.8 | 82.7 | 93.3 | 86.4 |
81.6 | 96 | 92.2 | 82.5 | 92.5 |
83.8 | 93.3 | 79.2 | 85.4 | 100 |
93.9 | 79.2 | 82.9 | 84 | 87.9 |
86.7 | 93.6 | 87.3 | 89.9 | 91 |
70.7 | 91.3 | 86 | 102.4 | 81.1 |
86.9 | 91.7 | 86.2 | 91.3 | 84 |
99.2 | 76.2 | 96 | 72.1 | 91 |
92.2 | 88.5 | 93 | 81.1 | 85.4 |
81.6 | 90.4 | 83.2 | 94.9 | 100 |
76.7 | 80.6 | 81.1 | 93.6 | 82.9 |
What is the test statistic for this sample? (Report answer accurate
to three decimal places.)
test statistic =
What is the p-value for this sample? (Report answer accurate to
four decimal places.)
p-value =
The p-value is...
This test statistic leads to a decision to...
As such, the final conclusion is that...
In: Statistics and Probability