In: Math
indicate the term not synonymous with the others
In: Math
How is the rejection region defined, and how is that related to the p value? When do you reject or fail to reject the null hypothesis? Why do you think statisticians are asked to complete hypothesis testing? Can you think of examples in courts, in medicine, or in your area?
In: Math
Type “ineffective charts,” “ineffective graphs,” “unethical charts,” or some version thereof into Google Images. Find two or three differently-designed graphs and charts and discuss why the data visualizations are not effective.
In: Math
Suppose your research assistant screwed up and lost the information that linked the person’s identity across the two weight loss periods. This makes it impossible to run a paired t-test. Rather than start over:
a) Compute the mean and standard deviation of the two samples (2-pts)
b) Compute the two sample t-statistic (2pts)
c) How many degrees of freedom do you have(3pts)?
d) compute the P-value (4pts)
e) How does this P-value compare to the one you just computed using the paired ttest? (3pts))
Two Sample t-test (16pts):
Suppose you are interested in deciding if the 1990 Toyota Four Runner has been equally reliable as the 1990 Honda Passport. You go out a randomly sample of 5 people who own a 1990 Toyota and 5 other people who own a 1990 Honda and you ask them how often they have to take their vehicles in for maintenance. Here are your data (in thousands of miles):
Toyota: 30 35 32 34 30
Honda: 29 33 28 31 27
a) State the null and alternative hypotheses (2pts)
b) Compute the means and standard deviations of the two samples (2-pts)
c) Compute the two sample t-statistic (2 pts)
c) How many degrees of freedom do you have? (3pts)
d) Compute the P-value (4pts)
e) At an alpha = 0.05 would you accept or reject the null hypothesis? (3pts)
Please show work! thank you!
In: Math
8. A fair coin is tossed 60 times. Find the probability that the head appears between 22 and 40 times by using
a. binomial distribution,
b. approximation of Binomial distribution by normal distribution. Discuss why b. is better in practice.
In: Math
ID Year
CornYield SoyBeanYield
1 1957
48.3 23.2
2 1958
52.8 24.2
3 1959
53.1 23.5
4 1960
54.7 23.5
5 1961
62.4 25.1
6 1962
64.7 24.2
7 1963
67.9 24.4
8 1964
62.9 22.8
9 1965
74.1 24.5
10 1966
73.1 25.4
11 1967
80.1 24.5
12 1968
79.5 26.7
13 1969
85.9 27.4
14 1970
72.4 26.7
15 1971
88.1 27.5
16 1972
97 27.8
17 1973
91.3 27.8
18 1974
71.9 23.7
19 1975
86.4 28.9
20 1976
88 26.1
21 1977
90.8 30.6
22 1978
101 29.4
23 1979
109.5 32.1
24 1980
91 26.5
25 1981
108.9 30.1
26 1982
113.2 31.5
27 1983
81.1 26.2
28 1984
106.7 28.1
29 1985
118 34.1
30 1986
119.4 33.3
31 1987
119.8 33.9
32 1988
84.6 27.0
33 1989
116.3 32.3
34 1990
118.5 34.1
35 1991
108.6 34.2
36 1992
131.5 37.6
37 1993
100.7 32.6
38 1994
138.6 41.4
39 1995
113.5 35.3
40 1996
127.1 37.6
41 1997
126.7 38.9
42 1998
134.4 38.9
43 1999
133.8 36.6
44 2000
136.9 38.1
45 2001
138.2 39.6
46 2002
129.3 38.0
47 2003
142.2 33.9
48 2004
160.3 42.2
49 2005
147.9 43.1
50 2006
149.1 42.9
51 2007
150.7 41.7
Use both predictors. From the previous two exercises, we conclude that year and soybean may be useful together in a model for predicting corn yield. Run this multiple regression.
a) Explain the results of the ANOVA F test. Give the null and alternate hypothesis, test statistic with degrees of freedom, and p-value. What do you conclude?
b) What percent of the variation in corn yield in explained by these two variables? Compare it with the percent explained in the previous simple linear regression models.
c) State the regression model. Why do the coefficients for year and soybean differ from those in the previous exercises?
d) Summarize the significance test results for the regression coefficients for year and soybean yield.
e) Give a 95% confidence interval for each of these coefficients.
f) Plot the residual versus year and soybean yield. What do you conclude?
In: Math
Complete the following by writing a response to
three of the four following questions. For each
question, your response should be 2 or more paragraphs.
Describe how you could use confidence intervalsto help make a
decision in your current job, a past job, or life situation.
Include a description of the decision, how the interval would
impact the decision, and how data could ideally be collected to
determine the interval.
In: Math
Analyses of drinking water samples for 100 homes in each of two different sections of a city gave the following means and standard deviations of lead levels (in parts per million).
| section 1 | section 2 | |
| sample size | 100 | 100 |
| mean | 34.5 | 36.2 |
| standard deviation | 5.8 | 6.0 |
(a) Calculate the test statistic and its p-value to test for a difference in the two population means. (Use Section 1 − Section 2. Round your test statistic to two decimal places and your p-value to four decimal places.)
z =
p-value =
Use the p-value to evaluate the statistical significance of the results at the 5% level.
a. H0 is not rejected. There is sufficient evidence to indicate a difference in the mean lead levels for the two sections of the city.
b. H0 is rejected. There is sufficient evidence to indicate a difference in the mean lead levels for the two sections of the city.
c. H0 is rejected. There is insufficient evidence to indicate a difference in the mean lead levels for the two sections of the city.
d. H0 is not rejected. There is insufficient evidence to indicate a difference in the mean lead levels for the two sections of the city.
(b) Calculate a 95% confidence interval to estimate the difference in the mean lead levels in parts per million for the two sections of the city. (Use Section 1 − Section 2. Round your answers to two decimal places.)
parts per million______ to________parts per million
(c) Suppose that the city environmental engineers will be concerned only if they detect a difference of more than 5 parts per million in the two sections of the city. Based on your confidence interval in part (b), is the statistical significance in part (a) of practical significance to the city engineers? Explain.
a. Since all of the probable values of μ1 − μ2 given by the interval are all less than −5, it is likely that the difference will be more than 5 ppm, and hence the statistical significance of the difference is of practical importance to the the engineers.
b. Since all of the probable values of μ1 − μ2 given by the interval are all greater than 5, it is likely that the difference will be more than 5 ppm, and hence the statistical significance of the difference is of practical importance to the the engineers.
c. Since all of the probable values of μ1 − μ2 given by the interval are between −5 and 5, it is not likely that the difference will be more than 5 ppm, and hence the statistical significance of the difference is not of practical importance to the the engineers.
In: Math
A random variable is normally distributed. It has a mean of 245 and a standard deviation of 21.
e.) For a sample of size 35, state the mean of the sample mean and the standard deviation of the sample mean.
f.) For a sample of size 35, find the probability that the sample mean is more than 241.
g.) Compare your answers in part c and f. Why is one smaller than the other?
In: Math
Consider the results from a completely randomized design showing commuting times in three states. Use an appropriate Excel ANOVA tool, to test for any significant differences in commuting times between the three states. Use α = 0.05.
| Illinois | Ohio | Texas |
| 26.8 | 27.5 | 10.1 |
| 17.6 | 28.9 | 18.8 |
| 27 | 19.1 | 31.4 |
| 20 | 36.9 | 44.2 |
| 50.7 | 40.8 | 24.6 |
| 24.4 | 9.5 | 29.5 |
| 36.8 | 37.4 | 38.1 |
| 42.2 | 38.9 | 30.3 |
| 26.3 | 46.2 | 11.7 |
| 14 | 35.8 | 35.8 |
| 28.5 | 20.7 | 22.4 |
| 36.9 | 37.8 | 17 |
| 25.6 | 49.7 | 15.4 |
| 25.9 | 44.3 | 15.4 |
| 29.5 | 12.1 | 6.8 |
| 29.7 | 43.7 | 14.8 |
| 30.5 | 35.9 | 59.3 |
| 20 | 30.2 | 5.3 |
| 23.2 | 8.5 | 0.6 |
| 20.7 | 34.6 | 20.7 |
| 6.2 | 37.9 | 18.6 |
| 44.2 | 50.9 | 24.9 |
| 28.2 | 24.2 | 9.3 |
| 28.8 | 39.1 | 11.9 |
| 16.6 | 20.4 | 19.6 |
| 20.2 | 12.4 | 31 |
| 13.1 | 28 | 25.9 |
| 16.9 | 28.4 | 52.6 |
| 32.4 | 19.4 | 38.3 |
| 19.6 | 42.5 | 34 |
| 12.8 | 27.2 | 24.9 |
| 30.2 | 22.6 | 32.1 |
| 65.1 | 50.8 | 43 |
| 25.5 | 34.1 | 31.1 |
| 17.5 | 27.1 | 16.8 |
| 11.1 | 38.9 | 34.1 |
| 48.8 | 28.7 | 40.4 |
| 38.9 | 54.2 | 29.4 |
| 23.1 | 30.6 | 9.8 |
| 21.6 | 15.9 | 19.5 |
| 22.3 | 15.1 | 9.6 |
| 27.3 | 30.1 | 21.6 |
| 30.7 | 32.2 | 26.5 |
In: Math
We are considering a launch of a new type of raisin into the packaged raisin market. To do so, we collected product ratings on a 1-10 Likert-scale from consumers utilizing the following attributes and corresponding levels,
| Attribute | Level1 | level 2 | level 3 | level 4 |
| Rasin Chewiness | low | medium | high | n/a |
| Rasin Color | white | grey | brown | black |
| Packaging Size | small | large | n/a | n/a |
| Free Gift | no | yes | n/a | n/a |
| Raisin Aroma | none | medium | heavy | n/a |
| Price Compared to Market Leader | lower | same | higher | n/a |
Please base your answer to the following questions on this data. Note that each attribute is coded numerically. For instance, for Chewiness (Low =1, Medium =2, High =3) and similarly for the other attributes reading left to right in the table above.
For each of the attributes, we code the levels into multiple dummy variables to include in our regression. The variables we used are as follows:
|
|
Note that for each category, the number of variables is equal to the number of levels – 1.
For example, for chewiness, we only need 2 dummy variables to show 3 levels:
Regression Results from creating dummy variables
| Coefficents | beta | std. error | t-value | p-value |
| intercept | 5.2991 | 0.3240 | 16.353 | 0.0000 |
| chew 1 | -0.8659 | 0.2437 | -3.5535 | 0.0006 |
| chew 2 | -0.3461 | 0.2438 | -1.4195 | 0.1593 |
| color 1 | 0.1211 | 0.2871 | 0.4218 | 0.6742 |
| color 2 | 0.2145 | 0.2802 | 0.7657 | 0.4459 |
| color 3 | 0.4799 | 0.2696 | 1.7801 | 0.0785 |
| Size large | 0.8992 | 0.2000 | 4.4969 | 0.0000 |
| gift dummy | 0.0916 | 0.2099 | 0.4365 | 0.6635 |
| aroma 1 | 0.5468 | 0.2563 | 2.1334 | 0.0357 |
| aroma 2 | 0.9715 | 0.2327 | 4.1742 | 0.0001 |
| Price 1 | 0.6548 | 0.2157 | 3.0362 | 0.0032 |
| Price 2 | 0.3237 | 0.2895 | 1.1180 | 0.2666 |
Residual standard error: 0.9418 on 88 degrees of freedom
Multiple R-Squared: 0.4276
F-statistic: 5.976 on 11 and 88 degrees of freedom, the p-value is 3.412e-007
In: Math
3. Independent random samples of n1 = 16 and n2 = 13 observations were selected from two normal populations with equal variances. The sample means and variances are shown below: Population 1 Population 2 Sample size 16 13 Sample mean 34.6 32.2 Sample variance 4.0 4.84 a) Suppose you wish to test if there is difference between the population means with significance level of α = 0.05. State the null and alternative hypotheses that you use for the test. b) Find the value of the test statistic c) Find the value of the critical value d) Conduct the test and state your conclusions.
In: Math
Paired Samples t-test (30pts) Suppose you are interested in deciding if a particular diet is effective in changing people’s weight. You decide to run a “within subject” experiment. You select 6 people and weight each of them. Two weeks, you weight them again. For each person you compute how much weight they lost over this period. This is what you find: Non-diet(subject 1-6): 0 7 3 2 -10 -1
You then put them on the diet and weigh them again after two weeks and compute how much they lost over this period. Diet
(subject 1-6) : 1 6 4 3 -8 2
a) State the null and alternative hypotheses (2pts)
b) Compute the mean and standard deviation of the difference distribution (4pts)
c) How many degrees of freedom do you have? (3pts)
d) Assume the Null Hypothesis is True and compute the t-statistic (2-pts)
e) Compute the P-value (2pts)
f) At an alpha = 0.05 would you accept or reject the null hypothesis? (3pts)
Please show work, thank you!
In: Math
Describe the difference between a one tailed and a two tailed test. What is the difference between a z test and a t test, and how do you determine which one to use? Also, discuss when a two sample test would be used, and provide an example.
In: Math