Question

In: Statistics and Probability

DATA 3 8 2 15 2 2 0 0 4 5 2 7 0 1 5...

DATA

3
8
2
15
2
2
0
0
4
5
2
7
0
1
5
3
0
2
5
4
1
6
9
5
3
1
2
10
6
1
1
2
1
19
6
6
6
7
0
4
1
1
1
0
1
9
2
2
2
1
16
10
10
5
2
3
1
4
4
4
3
6
2
8
5
2
7
1
6
4
0
3
1
1
1

Background:

A group of 75 college students from a certain liberal arts college were randomly sampled and asked about the number of alcoholic drinks they have in a typical week. The file containing the data is linked below. The purpose of this study was to compare the drinking habits of the students at the college to the drinking habits of college students in general. In particular, the dean of students, who initiated this study, would like to check whether the mean number of alcoholic drinks that students at his college have in a typical week differs from the mean of U.S. college students in general, which is estimated to be 4.73.

Question 1:

Let μ be the mean number of alcoholic beverages that students in the college drink in a typical week. State the hypotheses that are being tested in this problem.

Question 2:

Here is a histogram of the data. Can we safely use the t-test with this data?

Instructions

Click on the link corresponding to your statistical package to see instructions for completing the activity, and then answer the questions below.

R | StatCrunch | Minitab | Excel 2007 | TI Calculator

Question 3:

State the test statistic and interpret its value.

Question 4:

Based on the P-value, draw your conclusions in context.

Question 5:

What would your conclusions be if the dean of students suspected that the mean number of alcoholic drinks that students in the college consume in a typical week is lower than the mean of U.S. college students in general? In other words, if this were a test of the hypotheses:

H0: μ = 4.73 drinks per week

Ha: μ < 4.73 drinks per week

Question 6:

Now suppose that instead of the 75 students having been randomly selected from the entire student body, the 75 students had been randomly selected only from the engineering classes at the college (for the sake of convenience).

Address the following two issues regarding the effect of such a change in the study design:

a. Would we still be mathematically justified in using the T-test for obtaining conclusions, as we did previously?

b. Would the resulting conclusions still address the question of interest (which, remember, was to investigate the drinking habits of the students at the college as whole)?

Solutions

Expert Solution

Solution-1:

Ho:

Ha:

Soluiton-b:

df1 =read.table(header = TRUE, text ="
drinks
3
8
2
15
2
2
0
0
4
5
2
7
0
1
5
3
0
2
5
4
1
6
9
5
3
1
2
10
6
1
1
2
1
19
6
6
6
7
0
4
1
1
1
0
1
9
2
2
2
1
16
10
10
5
2
3
1
4
4
4
3
6
2
8
5
2
7
1
6
4
0
3
1
1
1
"
)
df1

histogram(df1$drinks,main='Histogram')

From histogram we could see that shape is not symmetrical

shape is positively skewed

But as for central limt theorem,n=75

sample follows normal distribution

State the test statistic and interpret its value.

Solution-3:

Rcode to get

test statistic and p value using t.tets function

Rcode:

t.test(df1$drinks,mu=4.73)

Output:

data: df1$drinks
t = -1.8275, df = 74, p-value = 0.07165
alternative hypothesis: true mean is not equal to 4.73
95 percent confidence interval:
3.064735 4.801932
sample estimates:
mean of x
3.933333

test statitstic,t=-1.8275

Solution-4:

p-value = 0.07165

p>0.05

Fail to reject Ho

Accept Ho

Conclusion;
There is no suffcient statistical evidence at 5% level of signficance to conclude that the mean number of alcoholic drinks that students at his college have in a typical week differs from the mean of U.S. college students in general, which is estimated to be 4.73.

Solution-5:

R code;

t.test(df1$drinks,mu=4.73,alternative="less")

Output;

data: df1$drinks
t = -1.8275, df = 74, p-value = 0.03583
alternative hypothesis: true mean is less than 4.73
95 percent confidence interval:
-Inf 4.659456
sample estimates:
mean of x
3.933333

t=-1.8275

, p-value = 0.03583

p<0.05

Reject Ho

Accept Ha

Comclusion:

There is sufcient statistical evidence at 5% level of significance to conclude that mean number of alcoholic drinks that students in the college consume in a typical week is lower than the mean of U.S. college students in general.


Related Solutions

A = (1 −7 5 0 0 10 8 2 2 4 10 3 −4 8...
A = (1 −7 5 0 0 10 8 2 2 4 10 3 −4 8 −9 6) (1) Count the number of rows that contain negative components. (2) Obtain the inverse of A and count the number of columns that contain even number of positive components. (3) Assign column names (a,b,c,d) to the columns of A. (4) Transform the matrix A into a vector object a by stacking rows. (5) Replace the diagonal components of A with (0,0,2,3). Hint:...
Data: 7,-5, -8, 7, 9, 15, 0, 2, 13, 8, 6, -2, 4 (a) Mean= Mode=...
Data: 7,-5, -8, 7, 9, 15, 0, 2, 13, 8, 6, -2, 4 (a) Mean= Mode= median= (b) Variance= Standard deviation= (c) Range= IQR(Interquartilerange)= (d) Mid-Range= Mid-Hinge=
Find regression line for the data X 0   1   2   3    4   5   6   7  8          &nbsp
Find regression line for the data X 0   1   2   3    4   5   6   7  8               [3 MARKS] Y 11 21 31 41 51 61 71 81 91 b. X  0   2   4   6   8  10                            [3 MARKS]       Y  12 15 17 18 20 22
For the data set 1 2 3 4 7 7 7 8 11 12 12 15...
For the data set 1 2 3 4 7 7 7 8 11 12 12 15 15 16 17 17 17 18 20 20 22 24 24 25 26 26 26 26 27 30 32 32 33 34 34 36 38 39 43 44 45 46 47 47 48 51 52 52 53 54 54 54 55 56 58 58 59 61 63 65 65 67 69 70 73 75 75 76 77 77 79 80 81 82 82 (a)...
Given: x y -5 1 -4 5 -3 4 -2 7 -1 10 0 8 1...
Given: x y -5 1 -4 5 -3 4 -2 7 -1 10 0 8 1 9 2 13 3 14 4 13 5 18 What are the confidence limits (alpha = 0.05) for the true mean value of Y when X = 3?
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 15 The...
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 15 The estimated regression equation for these data is  ŷ = 0.60 + 2.60x. (a)Compute SSE, SST, and SSR using equations SSE = Σ(yi − ŷi)2, SST = Σ(yi − y)2, and SSR = Σ(ŷi − y)2. SSE=SST=SSR= (b) Compute the coefficient of determination r2. r2 = Comment on the goodness of fit. (For purposes of this exercise, consider a proportion large if it is at least...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Revenue $             -   $            -   $            -   $        -   $        -   $         -   $    2,500 $    2,875 $    3,306 $    3,802 $    4,373 $    5,028 $    5,783 $    6,650 $    7,648 $      8,795 $   10,114 $   11,631 $   13,376 $   15,382 $   17,689 $   20,343 $   23,394 $   26,903 Monthly Revenue Growth...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Revenue $             -   $            -   $            -   $        -   $        -   $         -   $    2,500 $    2,875 $    3,306 $    3,802 $    4,373 $    5,028 $    5,783 $    6,650 $    7,648 $      8,795 $   10,114 $   11,631 $   13,376 $   15,382 $   17,689 $   20,343 $   23,394 $   26,903 Monthly Revenue Growth...
x (Bins) frequency 0 0 1 0 2 0 3 2 4 5 5 8 6...
x (Bins) frequency 0 0 1 0 2 0 3 2 4 5 5 8 6 13 7 33 8 42 9 66 10 77 11 105 12 103 13 110 14 105 15 84 16 70 17 51 18 40 19 27 20 27 21 15 22 5 23 7 24 2 25 2 26 1 27 0 28 0 29 0 30 0 (7) On the Histogram worksheet, calculate all frequencies of the distribution using the table shown....
Given the following data (2, 5, 1, 0, 5, 0, 7, 2, 3) Calculate the 3rd...
Given the following data (2, 5, 1, 0, 5, 0, 7, 2, 3) Calculate the 3rd quartile from your "data". What's the probability that a randomly selected number from your data is even? Consider the events A = "a randomly selected number from your data is even" and B = "a randomly selected number from your data is above the 3rd quartile". Are these events disjoint, independent, neither, or both? Show your work.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT