Question

In: Biology

The following data were collected as part of a study of coffee consumption among graduate students....

The following data were collected as part of a study of coffee consumption among graduate students. The following reflect cups per day consumed:

3          4          6          8          2          1          0          2

X

X2

0

0

1

1

2

4

2

4

3

9

4

16

6

36

8

64

26

134

  1. Compute the sample mean.
  1. Compute the sample standard deviation.
  1. Compute the median.
  1. Compute the first and third quartiles.
  1. Which measure, the mean or median, is a better measure of a typical value? Justify.
  1. Which measure, the standard deviation or the interquartile range, is a better measure of dispersion? Justify.

Solutions

Expert Solution

Coffee consumption among graduate students.

(a.) Sample mean (X̅) is defined as sum of total events (∑x) divided by number of total events(n).

i.e. X̅= (∑x)/ (n)

Here, total number of events is (n)=8

So,X̅= (3+4+6+8+2+1+0+2)/8

X̅= 26/8

X̅= 3.25

Hence, sample mean (X̅) is 3.25.

(b.) Sample standard deviation (σ) is the measure of the spread of the data. The formula to calculate it is square root of ((xi-X̅)^2) divided by (n-1)

σ= √[∑(xi-X̅)^2/(n-1)]

where ,

s= sample standard deviation

xi= observed values of sample

n= number of observed values

X̅= sample mean

observed values (xi)

(xi-X̅)

(xi-X̅)^2

3 (3-3.25)= (-0.25) 6.25
4 (4-3.25)= (0.75) 0.5625
6 (6-3.25)= (2.75) 7.5625
8 (8-3.25)= (4.75) 22.5625
2 (2-3.25)= (-1.25) 1.5625
1 (1-3.25)= (-2.25) 5.0625
0 (0-3.25)= (-3.25) 10.5625
2 (2-3.25)= (-1.25) 1.5625
total [∑(xi-X̅)^2] = 55.6875

Now, σ= √[∑(xi-X̅)^2/(n-1)]

σ=√55.6875/(8-1)

σ= √55.6875/7

σ= √7.95

σ= 2.81

Hence, sample standard deviation (σ) is 2.81.

(c.) Median is the middle number of the set of numbers of a data.

First, we will arrange the data as- 0,1,2,2,3,4,6,8.

Since, the number of values in the data set is even. So, median is calculated as-

Then, we will find n/2th observation from the data. It is 8/2 th = 4th observation.

And, (n+1)/2th observation from the data. It is (8/2)+1 th = 4+1 = 5th observation

The 4th observation= 2

The 5th observation= 3

Now, the median is the average of 4th and 5th observation.

median= (4th obs+ 5th obs)/ 2

median= (2+3)/2 =5/2 = 2.5

Hence, median is 2.5.

(d.) For data with even number of observed values,

such as - 0,1,2,2,3,4,6,8

3rd and 4th observations are included in the calculation of median. This divides the data in lower half and upper half. The values in lower half are 0,1,2. The value in upper half are 4,6,8. The middle number in these halves are the 1st and 3rd quartile respectively.

So,for 1st quartile /Q1: the nos- 0,1,2. And the middle no. is 1. So, Q1 = 1.

Further, for 3rd quartile /Q3: the nos- 4,6,8. And the middle no. is 6. So, Q3 = 6.

Hence, Q1 is 1 and Q3 is 6.

(e.) For, normally distributed data, mean is better as using it will include all the values of observation.But, for a skewed data, median is better, as it is less effected by extreme values of a skewed data.

(f.) Both standard deviation and inter- quartile range are measures of standard deviation. Inter- quartile range is dependent on few values Q, Q2 and Q3. While, standard deviation takes into acount all the observed values in the data. So, I feel standard deviation is a better measure of dispersion over inter- quartile range.


Related Solutions

. Average coffee consumption among graduate students is commonly known to be 3.89 cups per day....
. Average coffee consumption among graduate students is commonly known to be 3.89 cups per day. A graduate student took a sample of 10 fellow graduate students in her statistics class, and in the sample produced a mean of 4.02 cups per day and a standard deviation of .19. Because of this, she believes that the average is actually greater than what is commonly thought (3.89 cups per day). Do we believe the graduate student’s claim or the common belief?...
A study is conducted concerning automobile speeds and fuel consumption rates. The following data is collected:                        &
A study is conducted concerning automobile speeds and fuel consumption rates. The following data is collected:                                                  X Speed   45        52         49         60         67         61           Y MPG    22         26         21         28         33         32                                                                                                                                              Draw Scatter plot Compute the coefficient of correlation, and make a comment about that. Compute the coefficient of determination, and make a comment about that. Compute the coefficient of Non- determination, and make a comment about that. Find Regression Line and make a comment about that. Also, at what location the line crosses    the y-axis. What is the “Residual’ at...
A study is conducted concerning automobile speeds and fuel consumption rates. The following data is collected:                        &
A study is conducted concerning automobile speeds and fuel consumption rates. The following data is collected:                                                   X Speed   45        52        49         60         67         61          Y MPG    22         26        21        28        33         32                                                                                                                                             Draw Scatter plot Compute the coefficient of correlation and make a comment about that. Compute the coefficient of determination and make a comment about that. Compute the coefficient of Non- determination and make a comment about that. Find Regression Line and make a comment about that. Also, at what location the line crosses               the y-axis. What is the “Residual’ at...
The following data were collected Determine if there are any significant differences among the four treatments....
The following data were collected Determine if there are any significant differences among the four treatments. Use the .05 level of significance. Participant A B C D 1 6 3 3 0 2 4 4 2 2 3 4 2 0 2 4 6 3 3 0 For the given data above, calculate the f-ratio. Show ALL your work
You conducted a study on the relationship between alcohol consumption and depression among college students in...
You conducted a study on the relationship between alcohol consumption and depression among college students in which you distributed a survey to 300 college seniors. Students who completed the survey responded to a standardized questionnaire measuring depression symptoms and to a series of questions about their typical alcohol consumption patterns. The results of your study indicated that the correlation between depression and alcohol consumption was +.34. Explain what this correlation means in terms of direction and strength. One possible explanation...
The following data were collected from a survey of 10 randomly selected college students: Find the...
The following data were collected from a survey of 10 randomly selected college students: Find the mean, median, mode, variance, standard deviation, the five number summary report of the hours per week of the sample of students’ studied. Show your work. Student ID Facebook # hours of study per week 244701130 Yes 8 302896051 no 5 734077249 yes 11 891072704 yes 5 730265917 yes 9 894866913 no 6 644678646 no 1 369417477 yes 1 388511718 yes 2 554470987 no 1
As part of a study on transportation safety, the U.S. Department of Transportation collected data on...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on the number of fatal accidents per 1000 licenses and the percentage of licensed drivers under the age of 21 in a sample of 42 cities. Data collected over a one-year period follow. These data are contained in the file named “Safety.csv”. 1- Find the sample mean and standard deviation for each variable. Round your answers to the nearest thousandth. 2- Use the function lm()...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on...
As part of a study on transportation safety, the U.S. Department of Transportation collected data on the number of fatal accidents per 1000 licenses and the percentage of licensed drivers under the age of 21 in a sample of 42 cities. Data collected over a one-year period follow. These data are contained in the file named “Safety.csv”. 1- Find the sample mean and standard deviation for each variable. Round your answers to the nearest thousandth. 2- Use the function lm()...
The following data come from a study designed to investigate drinking problems among college students. In...
The following data come from a study designed to investigate drinking problems among college students. In 1983, a group of students were asked whether they had ever driven an automobile while drinking. In 1987, after the legal drinking age was raised, a different group of college students were asked the same question. Drove While Drinking                 Year Total 1983                       1987 Yes No 1250                         991 1387                       1666 2241 3053 Total 2637                       2657 5294 Use the chi-square test to evaluate the null...
USE R AND SHOW CODES 2. The following data were collected in a multisite observational study...
USE R AND SHOW CODES 2. The following data were collected in a multisite observational study of medical effectiveness in Type II diabetes. These sites were involved: a healthy maintenance organization (HMO), a university teaching hospital (UTH), and an independent practice assumption (IPA). The following data display the treatment regimens of patients measured at baseline by site. Use the data to test that no difference in treatment regimens across sites. (in addition, calculate the expected frequency for each cell.)                                                              ...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT