In: Statistics and Probability
Fuming because you are stuck in traffic? Roadway congestion is a costly item, both in time wasted and fuel wasted. Let xrepresent the average annual hours per person spent in traffic delays and let y represent the average annual gallons of fuel wasted per person in traffic delays. A random sample of eight cities showed the following data.
x (hr) | 25 | 5 | 23 | 37 | 23 | 23 | 18 | 5 |
y (gal) | 46 | 3 | 35 | 56 | 32 | 37 | 26 | 9 |
(a) Draw a scatter diagram for the data.
Verify that Σx = 159, Σx2 = 3955,
Σy = 244, Σy2 = 9636, and Σxy
= 6142.
Compute r.
The data in part (a) represent average annual hours lost per person and average annual gallons of fuel wasted per person in traffic delays. Suppose that instead of using average data for different cities, you selected one person at random from each city and measured the annual number of hours lost x for that person and the annual gallons of fuel wasted y for the same person.
x (hr) | 24 | 4 | 22 | 41 | 15 | 28 | 2 | 36 |
y (gal) | 60 | 8 | 13 | 54 | 24 | 31 | 4 | 74 |
(b) Compute x and y for both sets of data pairs and compare the averages.
x | y | |
Data 1 | ||
Data 2 |
Compute the sample standard deviations sx and
sy for both sets of data pairs and compare the
standard deviations.
sx | sy | |
Data 1 | ||
Data 2 |
In which set are the standard deviations for x and y larger?
The standard deviations for x and y are larger for the first set of data.The standard deviations for x and y are larger for the second set of data. The standard deviations for x and y are the same for both sets of data.
Look at the defining formula for r. Why do smaller
standard deviations sx and
sy tend to increase the value of
r?
Dividing by smaller numbers results in a larger value.Multiplying by smaller numbers results in a larger value. Multiplying by smaller numbers results in a smaller value.Dividing by smaller numbers results in a smaller value.
(c) Make a scatter diagram for the second set of data pairs.
Verify that Σx = 172, Σx2 = 5066,
Σy = 268, Σy2 = 13,778, and
Σxy = 7872.
Compute r.
(d) Compare r from part (a) with r from part (b).
Do the data for averages have a higher correlation coefficient than
the data for individual measurements?
No, the data for averages do not have a higher correlation coefficient than the data for individual measurements.Yes, the data for averages have a higher correlation coefficient than the data for individual measurements.
List some reasons why you think hours lost per individual and fuel
wasted per individual might vary more than the same quantities
averaged over all the people in a city.