In: Statistics and Probability
Children's height as a function of their age has been researched so extensively that we can consider known results to describe the relationship for all children in the United States. For instance, between the ages of 13 and 15, population mean height for teenage males (in inches) satisfies
μy = 22 + 3x, where x is age in years. Spread about the line is 3.1 inches.
1. Notice that the slope of the regression line for the population is β1 = 3. If we were to take repeated random samples of 25 males between the ages of 13 and 15 and regress their heights on their ages, then the slopes b1 would vary from sample to sample. At what slope value would their distribution be centered? (Answer as a whole number.)
2. On average, how much shorter do you predict a 13-year-old to be compared to a 15-year-old? (Answer as a whole number.)
3. The linear regression model does a good job of summarizing the relationship between height and age for males in a particular age range, such as between 13 and 15 years old. Which two conditions would not be met if we attempted to perform inference about the height/age relationship based on a random sample of 250 males all the way from newborn to 25 years old?
a. Scatterplot should appear linear.
b. Sample size should be large enough to offset non-normality in responses.
c. Spread of responses should appear fairly constant over the range of explanatory values.
d. Explanatory/response values should constitute a random sample of independent pairs.
1. Notice that the slope of the regression line for the population is β1 = 3. If we were to take repeated random samples of 25 males between the ages of 13 and 15 and regress their heights on their ages, then the slopes b1 would vary from sample to sample. At what slope value would their distribution be centred? (Answer as a whole number.)
The slope will be centred at 3 as its the average value of all the values.
2. On average, how much shorter do you predict a 13-year-old to be compared to a 15-year-old? (Answer as a whole number.)
So on average, the difference will be 67-31=6 inches
3. The linear regression model does a good job of summarizing the relationship between height and age for males in a particular age range, such as between 13 and 15 years old. Which two conditions would not be met if we attempted to perform inference about the height/age relationship based on a random sample of 250 males all the way from newborn to 25 years old?
a. Scatterplot should appear linear.
d. Explanatory/response values should constitute a random sample of independent pairs.