Question

In: Statistics and Probability

The weights (in pounds) and ages (in months) of 35 randomly selected male bears in Yellowstone...

The weights (in pounds) and ages (in months) of 35 randomly selected male bears in Yellowstone Park were recorded in a file male_bears.csv. A researcher runs the following code in R to read in the data.

> bears <- read.table(file="male_bears.csv", header=T, sep=",")

The researcher then runs the following code in R to fit a simple linear regression model oftheformyi =α+βxi +εi, i=1,2...n.Notethatsomevaluesintheoutputhave been removed.

> result <- lm(WEIGHT ~ AGE, data=bears) > summary(result)

Call:
lm(formula = WEIGHT ~ AGE, data Residuals:

Min 1Q Median 3Q -204.96 -45.82 -11.69 22.78

Coefficients:
Estimate Std. Error

(Intercept) 73.6415 19.3023
AGE 3.2052 0.3695
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘’1

Residual standard error: 75.13 on 33 degrees of freedom Multiple R-squared: 0.6952, Adjusted R-squared: 0.6859
F-statistic: 75.25 on 1 and 33 DF, p-value: **REMOVED**

  1. i) Identify the response and predictor variables in the output above. [2]

  2. ii) Using values from the output, write out the estimated linear regression equation.

    [2]

= bears)

Max 174.33

t value 3.815 8.675

Pr(>|t|) 0.000567 *** **REMOVED**

‘*’ 0.05 ‘.’ 0.1

2

  1. iii) Provide an interpretation of the coefficient of the variable AGE in the output. [1]

  2. iv) Write out the hypotheses being tested by the t-statistic circled in the output.

    [1]

  3. v) What are your conclusions about the hypotheses in (iv)? Use statistical tables to arrive at your conclusion. [3]

The researcher then produced the following analysis for variance for the regression model.

> anova(result)
Analysis of Variance Table
Response: WEIGHT
          Df Sum Sq Mean Sq F value    Pr(>F)

AGE 1 424736 424736 75.251 5.017e-10 *** Residuals 33 186260 5644

  1. vi) Using values from the analysis of variance output, compute the coefficient of determination for the regression model. [2]

  2. vii) Provide an interpretation of the coefficient of determination. [1]

Some summary statistics for the variable AGE are provided below. > summary(bears$AGE)

Min. 1st Qu. Median Mean 3rd Qu. Max. 8.00 16.50 32.00 39.34 53.00 177.00

  1. viii) Is the estimated regression equation in (ii) suitable for predicting the weight of a male bear who is 100 months old? Give a reason for your answer. If your answer was yes, find the predicted weight of a 100-month old male bear. [3]

  2. ix) Is the estimated regression equation in (ii) suitable for predicting the weight of a female bear who is 100 months old? Give a reason for your answer. [2]

The researcher then produced the following diagnostic plots in R.

> qqnorm(result$residuals)
> qqline(result$residuals)

3

> plot(result$fitted, result$residuals)
  1. x) For each of the plots above, explain the specific purpose of the plot. [2]

  2. xi) State your conclusions made from observing each plot. [2]

Solutions

Expert Solution


Related Solutions

Listed below are the body lengths (in inches) and weights (in lb) of randomly selected bears:...
Listed below are the body lengths (in inches) and weights (in lb) of randomly selected bears: Length 40         64         65         49         47 Weight 65        356       316        94         86 Find the value of the linear correlation coefficient. Letting y represent weights of bears and letting x represent their lengths, find the regression equation. Based on the given sample data, what is the best predicted weight of a bear with a length of 72.0 inch ?
Listed below are the body lengths (in inches) and weights (in lb) of randomly selected bears:...
Listed below are the body lengths (in inches) and weights (in lb) of randomly selected bears: Length 40         64         65         49         47 Weight 65        356       316        94         86 Find the value of the linear correlation coefficient. Letting y represent weights of bears and letting x represent their lengths, find the regression equation. Based on the given sample data, what is the best predicted weight of a bear with a length of 72.0 inch?
Let D be the weight, in pounds, of a randomly selected male baseball player. Let L...
Let D be the weight, in pounds, of a randomly selected male baseball player. Let L be the weight, in pounds, of a randomly selected female softball player. Suppose D is normally distributed with a mean weight of 173 pounds and a standard deviation of 29 pounds. Suppose L is normally distributed with a mean weight of 143 pounds and a standard deviation of 25 pounds. L and D are independent a) Calculate the expected value of L + D....
The weights of four randomly and independently selected bags of tomatoes labeled 5.0 pounds were found...
The weights of four randomly and independently selected bags of tomatoes labeled 5.0 pounds were found to be 5.3, 5.0, 5.6, and 5.4 pounds. Assume Normality. Answers part (a) and (b) below. (a). Find a 95% confidence interval for the mean weight of all bags of tomatoes. (__,__) (b). Does the interval capture 5.0 pounds? Is there enough evidence to reject a mean weight of 5.0 pounds? Explain.
The weights of four randomly and independently selected bags of tomatoes labeled 5.0 pounds were found...
The weights of four randomly and independently selected bags of tomatoes labeled 5.0 pounds were found to be 5.1 ,.5.0, 5.2, and 5.1 pounds. Assume Normality. Find a? 95% confidence interval for the mean weight of all bags of tomatoes.
The weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found...
The weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found to be 20.7​, 21.4​, 20.9​, and 21.2 pounds. Assume Normality. Answer parts​ (a) and​ (b) below. a. Find a​ 95% confidence interval for the mean weight of all bags of potatoes. b. Does the interval capture 20 ​pounds? Is there enough evidence to reject a mean weight of 20 ​pounds? A.The interval does not capture 20 pounds, so there is enough evidence to reject...
the weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found...
the weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found 21.7,22,20.5, and 21.9. Assume Normality. Using a two sided alternative hypothesis should you be able to reject a hypothesis that the population mean is 20 pounds using a significance level of 0.05 why or why not the confidence interval is reported here I am 95% confidence the population mean is between 20.4 and 22.6 pounds. B) test the hypothesis that the population mean is...
he weights of randomly selected 5 female students and 5 male students are given by the...
he weights of randomly selected 5 female students and 5 male students are given by the following:weights for male students:160,165,152,158,178weights for female students:169,154,158,156,162. Let the weight distributions of male and female students follow N(μ1,σ1) and N(μ2,σ2)respectively. Find the correlation coefficient between the heights of male and female students.(5 points)
2.The weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found...
2.The weights of four randomly and independently selected bags of potatoes labeled 20 pounds were found to be 21, 22​, ,20.4, and 21.2 Assume Normality. a. Using a​ two-sided alternative​ hypothesis, should you be able to reject the hypothesis that the population mean is 20 pounds using a significance level of 0.05 Why or why​ not? The confidence interval is reported​ here: I am 95​%confident the population mean is between 20.1 and 22.2pounds. b. Test the hypothesis that the population...
The data set represents the weights (in grams) of 10 randomly selected adult male fox squirrels...
The data set represents the weights (in grams) of 10 randomly selected adult male fox squirrels from a forest. Assume the weights are normally distributed. (Adapted from Proceedings of the South Dakota Academy of Science) 821 857 782 930 720 821 794 876 810 841 (a) Find the sample mean and the sample standard deviation. (b) Construct a 95% confidence interval for the population mean. Interpret the results. (c) Construct a 99% confidence interval for the population variance. (d) Construct...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT