Question

In: Statistics and Probability

a. Using the following R codes to fit the linear regression model for VitC on HeadWt,...

a. Using the following R codes to fit the linear regression model for VitC on HeadWt, and obtain its summary. Paste the R output in your homework. cabbages_data <- read.csv("http://users.stat.umn.edu/~wuxxx725/data/cabbages_data.csv") cabbages_reg <- lm(VitC ~ HeadWt, data = cabbages_data) summary(cabbages_reg)

b. State and interpret the value of r 2 from the model summary output in part a).

c. Calculate the correlation r between HeadWt and VitC, and state the strength and the direction of the correlation.

d. State the estimated regression equation in the form VitC ˆ = a + b(HeadWt).

e. Interpret the slope b in the estimated regression equation in part d).

f. Cabbage head #20 had a weight of 2.0 kg. Predict its vitamin C content using the estimated regression equation in part d)

g. The actual vitamin C content for cabbage head #20 was 60 mg/100 g. Find its residual based on your answer in parts f).

h. How would the correlation r between HeadWt and VitC change, if HeadWt were measured in pounds instead of kilograms? Please explain. (1 kg = 2.205 lbs)

i. Check the linearity, normal error, and constant variance assumptions using the following plots.

j. Conduct a five-step hypothesis test on whether the true slope β is different from 0 at the significance level 0.05.

Solutions

Expert Solution

a)

> cabbages_data <- read.csv("http://users.stat.umn.edu/~wuxxx725/data/cabbages_data.csv")
> cabbages_reg <- lm(VitC ~ HeadWt, data = cabbages_data)
> summary(cabbages_reg)

Call:
lm(formula = VitC ~ HeadWt, data = cabbages_data)

Residuals:
Min 1Q Median 3Q Max
-8.9317 -4.5844 -0.6325 4.3333 13.0863

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 79.554 3.353 23.723 < 2e-16 ***
HeadWt -6.647 1.380 -4.817 4.57e-05 ***

---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 6.363 on 28 degrees of freedom
Multiple R-squared: 0.4532,   Adjusted R-squared: 0.4337
F-statistic: 23.21 on 1 and 28 DF, p-value: 4.57e-05

b)

Multiple R-squared: 0.4532

there is 45.32% variation in the VitC which is explaiend by headWt.

c)

> cor.test( ~VitC+HeadWt, data=cabbages_data)

   Pearson's product-moment correlation

data: VitC and HeadWt
t = -4.8173, df = 28, p-value = 4.57e-05
alternative hypothesis: true correlation is not equal to 0
95 percent confidence interval:
-0.8317424 -0.4131245
sample estimates:
cor
-0.6731977

there is a strong negative linear relationship between VitC and HeadWt

d)

VitC ˆ = 79.554 + -6.647 (HeadWt)


Related Solutions

Using the following R codes to fit the linear regression model for VitC on HeadWt, and...
Using the following R codes to fit the linear regression model for VitC on HeadWt, and obtain its summary. Paste the R output in your homework. cabbages_data <- read.csv("http://users.stat.umn.edu/~wuxxx725/data/cabbages_data.csv") cabbages_reg <- lm(VitC ~ HeadWt, data = cabbages_data) summary(cabbages_reg)
a. Using the following R codes to fit the linear regression model for VitC on HeadWt,...
a. Using the following R codes to fit the linear regression model for VitC on HeadWt, and obtain its summary. Paste the R output in your homework. cabbages_data <- read.csv("http://users.stat.umn.edu/~wuxxx725/data/cabbages_data.csv") cabbages_reg <- lm(VitC ~ HeadWt, data = cabbages_data) summary(cabbages_reg) b. State and interpret the value of r 2 from the model summary output in part a). c. Calculate the correlation r between HeadWt and VitC, and state the strength and the direction of the correlation. d. State the estimated regression...
The following is a regression summary from R for a linear regression model between an explanatory...
The following is a regression summary from R for a linear regression model between an explanatory variable x and a response variable y. The data contain n = 50 points. Assume that all the conditions for SLR are satisfied. Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) -1.1016 0.4082 -2.699 -------** x 2.2606 0.0981 ---- < 2e-16 *** (a) Write the equation for the least squares regression line. (b) R performs a t-test to test whether the slope is significantly...
Fit a multiple linear regression model of the form y=β0 + β1 x1 + β2 x2...
Fit a multiple linear regression model of the form y=β0 + β1 x1 + β2 x2 + β3 x3 + ε. Here, ε is the random error term that is assumed to be normally distributed with 0 mean and constant variance. State the estimated regression function. How are the estimates of the three regression coefficients interpreted here? Provide your output, and interpretations in a worksheet titled “Regression Output.” Obtain the residuals and prepare a box-plot of the residuals. What information...
A least-squares simple linear regression model was fit predicting duration (in minutes) of a dive from...
A least-squares simple linear regression model was fit predicting duration (in minutes) of a dive from depth of the dive (in meters) from a sample of 45 penguins' diving depths and times. Calculate the F-statistic for the regression by filling in the ANOVA table. SS df MS F-statistic Regression Residual 1628.4056 Total 367385.9237
Using the GDP Data do the following: Generate the best fit model (regression) Generate the specific...
Using the GDP Data do the following: Generate the best fit model (regression) Generate the specific regression form Explain any dummy variables created Explain any time variables created Discuss the significance of all variables Generate and discuss the residual plot GDP C I G 822.2 625.7 93.6 110.1 751.5 592.3 62.5 121.3 703.6 574.3 39.2 126.6 611.8 523.0 11.8 122.4 603.3 511.0 17.5 118.0 668.3 546.9 31.6 133.0 728.3 580.6 58.4 137.0 822.5 639.6 74.9 158.9 865.8 663.5 93.6 153.2...
Determine the Best Fit Linear Regression Equation Using Technology - Excel Question The table shows the...
Determine the Best Fit Linear Regression Equation Using Technology - Excel Question The table shows the age in years and the number of hours slept per day by 24 infants who were less than 1 year old. Use Excel to find the best fit linear regression equation, where age is the explanatory variable. Round the slope and intercept to one decimal place. Age Hours 0.03 16.5 0.05 15.2 0.06 16.2 0.08 15.0 0.11 16.0 0.19 16.0 0.21 15.0 0.26 14.5...
Summarize the statistical concepts of predicted value and R-squared for a linear regression model, including the...
Summarize the statistical concepts of predicted value and R-squared for a linear regression model, including the meaning and interpretation. Give two examples of these concepts applied to a health care decision in a professional setting, and discuss practical, administration-related implications.
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression...
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression Statistics R-Square                                                       0.558 Regression Standard Error (S)                  863.100 Observations                                               35                                Coeff        StdError          t-Stat    Intercept               1283.000    352.000           3.65    X1                             25.228        8.631                       X2                               0.861        0.372           Questions: Interpret each coefficient.
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression...
The following is the estimation results for a multiple linear regression model: SUMMARY OUTPUT             Regression Statistics R-Square                                                       0.558 Regression Standard Error (S)                  863.100 Observations                                               35                                Coeff        StdError          t-Stat    Intercept               1283.000    352.000           3.65    X1                             25.228        8.631                       X2                               0.861        0.372           Question: 1. A. Write the fitted regression equation. B. Write the estimated intercepts and slopes, associated with their corresponding standard errors. C. Interpret each coefficient.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT