Question

In: Statistics and Probability

This problem is going to use the data set in R called "ChickWeight" that has 4...

This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below.

ChickWeight:
A data frame with 578 observations on 4 variables.
1) weight: a numeric vector giving the body weight of the chick (gm).
2) Time: a numeric vector giving the number of days since birth when the measurement was made.
3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The ordering of the levels groups chicks on the same diet together and orders them according to their final weight (lightest to heaviest) within diet.
4) Diet: a factor with levels 1, ..., 4 indicating which experimental diet the chick received.

Using a significance level of 0.05, is there evidence to support that the weight can be determined by the Time (treatment) and Diet (block)?

Fill in the R code below.

dat.aov=aov( ~ factor( ) +  ,data= )
summary( )

Fill in the ANOVA table below.
Type the values into the table EXACTLY as they appear in your output in R.

df SS MS F Pr(>F)
factor(Time) 2e-16
Diet 2e-16
Residuals

Is there evidence to support that the treatment variable Time is significant?
1. ?0:?1=?2=...=?12H0:μ1=μ2=...=μ12 vs ??:????Ha:ALOI
2. ?=0.01α=0.01
3. F =  
4. ??Fα =  
5. Conclusion:
Reject H0
Fail to reject H0
Interpretation:
There is sufficient evidence to support that the variable Time is significant.
There is not sufficient evidence to support that the variable Time is significant.

Is there evidence to support that the block variable Diet is significant?
1. ?0:H0: No block effect vs ??:Ha: There is a block effect
2. ?=0.01α=0.01
3. F =  
4. ??Fα =  
5. Conclusion:
Reject H0
Fail to reject H0
Interpretation:
There is sufficient evidence to support that the variable Diet is significant.
There is not sufficient evidence to support that the variable Diet is significant.

Solutions

Expert Solution

This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below.

ChickWeight:
A data frame with 578 observations on 4 variables.
1) weight: a numeric vector giving the body weight of the chick (gm).
2) Time: a numeric vector giving the number of days since birth when the measurement was made.
3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The ordering of the levels groups chicks on the same diet together and orders them according to their final weight (lightest to heaviest) within diet.
4) Diet: a factor with levels 1, ..., 4 indicating which experimental diet the chick received.

Using a significance level of 0.05, is there evidence to support that the weight can be determined by the Time (treatment) and Diet (block)?

Fill in the R code below.

dat.aov = aov(weight~factor(Diet)+factor(Time),data=ChickWeight)

summary( dat.aov)

Fill in the ANOVA table below.
Type the values into the table EXACTLY as they appear in your output in R.

df

SS

MS

F

Pr(>F)

factor(Time)

2e-16

Diet

2e-16

Residuals

summary(dat.aov)

                   Df Sum Sq    Mean Sq F value Pr(>F)   

factor(Diet)   3 155863     51954     40.75     <2e-16 ***

factor(Time) 11 2040908 185537 145.53    <2e-16 ***

Residuals      563 717785    1275                  

---

Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Is there evidence to support that the treatment variable Time is significant?
1. H0:μ1=μ2=...=μ12 vs ??:????
2. ?=0.01
3. F =  145.53
4. ?? =  2.279
5. Conclusion:
Reject H0
Interpretation:
There is sufficient evidence to support that the variable Time is significant.

Is there evidence to support that the block variable Diet is significant?
1. H0: No block effect vs Ha: There is a block effect
2. ?=0.01
3. F =  40.75
4. ?? =  3.817
5. Conclusion:
Reject H0
Interpretation:
There is sufficient evidence to support that the variable Diet is significant.


Related Solutions

This problem is going to use the data set in R called "ChickWeight" that has 4...
This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below. ChickWeight: A data frame with 578 observations on 4 variables. 1) weight: a numeric vector giving the body weight of the chick (gm). 2) Time: a numeric vector giving the number of days since birth when the measurement was made. 3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The...
This problem is going to use the data set in R called "ChickWeight" that has 4...
This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below. ChickWeight: A data frame with 578 observations on 4 variables. 1) weight: a numeric vector giving the body weight of the chick (gm). 2) Time: a numeric vector giving the number of days since birth when the measurement was made. 3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The...
Use R studio to do this problem. This problem uses the wblake data set in the...
Use R studio to do this problem. This problem uses the wblake data set in the alr4 package. This data set includes samples of small mouth bass collected in West Bearskin Lake, Minnesota, in 1991. Interest is in predicting length with age. Finish this problem without using Im() (a) Compute the regression of length on age, and report the estimates, their standard errors, the value of the coefficient of determination, and the estimate of variance. Write a sentence or two...
In the R programming language, we would like to use the data set called iris to...
In the R programming language, we would like to use the data set called iris to build a simple linear regression model to predict Sepal.Length based on Petal.Length. Calculate the least squares regression line to predict Sepal.Length based on Petal.Length. Interpret the slope of the line in the context of the problem. Remember that both variables are measured in centimeters. Plot the regression line in a scatterplot of Sepal.Length vs. Petal.Length. Test H1: ??1 ≠ 0 at ?? = 0.05...
using the mtcars data set data(mtcars) USE data in mtcars library in R 5.Use k means...
using the mtcars data set data(mtcars) USE data in mtcars library in R 5.Use k means cluster analysis. 6. Get cluster means. 7. Visualize the clustering result.
Instructions tell you how to get the data in R R has built in dataset called...
Instructions tell you how to get the data in R R has built in dataset called Iris. This famous (Fisher's or Anderson's) iris data set gives the measurements in centimeters of the variables sepal length and width and petal length and width, respectively, for 50 flowers from each of 3 species of iris. The species are Iris setosa, versicolor, and virginica. We are interested in estimating the length of Petal (Y) using the length of Sepal (X). First, load the...
Problem 4: Antismoking Campaign (no data set is posted for this problem). From 2005 to 2015,...
Problem 4: Antismoking Campaign (no data set is posted for this problem). From 2005 to 2015, an intensive antismoking campaign has been sponsored by the U.S federal government. Suppose in both years, the American Cancer Society randomly and independently sampled 2,000 adults. In 2005, they discovered 418 of the 2,000 sampled defined themselves as smokers. In 2015, they discovered 302 of the 2,000 sampled defined themselves as smokers. Define the population parameter in context in one sentence. Calculate the two...
Using the R built-in data set called Chick Weight, we want to compare the mean weight...
Using the R built-in data set called Chick Weight, we want to compare the mean weight across the different types of Diet. IMPORTANT: We only want to compare chicks at the final value of Time, 21. In this problem, use ?? = 0.05. Make a boxplot to compare weight across the different types of Diet. Based on the boxplot, describe any differences (or lack of differences) you see. Run an ANOVA to compare weight across the different types of Diet....
3. Using the R data set called warpbreaks (See ?warpbreaks for more info), we want to...
3. Using the R data set called warpbreaks (See ?warpbreaks for more info), we want to compare the mean breaks across both the different types of wool and the different levels of tension. In this problem, use ?? = 0.10. a. Make a boxplot to compare breaks across both wool and tension. Color-code the three different tension levels for easier visibility. Within wool A, describe the relationship between tension and breaks. Within wool B, describe the relationship between tension and...
Consider one of the subset regression models for each data set obtained in Problem Set 4...
Consider one of the subset regression models for each data set obtained in Problem Set 4 and answer the following questions. (i) Draw the scatter plot matrix, residual vs. predictor variable plots and added variable plots. Comment on the regression model based on these plots. (ii) Draw the normal-probability plot and comment. (iii) Draw the correlogram and comment. (iv) Detect leverage points from the data. (v) Compute Cook’s distance statistics and detect all outlier points from the data. (vi) Compute...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT