Question

In: Math

Use the dataset “ChickWeight”, available in R. Assume that each row is an observation of a...

Use the dataset “ChickWeight”, available in R. Assume that each row is an observation of a unique chick. weight is the weight of the chick. Diet is the diet treatment that the chick received. Assume that the distribution of chickweigts within each diet is approximately normal and variances are equal.

1. Construct a 95% confidence interval for the true mean weight of chicks.

2. Interpret the confidence interval in 1. in the context of the problem.

3. Construct a 99% confidence interval for the true mean weight of chicks.

4. Interpret the confidence interval in 2. in the context of the problem.

5. Write down the null and alternative hypothesis to determine if the mean weight of chicks is greater than 120.

6. Conduct a statistical test to determine if the mean weight of chicks is greater than 120. Use = 0.05.

7. Construct a box-and-whisker plot of weight by Diet. Label the graph and axes appropriately. (Hint: there should be 4 box-and whisker plots on one graph)

8. Write down the null and alternative hypothesis to determine if there is a difference in mean weight between diets.

9. Use an ANOVA to determine if there is a difference in mean weights between diets. Assume that all of the assumptions are met to perform the procedure. = 0.05.

10. If there is a difference in mean weights between diets, use a statistical procedure to rank the means where possible.

Solutions

Expert Solution

Solution1:

Rcode:

attach(ChickWeight)

dim(ChickWeight)
data(ChickWeight)
t.test(ChickWeight$weight)

output:

One Sample t-test

data: ChickWeight$weight
t = 41.208, df = 577, p-value < 2.2e-16
alternative hypothesis: true mean is not equal to 0
95 percent confidence interval:
116.0121 127.6246
sample estimates:
mean of x
121.8183

95 percent confidence interval lies in between  116.0121 and
127.6246.

2. Interpret the confidence interval in 1. in the context of the problem.

we are 95% confident that the true mean weight of chicks lies in between 16.0121 and
127.6246.

3. Construct a 99% confidence interval for the true mean weight of chicks.

Rcode:

t.test(ChickWeight$weight,conf.level = 0.99)

output:

One Sample t-test

data: ChickWeight$weight
t = 41.208, df = 577, p-value < 2.2e-16
alternative hypothesis: true mean is not equal to 0
99 percent confidence interval:
114.1784 129.4583
sample estimates:
mean of x
121.8183

. Interpret the confidence interval in 2. in the context of the problem.

we are 99% confident that the true mean weight of chicks lies in between  114.1784 and 129.4583

Solution5:

5. Write down the null and alternative hypothesis to determine if the mean weight of chicks is greater than 120.

H0:

Ha:


Related Solutions

R programming a. obtain the dataset ChickWeight b. obtain the mean weight at 16 days for...
R programming a. obtain the dataset ChickWeight b. obtain the mean weight at 16 days for all chicks on that diet
Use R.  Provide Solution and R Code within each problem. For this section use the dataset “PlantGrowth”,...
Use R.  Provide Solution and R Code within each problem. For this section use the dataset “PlantGrowth”, available in base R (you do not need to download any packages). a.Construct a 95% confidence interval for the true mean weight. b.Interpret the confidence interval in 1. in the context of the problem. c.Write down the null and alternative hypothesis to determine if the mean weight of the plants is less than 5. d.Conduct a statistical test to determine if the mean weight...
This problem is going to use the data set in R called "ChickWeight" that has 4...
This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below. ChickWeight: A data frame with 578 observations on 4 variables. 1) weight: a numeric vector giving the body weight of the chick (gm). 2) Time: a numeric vector giving the number of days since birth when the measurement was made. 3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The...
This problem is going to use the data set in R called "ChickWeight" that has 4...
This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below. ChickWeight: A data frame with 578 observations on 4 variables. 1) weight: a numeric vector giving the body weight of the chick (gm). 2) Time: a numeric vector giving the number of days since birth when the measurement was made. 3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The...
This problem is going to use the data set in R called "ChickWeight" that has 4...
This problem is going to use the data set in R called "ChickWeight" that has 4 variables, as described below. ChickWeight: A data frame with 578 observations on 4 variables. 1) weight: a numeric vector giving the body weight of the chick (gm). 2) Time: a numeric vector giving the number of days since birth when the measurement was made. 3) Chick: an ordered factor with levels 18 < ... < 48 giving a unique identifier for the chick. The...
Use the following dataset and assume all assumptions are met Provide all R code needed to...
Use the following dataset and assume all assumptions are met Provide all R code needed to conduct the tests Bat house color and species Lasiurus in brown: 45, 49, 53, 54, 46, 51, 50, 48, 52, 50 Myotis in brown: 40, 38, 35, 39, 39, 44, 42, 48, 41, 40 Lasiurus in tan: 53, 49, 51, 52, 59, 54, 53, 54, 58, 51 Myotis in tan: 62, 64, 59, 61, 65, 61, 58, 63, 56, 61 Provide the three hypotheses...
3. For each dataset, what is the unit of observation? What is/are the variable(s) collected? State...
3. For each dataset, what is the unit of observation? What is/are the variable(s) collected? State whether the distribution of this data will be skewed and explain why. Draw a plausible sketch of the distribution and label the axes. a. Lengths of pant legs cut and sewn to be 32 inches long. b. The times for students in an introductory psychology course to complete a difficult one-hour timed exam.
Fitting a linear model using R a. Read the Toluca.txt dataset into R (this dataset can...
Fitting a linear model using R a. Read the Toluca.txt dataset into R (this dataset can be found on Canvas). Now fit a simple linear regression model with X = lotSize and Y = workHrs. Summarize the output from the model: the least square estimators, their standard errors, and corresponding p-values. b. Draw the scatterplot of Y versus X and add the least squares line to the scatterplot. c. Obtain the fitted values ˆyi and residuals ei . Print the...
ANSWER USING R CODE Using the dataset 'LakeHuron' which is a built in R dataset describing...
ANSWER USING R CODE Using the dataset 'LakeHuron' which is a built in R dataset describing the level in feet of Lake Huron from 1872- 1972. To assign the values into an ordinary vector,x, we can do the following 'x <- as.vector(LakeHuron)'. From there, we can access the data easily. Assume the values in X are a random sample from a normal population with distribution X. Also assume the X has an unknown mean and unknown standard deviation. With this...
Please use R studio Dataset: IBM HR Analytics Employee Attrition & Performance dataset (you can download...
Please use R studio Dataset: IBM HR Analytics Employee Attrition & Performance dataset (you can download the dataset from kaggle) Name Description ATTRITION Employee leaving the company (0=no, 1=yes) BUSINESS TRAVEL (1=No Travel, 2=Travel Frequently, 3=Tavel Rarely) DEPARTMENT (1=HR, 2=R&D, 3=Sales) EDUCATION FIELD (1=HR, 2=LIFE SCIENCES, 3=MARKETING, 4=MEDICAL SCIENCES, 5=OTHERS, 6= TEHCNICAL) GENDER (1=FEMALE, 0=MALE) JOB ROLE (1=HC REP, 2=HR, 3=LAB TECHNICIAN, 4=MANAGER, 5= MANAGING DIRECTOR, 6= REASEARCH DIRECTOR, 7= RESEARCH SCIENTIST, 8=SALES EXECUTIEVE, 9= SALES REPRESENTATIVE) MARITAL STATUS (1=DIVORCED,...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT