Question

Consider the Toluca data (in the file named Toluca.txt). Use R to do the following.
(a) Read the data and fit a linear model. Print the ANOVA table.
(b) Compute 92% confidence intervals for b0 and b1 using the appropriate R command.
(c) Test the hypothesis H0: b1 = 0 vs. Ha: b1 ≠ 0 at level α = 0.05. What is the p-value?
(d) Can you create a confidence interval for the mean amount of time to produce a batch with Lot.Size = 85? What would it be if Lot.Size = 50? What about when Lot.Size = 150? Compute the width of each interval and comment on their widths.
(e) Tomorrow the factory decides to produce a batch of size 85. Compute a 97% prediction interval for the time it will take to produce that batch.

Data (Toluca.txt)

Lot.Size  Work.Hours
80        399
30        121
50        221
90        376
70        361
60        224
120       546
80        352
100       353
50        157
40        160
70        252
90        389
20        113
110       435
100       420
30        212
50        268
90        377
110       421
30        273
90        468
40        244
80        342
70        323
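
A minimal sketch of part (a), assuming the 25 rows listed above are the complete contents of Toluca.txt. The rows are embedded as text so the block runs on its own; the object names toluca and fit are illustrative choices, not part of the question.

```r
# Assumption: these 25 rows are the full contents of Toluca.txt.
# With the file on disk, read.table("Toluca.txt", header = TRUE)
# should produce the same data frame.
toluca <- read.table(header = TRUE, text = "Lot.Size Work.Hours
80 399
30 121
50 221
90 376
70 361
60 224
120 546
80 352
100 353
50 157
40 160
70 252
90 389
20 113
110 435
100 420
30 212
50 268
90 377
110 421
30 273
90 468
40 244
80 342
70 323")

# Part (a): regress Work.Hours on Lot.Size and print the ANOVA table.
fit <- lm(Work.Hours ~ Lot.Size, data = toluca)
print(anova(fit))
```

The ANOVA table splits the total variation in Work.Hours into a Lot.Size row (regression, 1 degree of freedom) and a Residuals row (error, n - 2 = 23 degrees of freedom).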

Solutions
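
One possible way to approach parts (b) through (e) in R is sketched below; it is not a definitive answer. The data are re-entered as vectors so the block runs without the file. The question does not state a confidence level for part (d), so 95% is assumed here. Object names such as ci92, p_slope, and pi85 are illustrative.

```r
# Same 25 observations as the Toluca.txt listing, entered as vectors.
toluca <- data.frame(
  Lot.Size   = c(80, 30, 50, 90, 70, 60, 120, 80, 100, 50, 40, 70, 90,
                 20, 110, 100, 30, 50, 90, 110, 30, 90, 40, 80, 70),
  Work.Hours = c(399, 121, 221, 376, 361, 224, 546, 352, 353, 157, 160, 252, 389,
                 113, 435, 420, 212, 268, 377, 421, 273, 468, 244, 342, 323)
)
fit <- lm(Work.Hours ~ Lot.Size, data = toluca)

# Part (b): 92% confidence intervals for the intercept (b0) and slope (b1).
ci92 <- confint(fit, level = 0.92)
print(ci92)

# Part (c): two-sided t test of H0: b1 = 0; reject at level 0.05 if p < 0.05.
p_slope <- summary(fit)$coefficients["Lot.Size", "Pr(>|t|)"]
print(p_slope)

# Part (d): confidence intervals for the mean response at three lot sizes.
# Assumption: 95% level, since the question does not give one.
new_lots <- data.frame(Lot.Size = c(85, 50, 150))
ci_mean <- predict(fit, newdata = new_lots, interval = "confidence", level = 0.95)
widths <- ci_mean[, "upr"] - ci_mean[, "lwr"]
print(cbind(new_lots, ci_mean, width = widths))

# Part (e): 97% prediction interval for one new batch of size 85.
pi85 <- predict(fit, newdata = data.frame(Lot.Size = 85),
                interval = "prediction", level = 0.97)
print(pi85)
```

For part (d), the interval width grows with the distance of the lot size from the sample mean lot size (70 in these data), so the interval at 85 is narrower than the one at 50, and the one at 150 is the widest. A lot size of 150 also lies outside the observed range of 20 to 120, so that interval is an extrapolation and should be read with caution. The prediction interval in part (e) is wider than a confidence interval for the mean at the same lot size because it also accounts for the variability of a single new batch.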
