Question

Problem 2. Consider the FLAG data set. The first 10 observations are given for informational purposes.

CONTRACT COST DOTEST STATUS
1 1379.43 1386.29 1
2 134.03 85.71 1
3 202.33 248.89 0
4 397.12 467.49 0
5 158.54 117.72 1
6 1128.11 1008.91 1
7 400.33 472.98 1
8 581.64 785.39 0
9 353.96 370.02 0
10 138.71 174.25 0

b) Calculate a confidence and prediction interval for DOTEST = 110.

c) Interpret the confidence and prediction intervals given in the output. Do you see any problems with the interpretation of the prediction interval in terms of what we are trying to predict?

d) Why are confidence intervals always more narrow than prediction intervals?

Solutions

Expert Solution

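The regression equation is defined as,

Y-hat = b0 + b1 X

where Y is COST and X is DOTEST.

The least squares estimates of the slope and intercept are,

b1 = [SUM(XY) - SUM(X) SUM(Y) / n] / [SUM(X^2) - (SUM(X))^2 / n]
b0 = Y-bar - b1 X-bar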

CONTRACT COST(Y) DOTEST(X) X^2 XY
1 1379.43 1386.29 1921800 1912290
2 134.03 85.71 7346.204 11487.71
3 202.33 248.89 61946.23 50357.91
4 397.12 467.49 218546.9 185649.6
5 158.54 117.72 13858 18663.33
6 1128.11 1008.91 1017899 1138161
7 400.33 472.98 223710.1 189348.1
8 581.64 785.39 616837.5 456814.2
9 353.96 370.02 136914.8 130972.3
10 138.71 174.25 30363.06 24170.22
SUM 4874.2 5117.65 4249222 4117915

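From the data values, with n = 10, the estimates are calculated as,

b1 = (4117915 - 5117.65 × 4874.2 / 10) / (4249222 - 5117.65^2 / 10) ≈ 1623470 / 1630188 ≈ 0.99588
b0 = 487.42 - 0.99588 × 511.765 ≈ -22.236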
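As a cross-check of the arithmetic, the following minimal Python sketch refits the line from the ten listed rows. It assumes only those ten rows are used (not the full FLAG data set), and the variable names are illustrative.

```python
# Least-squares slope and intercept from the ten listed observations only.
cost = [1379.43, 134.03, 202.33, 397.12, 158.54,
        1128.11, 400.33, 581.64, 353.96, 138.71]      # Y
dotest = [1386.29, 85.71, 248.89, 467.49, 117.72,
          1008.91, 472.98, 785.39, 370.02, 174.25]    # X

n = len(cost)
sum_x = sum(dotest)
sum_y = sum(cost)
sum_xx = sum(x * x for x in dotest)
sum_xy = sum(x * y for x, y in zip(dotest, cost))

# Corrected sums of squares and cross-products.
s_xx = sum_xx - sum_x ** 2 / n
s_xy = sum_xy - sum_x * sum_y / n

slope = s_xy / s_xx
intercept = sum_y / n - slope * sum_x / n

print(f"slope = {slope:.5f}, intercept = {intercept:.3f}")
```

The printed coefficients should agree with the fitted line used in the residual table, Y-hat = -22.236 + 0.99588X, up to rounding.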

b)

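The confidence interval for the mean COST at DOTEST = x0 is defined as,

Y-hat0 ± t(α/2, n-2) × s × sqrt(1/n + (x0 - X-bar)^2 / Sxx)

where Y-hat0 = b0 + b1 x0 and Sxx = SUM(X^2) - (SUM(X))^2 / n.

The standard error of regression is calculated as,

s = sqrt(SUM(Y - Y-hat)^2 / (n - 2))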

CONTRACT COST(Y) DOTEST(X) Y-hat=-22.236+0.99588X (Y-Y-hat) (Y-Y-hat)^2
1 1379.43 1386.29 1358.3411 21.0889 444.7427
2 134.03 85.71 63.1208 70.9092 5028.1181
3 202.33 248.89 225.6283 -23.2983 542.8112
4 397.12 467.49 443.3275 -46.2075 2135.1291
5 158.54 117.72 94.9989 63.5411 4037.4762
6 1128.11 1008.91 982.5163 145.5937 21197.5366
7 400.33 472.98 448.7948 -48.4648 2348.8401
8 581.64 785.39 759.9174 -178.2774 31782.8276
9 353.96 370.02 346.2591 7.7009 59.3034
10 138.71 174.25 151.2959 -12.5859 158.4049
SUM 67735.1899
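The residual column and its sum of squares can be reproduced with a short Python sketch. It refits the line at full precision rather than using the rounded coefficients, and it assumes the usual n - 2 degrees of freedom for simple linear regression; the names are illustrative.

```python
import math

# The ten listed observations only.
cost = [1379.43, 134.03, 202.33, 397.12, 158.54,
        1128.11, 400.33, 581.64, 353.96, 138.71]      # Y
dotest = [1386.29, 85.71, 248.89, 467.49, 117.72,
          1008.91, 472.98, 785.39, 370.02, 174.25]    # X

n = len(cost)
x_bar = sum(dotest) / n
y_bar = sum(cost) / n
s_xx = sum((x - x_bar) ** 2 for x in dotest)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(dotest, cost))
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Residuals Y - Y-hat, their sum of squares, and the standard error of regression.
residuals = [y - (intercept + slope * x) for x, y in zip(dotest, cost)]
sse = sum(r * r for r in residuals)
s = math.sqrt(sse / (n - 2))

print(f"SSE = {sse:.4f}, s = {s:.4f}")
```

The SSE should match the SUM row of the table up to rounding, and s is its square root after dividing by n - 2 = 8.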

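The prediction interval for the COST of a single new contract at DOTEST = x0 is obtained using the formula,

Y-hat0 ± t(α/2, n-2) × s × sqrt(1 + 1/n + (x0 - X-bar)^2 / Sxx)

where Y-hat0 = b0 + b1 x0 and Sxx = SUM(X^2) - (SUM(X))^2 / n.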
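A minimal Python sketch of both intervals at DOTEST = 110 follows, using the standard simple-linear-regression formulas. Two things are assumptions here rather than facts from the problem: the 95% confidence level, which the problem does not state, and the hard-coded critical value t(0.025, 8) = 2.306. Only the ten listed rows are used.

```python
import math

cost = [1379.43, 134.03, 202.33, 397.12, 158.54,
        1128.11, 400.33, 581.64, 353.96, 138.71]      # Y
dotest = [1386.29, 85.71, 248.89, 467.49, 117.72,
          1008.91, 472.98, 785.39, 370.02, 174.25]    # X

n = len(cost)
x_bar = sum(dotest) / n
y_bar = sum(cost) / n
s_xx = sum((x - x_bar) ** 2 for x in dotest)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(dotest, cost))
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar
sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(dotest, cost))
s = math.sqrt(sse / (n - 2))

X0 = 110.0
T_CRIT = 2.306  # assumed: t(0.025, df = 8), i.e. a 95% level

y0_hat = intercept + slope * X0
h0 = 1 / n + (X0 - x_bar) ** 2 / s_xx        # variance factor for the mean response

ci_margin = T_CRIT * s * math.sqrt(h0)       # mean COST at X0
pi_margin = T_CRIT * s * math.sqrt(1 + h0)   # single new COST at X0

ci = (y0_hat - ci_margin, y0_hat + ci_margin)
pi = (y0_hat - pi_margin, y0_hat + pi_margin)

print(f"fit at {X0}: {y0_hat:.2f}")
print(f"confidence interval: ({ci[0]:.2f}, {ci[1]:.2f})")
print(f"prediction interval: ({pi[0]:.2f}, {pi[1]:.2f})")
```

Under these assumptions both lower limits come out negative. That is worth keeping in mind for part c), since a contract cost cannot be negative.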

c)

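The prediction interval gives a range that is expected, at the stated confidence level, to contain the COST of a single new contract whose DOTEST is 110.

The confidence interval gives a range for the mean COST (the response variable) of all contracts whose DOTEST is 110.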
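For part d), one standard way to compare the two widths is through the variances behind them. The sketch below assumes the usual simple linear regression model with independent errors of constant variance \sigma^2; here \hat{y}_0 is the fitted value at x_0, y_0 is a new observation at x_0, and S_{xx} = \sum (x_i - \bar{x})^2.

```latex
\[
\operatorname{Var}(\hat{y}_0)
  = \sigma^2\left(\frac{1}{n} + \frac{(x_0-\bar{x})^2}{S_{xx}}\right),
\qquad
\operatorname{Var}(y_0 - \hat{y}_0)
  = \sigma^2\left(1 + \frac{1}{n} + \frac{(x_0-\bar{x})^2}{S_{xx}}\right).
\]
```

The extra term 1 is the variance of the new observation's own random error, which is added to the uncertainty in the estimated mean. With the same t critical value and the same s, the prediction interval is therefore wider than the confidence interval at every x_0.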

