In: Statistics and Probability
The data below was obtained in a study of number of hours spent by a salesman in a day (x) versus the number of cars sold (y) by the salesman.
| x | y | 
| 1 | 1 | 
| 2 | 1 | 
| 3 | 2 | 
| 4 | 3 | 
| 5 | 5 | 
a) The correlation coefficient r between X and Y is close to
b)The estimated coefficients of regression line (y = ?0 + ?1x) are:
c)If the number of hours spent is 3, what is the predicted sales
d)Consider the observed data: number of hours spent is 3, and number of cars sold is 2. What is the residual of this observation:
The statistical software output for this problem is:
Simple linear regression results:
Dependent Variable: y
Independent Variable: x
y = -0.6 + 1 x
Sample size: 5
R (correlation coefficient) = 0.94491118
R-sq = 0.89285714
Estimate of error standard deviation: 0.63245553
Parameter estimates:
| Parameter | Estimate | Std. Err. | Alternative | DF | T-Stat | P-value | 
|---|---|---|---|---|---|---|
| Intercept | -0.6 | 0.66332496 | ? 0 | 3 | -0.90453403 | 0.4324 | 
| Slope | 1 | 0.2 | ? 0 | 3 | 5 | 0.0154 | 
Analysis of variance table for regression
model:
| Source | DF | SS | MS | F-stat | P-value | 
|---|---|---|---|---|---|
| Model | 1 | 10 | 10 | 25 | 0.0154 | 
| Error | 3 | 1.2 | 0.4 | ||
| Total | 4 | 11.2 | 
Predicted values:
| X value | Pred. Y | s.e.(Pred. y) | 95% C.I. for mean | 95% P.I. for new | 
|---|---|---|---|---|
| 3 | 2.4 | 0.28284271 | (1.4998683, 3.3001317) | (0.19513652, 4.6048635) | 
Hence,
a) Correlation coefficient = 0.9449
b) Regression equation: y = -0.6 + x
c) For x = 3: y = 2.4
d) Residual = Actual value - Estimated value = 2 - 2.4 = -0.4