Homework #3-2
a) (Use software for this problem) Four independent samples are collected from four normally distributed populations. The data are:
Group 1: 12 11 14 10 12 10
Group 2: 14 12 16 15
Group 3: 17 18 20 22 23
Group 4: 10 9 13 13
The SST is equal to 305.68. Conduct a test of the null hypothesis that the group means are equal. Use a 5% significance level.
b) (Use software for this problem) The following pairs of observations were collected.
X 1 2 3 4 5
Y 2 6 4 10 15
a. Plot the values of X and Y. What relationship does the scatter diagram suggest?
b. Find the least squares line
c. Find the predicted value for each value of X
d. Find the residual for each predicted value of Y.
e. Verify that the sum of the residuals in part d is zero.
a).
One factor ANOVA

| Group | Mean | n | Std. Dev |
|---|---|---|---|
| Group 1 | 11.5 | 6 | 1.52 |
| Group 2 | 14.3 | 4 | 1.71 |
| Group 3 | 20.0 | 5 | 2.55 |
| Group 4 | 11.3 | 4 | 2.06 |
| Total | 14.3 | 19 | 4.12 |

ANOVA table

| Source | SS | df | MS | F | p-value |
|---|---|---|---|---|---|
| Treatment | 246.68 | 3 | 82.228 | 20.91 | 0.0000129 |
| Error | 59.00 | 15 | 3.933 | | |
| Total | 305.68 | 18 | | | |
H0: All four population means are equal.
H1: At least one pair of population means is different.
Calculated F = 20.91, p = 0.00001, which is less than the 0.05 level of significance.
The null hypothesis is rejected.
The data indicate a significant difference among the four group means.
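As a cross-check on the software output, the sums of squares and the F statistic can be recomputed by hand. The following is a minimal sketch in plain Python. The variable names are illustrative, and the critical value 3.29 for F(0.05; 3, 15) is an assumption taken from a standard F table, not from the software output.

```python
# One-way ANOVA computed by hand for the four groups in part a).
groups = {
    "Group 1": [12, 11, 14, 10, 12, 10],
    "Group 2": [14, 12, 16, 15],
    "Group 3": [17, 18, 20, 22, 23],
    "Group 4": [10, 9, 13, 13],
}

all_values = [v for g in groups.values() for v in g]
n_total = len(all_values)
grand_mean = sum(all_values) / n_total

# Between-group (treatment) sum of squares.
ss_treatment = sum(
    len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
)
# Within-group (error) sum of squares.
ss_error = sum(
    (v - sum(g) / len(g)) ** 2 for g in groups.values() for v in g
)
ss_total = ss_treatment + ss_error

df_treatment = len(groups) - 1      # 4 groups - 1 = 3
df_error = n_total - len(groups)    # 19 observations - 4 groups = 15

f_stat = (ss_treatment / df_treatment) / (ss_error / df_error)

# Assumed tabulated critical value F(0.05; 3, 15) from a standard F table.
F_CRITICAL = 3.29
reject_null = f_stat > F_CRITICAL

print(f"SS treatment = {ss_treatment:.2f}, SS error = {ss_error:.2f}, "
      f"SS total = {ss_total:.2f}")
print(f"F = {f_stat:.2f}, reject H0: {reject_null}")
```

Comparing F with a critical value leads to the same decision as comparing the p-value with 0.05. The exact p-value requires the F distribution, which is why the problem asks for software.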
b).

a).
The plot suggests a positive relationship between X and Y: Y tends to increase as X increases.
b).
Regression Analysis

| Statistic | Value |
|---|---|
| r² | 0.840 |
| r | 0.916 |
| Std. Error of Estimate | 2.394 |
| n | 5 |
| k | 1 |
| Dep. Var. | y |

Regression output

| Variable | Coefficient | Std. error | t (df=3) | p-value | 95% CI lower | 95% CI upper |
|---|---|---|---|---|---|---|
| Intercept | a = -1.600 | | | | | |
| x | b = 3.000 | 0.757 | 3.962 | .0287 | 0.590 | 5.410 |

ANOVA table

| Source | SS | df | MS | F | p-value |
|---|---|---|---|---|---|
| Regression | 90.000 | 1 | 90.000 | 15.70 | .0287 |
| Residual | 17.200 | 3 | 5.733 | | |
| Total | 107.200 | 4 | | | |
Regression line: y = -1.6 + 3.0x
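The slope and intercept can also be obtained directly from the least squares formulas b = Sxy / Sxx and a = ȳ - b·x̄. This is a minimal sketch in plain Python with illustrative variable names. It is not the software routine that produced the output above.

```python
# Least squares line for the five (X, Y) pairs in part b).
x = [1, 2, 3, 4, 5]
y = [2, 6, 4, 10, 15]

n = len(x)
x_mean = sum(x) / n
y_mean = sum(y) / n

# Sum of cross-products and sum of squared deviations of x.
s_xy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
s_xx = sum((xi - x_mean) ** 2 for xi in x)

slope = s_xy / s_xx                    # b
intercept = y_mean - slope * x_mean    # a

print(f"y = {intercept:.1f} + {slope:.1f}x")
```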
c), d), e). The predicted values, the residuals, and the sum of the residuals are all given in the following table.
| x | y | Predicted | Residual |
|---|---|---|---|
| 1 | 2.0 | 1.4 | 0.6 |
| 2 | 6.0 | 4.4 | 1.6 |
| 3 | 4.0 | 7.4 | -3.4 |
| 4 | 10.0 | 10.4 | -0.4 |
| 5 | 15.0 | 13.4 | 1.6 |
| Total | | | 0.0 |
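The predicted values, the residuals, and their sum can be reproduced from the fitted coefficients a = -1.6 and b = 3.0 reported above. The sketch below uses plain Python with illustrative variable names. It also recomputes the residual sum of squares and r² as a consistency check against the regression output.

```python
# Predicted values and residuals for the fitted line y = -1.6 + 3.0x.
x = [1, 2, 3, 4, 5]
y = [2, 6, 4, 10, 15]
intercept, slope = -1.6, 3.0   # coefficients taken from the regression output

predicted = [intercept + slope * xi for xi in x]
residuals = [yi - pi for yi, pi in zip(y, predicted)]

# With an intercept in the model, least squares residuals sum to zero
# (up to floating-point rounding).
residual_sum = sum(residuals)

# Residual and total sums of squares, and the coefficient of determination.
y_mean = sum(y) / len(y)
sse = sum(r ** 2 for r in residuals)
sst = sum((yi - y_mean) ** 2 for yi in y)
r_squared = 1 - sse / sst

for xi, yi, pi, ri in zip(x, y, predicted, residuals):
    print(f"x={xi}  y={yi:.1f}  predicted={pi:.1f}  residual={ri:.1f}")
print(f"sum of residuals = {residual_sum:.1f}")
print(f"SSE = {sse:.1f}, SST = {sst:.1f}, r^2 = {r_squared:.3f}")
```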