In: Math
A sociologist was hired by a large city hospital to investigate
the relationship between the number of unauthorized days that
employees are absent per year and the distance (miles) between home
and work for the employees. A sample of 10 employees was chosen,
and the following data were collected. Use the estimated
regression equation developed in part (c) to develop a 95%
confidence interval for the expected number of days absent for
employees living 5 miles from the company (to 1 decimal). Least
squares equation from part c:
Days Absent = 7.269 + -0.194 Distance
Distance to Work | Number of Days Absent |
1 | 9 |
4 | 6 |
4 | 9 |
6 | 8 |
8 | 7 |
10 | 4 |
12 | 7 |
14 | 3 |
14 | 6 |
18 | 3 |
Let Y=Number of Days Absent
X=Distance to Work
The model that we want to estimate is
We calculate the following
n=10 is the sample size
The sample means are
The sum of squares are
The estimated value of the slope is
The estimated value of intercept is
The estimated regression line is
that is the answer in part c should be
(Note: This is different from what has been pasted, but for the data given, the above is correct )
Now the rest
The sum of square errors is
The mean square Error is
The standard error of regression is
the estimated number of days absent for employees living miles () from the company is
The standard error of for is
The level of significance for 95% confidence interval is
the right tail critical value is obtained using
The degrees of freedom for t is n-2=10-2=8
Using the t tables for df=8 and area under the right tail=0.025 (combined area under both tails=0.05), we get
Now the 95% confidence interval
ans: a 95% confidence interval for the expected number of days absent for employees living 5 miles from the company is [6.2,8.9]