Question

In: Statistics and Probability

This is the final grade and number of absences for a set of students. Regress grade...

This is the final grade and number of absences for a set of students. Regress grade on absences. Use a 95% confidence level. Give the equation of estimation. Interpret the equation. According to the regression, now much does a tardy (= 1/2 an absence) change your grade? Evaluate the model. What evaluation criterion did you use? Could this be a case of reverse causality? If so, give an example of how the causation could run in the opposite direction.

Student Grade Absences
1 57 6
2 87.2 0
3 87.6 2.5
4 66.2 6
5 94.2 1
6 96.1 0
7 74.8 2.5
8 86.6 0
9 74.6 4.5
10 90.7 1
11 85.5 1
12 83.4 2.5
13 92.8 1
14 76.7 4
15 78.9 1.5
16 84.6 0
17 84.7 1.5
18 86.3 2.5
19 95.7 0
20 95.3 2
21 87.9 0
22 84.7 0
23 81.6 2
24 70.5 5.5
25 76.7 1
26 90.1 0
27 95.1 1
28 98.2 0
29 66.5 4
30 87.1 0
31 69.8 4.5
32 77.2 2
33 81 0.5
34 76.6 0
35 84.2 0
36 79.1 1.5
37 84.5 3.5
38 71.4 2.5
39 68.3 5
40 92.2 0
41 69.2 5

Solutions

Expert Solution

Regressing Grade on Absences in Excel (go to Data tab -> Data Analysis -> Regression, and choose Grade column as Y-values and Absences column as X-values, Confidence level as 95%) gives us the following output:

Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 89.7151986 1.36692369 65.63292396 1.53875E-41 86.95033447 92.48006274
Absences -3.968040552 0.511113307 -7.763524244 1.96325E-09 -5.001864797 -2.934216306

Equation of estimation obtained using the coefficients above:

Grade = -3.968 * Absences + 89.715

The equation suggests that when there are no absences (Absences=0), the projected Grade is 89.715.

However, the Grade comes down by 3.968 points for every Absence day. Hence, Grade and Absence are negatively correlated.

A tardy (1/2 an absence) brings down the grade by 3.968 * 0.5 = 1.984 points

Looking at the p-value of 1.96325E-09, which is << 0.01, we can say that absences are a very significant predictor of Grades at 99% confidence level. Hence, the model is a very nice estimate of the Grades in terms of Absences.

The model was evaluated as above using the p-value of slope (absences) coefficient, which is much less than the critical p-value, hence we can reject the null hypothesis that the Grades and Absences are uncorrelated ( = 0), at 99% confidence level, in favour of the alternative hypothesis ( != 0).

This could as well be a case of Reverse Causality, where lower Grades lead to more Absences (for example, because of lack of interest in such students in attending classes, owing to lower grades in earlier exams).


Related Solutions

number of absences 6 2 15 9 12 5 8 final grade 82 86 43 74...
number of absences 6 2 15 9 12 5 8 final grade 82 86 43 74 58 90 65 The data below were obtained in a study on the number of absences and the final grades (in percentages) of seven randomly selected students in a statistics class. Number of absences 6 2 15 9 12 5 8 Final grade 82 86 43 74 58 90 65 (2 4 6 8 10 12 14 5 0 6 0 7 0 8...
The table below gives the number of absences and the overall grade in the class for...
The table below gives the number of absences and the overall grade in the class for seven randomly selected students. Based on this data, consider the equation of the regression line, y ?? = b?_0 ? + b?_1 x, for using the number of absences to predict a student's overall grade in the class. Keep in mind, the correlation coefficient may or may not be statistically significant for the data given. Remember, in practice, it would not be appropriate to...
The table below gives the number of absences and the overall grade in the class for...
The table below gives the number of absences and the overall grade in the class for seven randomly selected students. Based on this data, consider the equation of the regression line, yˆ=b0+b1xy^=b0+b1x, for using the number of absences to predict a student's overall grade in the class. Keep in mind, the correlation coefficient may or may not be statistically significant for the data given. Remember, in practice, it would not be appropriate to use the regression line to make a...
Number of absences (x) Final exam score (Y) 0                                   
Number of absences (x) Final exam score (Y) 0                                             89.4 1                                             87.2 2                                            84.5 3                                             81.7 4                                             77.3 5                                             74.8 6                                             63.7 7                                             72.1 8                                             66.4 9                                             65.8 Critical Values for Correlation Coefficient n 3     0.997 4     0.950 5     0.878 6     0.811 7     0.754 8    0.707 9     0.666 10   0.632 11   0.602 12   0.576 13   0.553 14   0.532 15   0.514 16   0.497 17   0.482 18   0.468 19   0.456 20   0.444 21   0.433 22   0.423 23   0.413 24   0.404 25   0.396 26  ...
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 1 2 3 4 6 7 8 Grade 5 4.5 4 3.5 2.5 2 1 Step 1 of 3: Calculate the correlation coefficient, r. Round your answer to six decimal places. Step 2 of 3: Determine if r is statistically significant at the 0.01 level. Step 3 of 3: Calculate the coefficient of determination, r2 . Round your answer to...
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 2 3 3 5 5 6 6 Grade 3.8 3.6 3.2 2.5 2.1 2 1.7 Table Step 1 of 5 : Calculate the sum of squared errors (SSE). Use the values b0=4.8669 and b1=−0.5056 for the calculations. Round your answer to three decimal places. Step 2 of 5 : Calculate the estimated variance of errors, s2e. Round your answer to...
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 3 3 3 4 4 4 6 Grade 3.9 3.7 3.7 2.7 2.5 1.8 1.8 Calculate the estimated variance of slope, s^2b1. Round your answer to three decimal places. Construct the 90% confidence interval for the slope. Round your answers to three decimal places. Construct the 90% confidence interval for the slope. Round your answers to three decimal places. Construct...
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 3 4 4 6 6 7 8 Grade 3.9 3.8 3.2 3 2.8 2.8 2.52 Step 3 of 5 : Calculate the estimated variance of slope, s2b1. Round your answer to three decimal places.
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 0 1 2 3 5 6 7 Grade 4.5 4 3.5 3 2.5 2 1.5 Step 1 of 3: Calculate the correlation coefficient, r. Round your answer to six decimal places. Calculate the coefficient of determination, r2r2. Round your answer to three decimal places. Determine if r is statistically significant at the 0.050.05 level.
The data in the table is the number of absences for 7 students and their corresponding...
The data in the table is the number of absences for 7 students and their corresponding grade. Number of Absences 1 1 2 3 3 6 7 Grade 3.9 3.8 3.7 3.5 2.7 2.4 2 Step 1 of 5: Calculate the sum of squared errors (SSE). Use the values b0=4.1440 and b1=−0.3047 for the calculations. Round your answer to three decimal places. Step 2 of 5: Calculate the estimated variance of errors, Round your answer to three decimal places. Step...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT