In: Statistics and Probability
A researcher wanted to know the determinants of SAT scores in the United States of America. Using data from 4,137 survey respondents, the following equation was estimated:
????̂= 1,028.10 + 19.30ℎ???? −2.19ℎ????2
−45.09?????? −169.81 ????? +62.31??????.?????
Standard Error: (6.29) (3.83) (0.53) (4.29) (12.71) (18.15)
R2: 0.0858 n = 4,137
where Sat is the combined SAT score, hsize is the size of the
student’s high school graduating class, in hundreds, female is a
gender dummy variable, and black is a race dummy variable equal to
one for blacks and zero otherwise.
(i) Is there strong evidence that hsize2 should be included in the
model? From this equation, what is the optimal high school
size?
(ii) Holding hsize fixed, what is the estimated difference in SAT score between nonblack females and nonblack males? How statistically significant is this estimated difference?
(iii) What is the estimated difference in SAT score between nonblack males and black males? Test the null hypothesis that there is no difference between their scores,
against the alternative that there is a difference.
(iv) What is the estimated difference in SAT score between black
females and nonblack females? What would you need to do to test
whether the difference is statistically
significant?