Question

Using the data on 4137 college students, the following equation was estimated by OLS:

$$\mathit{colgpa}_i = \beta_0 + \beta_1\,\mathit{hsperc}_i + u_i, \qquad i = 1, 2, \dots, 4137$$

where colgpa is measured on a four-point scale and hsperc is the percentile in the high school graduating class (defined so that, for example, hsperc = 5 means the top 5 percent of the class).

  Coefficients:
                Estimate  Std. Error  t value  Pr(>|t|)
  (Intercept)  2.9803872   0.0141800    210.2    <2e-16 ***
  hsperc      -0.0170349   0.0005585    -30.5    <2e-16 ***
  ---
  Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

  Residual standard error: 0.5952 on 4135 degrees of freedom
  Multiple R-squared: 0.1836, Adjusted R-squared: 0.1834
  F-statistic: 930.2 on 1 and 4135 DF, p-value: < 2.2e-16
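
The printed summary can be checked for internal consistency. The sketch below uses Python (an assumption; the output itself appears to come from R) to recompute each t value as estimate divided by standard error. It then applies two identities that hold for a regression with a single regressor: F = t² and R² = F / (F + residual df). All variable names are illustrative.

```python
# Values copied from the regression output above.
intercept, se_intercept = 2.9803872, 0.0141800
slope, se_slope = -0.0170349, 0.0005585
df_resid = 4135          # residual degrees of freedom (4137 - 2)

# t value = estimate / standard error
t_intercept = intercept / se_intercept
t_slope = slope / se_slope

# With one regressor, the F-statistic equals the squared slope t value,
# and R-squared can be recovered from F and the residual df.
f_stat = t_slope ** 2
r_squared = f_stat / (f_stat + df_resid)

print(f"t (Intercept) = {t_intercept:.1f}")
print(f"t (hsperc)    = {t_slope:.1f}")
print(f"F             = {f_stat:.1f}")
print(f"R-squared     = {r_squared:.4f}")
```

Because the estimates and standard errors are printed with limited precision, the recomputed F and R-squared should match the reported 930.2 and 0.1836 only approximately.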

  (i) Why does it make sense for the coefficient on hsperc to be negative?

  (ii) Interpret the coefficient on hsperc.

  (iii) Is it statistically different from zero at the 5% level?

  (iv) What other factors do you think might be relevant for explaining colgpa?

  (v) Are these other factors likely to be correlated with hsperc? If so, what can you say about the interpretation of the coefficient on hsperc?

Solutions

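i) It makes sense for the coefficient on hsperc to be negative because hsperc is defined so that smaller values mean a better class rank (hsperc = 5 means the top 5 percent of the class). Students who ranked nearer the top of their high school class (lower hsperc) tend, on average, to earn higher college GPAs, so predicted colgpa falls as hsperc rises. This is a statement about association; the regression alone does not show that a lower percentile causes a higher college GPA.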

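ii) The coefficient on hsperc is -0.0170349. Interpretation:
When hsperc increases by one percentile point (that is, the student ranks one point further from the top of the high school class), predicted college GPA decreases by about 0.017 points on average. Equivalently, a 10-point increase in hsperc is associated with a college GPA about 0.17 points lower.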
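
As a rough illustration of this interpretation, the fitted line can be evaluated at two percentile values. The function name `predicted_colgpa` and the choice of comparison points (top 5 percent versus top 15 percent) are assumptions made for this sketch; only the two coefficients come from the output in the question.

```python
# Coefficients copied from the regression output in the question.
B0 = 2.9803872    # intercept
B1 = -0.0170349   # slope on hsperc

def predicted_colgpa(hsperc: float) -> float:
    """Fitted college GPA for a given high school percentile."""
    return B0 + B1 * hsperc

# Compare a student in the top 5 percent with one in the top 15 percent.
gap = predicted_colgpa(15) - predicted_colgpa(5)
print(f"hsperc = 5:  {predicted_colgpa(5):.3f}")
print(f"hsperc = 15: {predicted_colgpa(15):.3f}")
print(f"difference:  {gap:.3f}")
```

The difference equals ten times the slope, about -0.170 GPA points, whichever starting percentile is chosen, because the model is linear in hsperc.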

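iii) The null hypothesis is H0: β1 = 0 against the two-sided alternative H1: β1 ≠ 0. The reported t-statistic is -30.5. With 4135 degrees of freedom, the two-sided critical value at the 5 percent level is about 1.96. Since |-30.5| = 30.5 > 1.96, we reject the null hypothesis; the reported p-value (< 2e-16) leads to the same conclusion. Hence, the coefficient on hsperc is statistically different from zero at the 5 percent level.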
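
The test decision can be reproduced with a few lines of Python. This is a sketch under one stated assumption: the critical value is taken from the standard normal distribution (via the standard library's `statistics.NormalDist`) rather than from the t distribution, which is a close approximation at 4135 degrees of freedom. The confidence interval is an addition for illustration and is not part of the original answer.

```python
from statistics import NormalDist

slope, se_slope = -0.0170349, 0.0005585   # from the regression output

# Two-sided 5% critical value. With 4135 degrees of freedom the t
# distribution is very close to the standard normal, so the normal
# quantile is used here as an approximation.
crit = NormalDist().inv_cdf(0.975)

t_stat = slope / se_slope
reject_null = abs(t_stat) > crit

# Approximate 95% confidence interval for the slope.
ci_low = slope - crit * se_slope
ci_high = slope + crit * se_slope

print(f"critical value: {crit:.2f}")
print(f"t statistic:    {t_stat:.1f}")
print(f"reject H0:      {reject_null}")
print(f"95% CI:         ({ci_low:.4f}, {ci_high:.4f})")
```

An interval that lies entirely below zero is another way of seeing that the null hypothesis of a zero slope is rejected at the 5 percent level.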

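iv) Other factors that might help explain a student's college GPA include family income and health, among others.
A student from a higher-income family can more easily afford tuition, books, and other resources than a student from a lower-income family, and may therefore perform better academically.
A physically and mentally healthy student is expected to perform better than an unhealthy one.

v) These factors are likely to be correlated with hsperc as well, since family income and health plausibly affected high school performance too. Because they are left out of the regression and absorbed into the error term u, the coefficient on hsperc partly picks up their effects (omitted variable bias). It should therefore be read as a measure of association rather than as the ceteris paribus effect of high school percentile on college GPA.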
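
Part (v) of the question asks what happens to the interpretation of the slope when relevant factors are correlated with hsperc but left out of the regression. The simulation below is a sketch on entirely made-up data, not the 4137-student sample: the unobserved factor `z`, the data-generating equations, and every numeric parameter are hypothetical choices made only to show the mechanism.

```python
import random

rng = random.Random(0)   # fixed seed so the run is repeatable
n = 5000
TRUE_SLOPE = -0.01       # effect of hsperc on colgpa in this made-up model

hsperc, colgpa = [], []
for _ in range(n):
    z = rng.gauss(0, 1)                     # unobserved factor (hypothetical)
    h = 50 - 10 * z + rng.gauss(0, 10)      # higher z -> better (lower) percentile
    y = 3.0 + TRUE_SLOPE * h + 0.3 * z + rng.gauss(0, 0.5)
    hsperc.append(h)
    colgpa.append(y)

# Simple-regression OLS slope, cov(x, y) / var(x), with z omitted.
mean_h = sum(hsperc) / n
mean_y = sum(colgpa) / n
cov_hy = sum((h - mean_h) * (y - mean_y) for h, y in zip(hsperc, colgpa))
var_h = sum((h - mean_h) ** 2 for h in hsperc)
ols_slope = cov_hy / var_h

print(f"true slope:      {TRUE_SLOPE}")
print(f"estimated slope: {ols_slope:.4f}")
```

In this setup the omitted factor raises colgpa and lowers hsperc, so the estimated slope comes out more negative than the true one (around -0.025 in expectation under these parameters). The simulation says nothing about the size or direction of any bias in the actual estimate of -0.017.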

