Question

In: Statistics and Probability

(3) Suppose that you run a regression of wage on a set of independent variables: age,...

(3) Suppose that you run a regression of wage on a set of independent variables: age, years of education, experience, and a dummy variable married. There are 100 observations, i.e., n=100. You want to test whether all the coefficient and intercept estimates in the equation for married and unmarried are equal to each other. i.e.,

Married: wage = β0 + β1*age + β2*educ + β3*exper + uUnmarried: wage = α0 + α1*age + α2*educ + α3*exper + v

(a) Write down the null hypothesis and alternative hypothesis.
(b) Write down the formula to calculate the Chow statistic. You don’t need to compute a numerical value. Just provide the formula.
(b) Suppose the critical value is 2.5. When will the null hypothesis get rejected?

(4) Provide the steps of implementing a hypothesis test.
(5) What is the formula for variance estimator in White(1980) to deal with heteroskedasticity?

Solutions

Expert Solution

a)

Null Hypothesis:

H0: Pooled model is correct ie there is no significant improvement in fit from running two regressions.

Alternate Hypothesis:

Ha: Pooled model is not correct ie there is significant improvement in fit from running two regressions.

b)

RSSA is defined as the RSS using only subsample of Married

RSSB is defined as the RSS using only subsample of Unmarried

RSSP is defined as the RSS using the entire(pooled) sample

The Chow test is an F test with the following F-statistic:

F = {(RSSP – RSSA – RSSB ­­­­­­­­)/k}/ {(RSSA + RSSB)/(n-2k)}

where k is the number of parameters in pooled model

c)

If our F value is greater than critical value, then we reject the null hypothesis ie if out F value is greater than 2.5, we will reject the null hypothesis.

4)

The steps of a hypothesis test are:

  • Specify the Null Hypothesis.
  • Specify the Alternative Hypothesis.
  • Set the Significance Level (alpha)
  • Calculate the Test Statistic and Corresponding P-Value.
  • Draw a Conclusion.

5)

Steps of testing heteroskedasticity are:

  • Estimate model using OLS:

  • Obtain the predicted Y values after estimating our model.
  • Estimate the model using OLS:

  • Retain the R-squared value from this regression:
  • Calculate the F-statistic or the chi-squared statistic:

The degrees of freedom for the F-test are equal to 2 in the numerator and n – 3 in the denominator. The degrees of freedom for the chi-squared test are 2. If the test statistics is significant, then we have evidence of heteroskedasticity. If not, we fail to reject the null hypothesis of homoskedasticity.


Related Solutions

Assume you run a regression on two different models (sets of independent variables). The first model...
Assume you run a regression on two different models (sets of independent variables). The first model results in an ?2R2 of 0.60 and the second model results in an ?2R2 of 0.64. This would mean that Select one: a. The independent variables in the first model are not statistically significant, despite what the t-stats and p-values suggest. b. The second model has better overall explanatory power, but a better ?2R2 is only gained at the expense of other factors. The...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression equation y=0.843x+6.762 with a correlation coefficient of r=0.954 which is significant at α=0.01 You want to predict what value (on average) for the explanatory variable will give you a value of 150 on the response variable. What is the predicted explanatory value? x = (Report answer accurate to one decimal place.) Here is a bivariate data set. Find the regression equation for the response...
The run times of two computer programs, A and B, are independent random variables. The run...
The run times of two computer programs, A and B, are independent random variables. The run time of program A is an exponential random variable with mean 3 minutes, while that of program B is an Erl(2,lambda) random variable with mean 3 minutes. If both programs are being executed on two identical computers, what is the probability that program B will finish first?
1. For a data set with 3 variables and 3 observations, suppose Xbar, the sample mean...
1. For a data set with 3 variables and 3 observations, suppose Xbar, the sample mean vector is [5, 3, 4]’. Let b’ = (1 1 1) and c’ = (1 2 -3). The sample covariance matrix is given as, S = ( 13 −3.5 1.5;  −3.5 1 −1.5 ; 1.5 −1.5 3 ) (a) Find the sample mean and variance for b’X and c’X. (b) Find the sample mean and variance for c’X. (c) Find the covariance between b’X and...
Question 1. In this exercise you will simulate a data set and run a simple regression....
Question 1. In this exercise you will simulate a data set and run a simple regression. To ensure reproducible results, make sure you use set.seed(1). a)   Using the rnorm() function create vector X that contains 200 observations from N(0,1) distribution. Similarly, create a 200 element vector, epsilon (ϵ), drawn from N(0,0.25) distribution. This is the irreducible error. b)   Create the response data using the following relationship: Y=−1+0.5X+ϵ Fit a linear regression of Y on X. Display the summary statistics and...
In a single variable regression analysis the respective values for the variables are as follows: Independent...
In a single variable regression analysis the respective values for the variables are as follows: Independent variable: 2.00, 7.00, 11.00, 16.00, 13.00, 14.00, 1.00, 12.00, 19.00, 9.00. Dependent variable: 58.78, 44.64, 33.27, 16.04, 26.72, 24.89, 63.11, 28.41, 7.66, 39.20. What is the predicted value of the dependent variable when the independent variable is equal to 15.00?
create a data set of one dependent variable and 3 independent variables from statistics Canada related...
create a data set of one dependent variable and 3 independent variables from statistics Canada related to COVID-19 (could be anything)
create a data set of one independent variable and 3 dependent variables from statistics Canada related...
create a data set of one independent variable and 3 dependent variables from statistics Canada related to covid-19 (could be anything)
e. Multiple Regression. Identify at least 3 variables for which you could calculate a multiple regression....
e. Multiple Regression. Identify at least 3 variables for which you could calculate a multiple regression. Describe the variables and their scale of measurement. Which variables would you include as the predictor variables and which as the outcome variable? Why? Which regression method would you use and why? What would R2 and adjusted R2 tell you about the relationship between the variables?
a) Run a regression analysis on the following bivariate set of data with y as the...
a) Run a regression analysis on the following bivariate set of data with y as the response variable. x y 10.7 81.6 13.7 81.5 36.7 56.5 4 72.1 50.7 23.2 47.6 -4.8 37.3 31.9 24.3 75.2 21.5 59.3 17.2 54.6 23.6 75.5 22.2 60.8 29.3 51 14 63.4 0.2 102.7 30.7 48.2 10.3 74.8 26.5 48.2 23.1 87 Verify that the correlation is significant at an ?=0.05?=0.05. If the correlation is indeed significant, predict what value (on average) for the...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT