Question

In: Statistics and Probability

Please explain what the assumption of Cov(x,u)=0 means and why it is so important for regression...

Please explain what the assumption of Cov(x,u)=0 means and why it is so important for regression to return unbiased estimates of the relationship of interest.

Solutions

Expert Solution

Let us first understand the assumption of Cov(x,u)=0

Assumptions :

Let us do by making a simple assumption about the error

The average value of u, the error term, in the population is 0 that is E(u)=0

This is not a restrictive assumption, since we can always use to normalize E(u) to 0

e.g. use   to normalize average ability to zero

We have  to assume that the average value of u does not depend on the value of x

x and u are independent, that is

We need some way of estimating the true relationship using the sample data

We Can do this using some of our assumptions First we need to realize that our main assumption of E(u|x) = E(u) = 0

also implies Cov(x,u) = E(xu) = 0

because  From basic probability we know that: Cov(x,u) = E(xu) – E(x)E(u)

given E(u)=0 (by assumption) and Cov(x,u)=0 (by same assumption plus independence) then E(xu)=0


Related Solutions

Please solve the following: ut=uxx, 0<x<1, t>0 u(0,t)=0, u(1,t)=A, t>0 u(x,0)=cosx, 0<x<1
Please solve the following: ut=uxx, 0<x<1, t>0 u(0,t)=0, u(1,t)=A, t>0 u(x,0)=cosx, 0<x<1
If Var(X) = 1, Var(Y) = 2 and Cov(X,Y) = 0, what is Cov(X-2Y, Y-X2)?
If Var(X) = 1, Var(Y) = 2 and Cov(X,Y) = 0, what is Cov(X-2Y, Y-X2)?
Use separation of variables to solve uxx+2uy+uyy=0, u(x, 0) = f(x), u(x, 1) = 0, u(0,...
Use separation of variables to solve uxx+2uy+uyy=0, u(x, 0) = f(x), u(x, 1) = 0, u(0, y) = 0, u(1, y) = 0.
Please explain what the term collinearity (or multicollinearity in the multiple regression context) means. Does it...
Please explain what the term collinearity (or multicollinearity in the multiple regression context) means. Does it affect our regression estimates (i.e., betas) or their variances? If so, please explain how? Does multicollinearity affect the chances of making either a Type I or Type II error? If so, how so?  
Illustrate why outliers are so important to identify in the correlation/regression context.
Illustrate why outliers are so important to identify in the correlation/regression context.
1. Let X and Y be independent U[0, 1] random variables, so that the point (X,...
1. Let X and Y be independent U[0, 1] random variables, so that the point (X, Y) is uniformly distributed in the unit square. Let T = X + Y. (a) Find P( 2Y < X ). (b). Find the CDF F(t) of T (for all real numbers t). HINT: For any number t, F(t) = P ( X <= t) is just the area of a part of the unit square. (c). Find the density f(t). REMARK: For a...
Consider a semi-infinite square well: U(x)=0 for 0 ≤ x ≤ L, U(x)=U0 for x >...
Consider a semi-infinite square well: U(x)=0 for 0 ≤ x ≤ L, U(x)=U0 for x > L, and U(x) is infinity otherwise. Determine the wavefunction for E < Uo , as far as possible, and obtain the transcendental equation for the allowable energies E. Find the necessary condition(s) on E for the solution to exist.
Solve the nonhomogeneous heat equation: ut-kuxx=sinx, 0<x<pi, t>0 u(0,t)=u(pi,t)=0, t>0 u(x,0)=0, 0<x<pi
Solve the nonhomogeneous heat equation: ut-kuxx=sinx, 0<x<pi, t>0 u(0,t)=u(pi,t)=0, t>0 u(x,0)=0, 0<x<pi
Explain why ρ is preferable to Cov(X,Y) in measuring the strength of relationship between X and...
Explain why ρ is preferable to Cov(X,Y) in measuring the strength of relationship between X and Y
A low regression means that: A)the regression is bad. B) there are other important factors that...
A low regression means that: A)the regression is bad. B) there are other important factors that influence the left hand variable. C) the SSR is low relative to the total variation in Y. D) tells you what the other factors influencing the left hand variable are.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT