Question

In: Math

1. You are given with the regression result that shows the regression model with k variables....

1. You are given with the regression result that shows the regression model with k variables. Answer the following parts:

a) How do you tell that a certain variable is influential?

b) Suppose the theoretical issue said there exist a linear constraint, how do you figure out the constraint holds?

c) Suppose you have two sets of explanatory variables; how did you consider which set is the better one?

d) What’s the meaning of R-squared? Should we always look for the model that has the high R-squared?

e) Suppose the explanatory variables are subject to linear dependence among themselves, what is the correct procedure to estimate the coefficients?

Solutions

Expert Solution

a) We can test for the influence/ significance of a certain variable using t - test. If the p value of the t statistic is less than 0.05, we conclude null hypothesis is rejected (H0: bi=0 versus Hi: bi not equal 0) that the certain variable is different from zero. This means that it is influential / significant.

b) We can test for the presence of a linear constraint of the following form using restricted regression.

c) Out of the two explanatory variables, the better one will be chosen based on more significance, whose p value is lesser than the other, that will be considered more better.

d) R square calculates coefficient of determination which measures the goodness of fit of the model. Suppose R square is 0.81, this would mean that 81% of variation in dependent variable is explained by independent variables. We don't often seek high value of R square as id depends on the purpose of the study and the field also. for example in Physics, 0.81 R square might be too low but in Psychology, 0.45 might be a very high R square.

e) If explanatory variables are subject to linear dependence among themselves, it results in the problem of multicollinearity under which the OLS estimators do not perform well in terms of variance. So, methods to get rid of multicollinearity must be adopted before proceeding for estimation.


Related Solutions

1. A multiple linear regression model should not be used if: A The variables are all...
1. A multiple linear regression model should not be used if: A The variables are all statistically significant. B The coefficient of determination R2 is large. C Both of the above. D Neither of the above. 2. Consider a multiple linear regression model where the output variable is a company's revenue for different months, and the purpose is to investigate how the revenue depends upon the company's advertising budget. The input variables can be time-lagged so that the first input...
Given the following linear programming model, answer the questions that follow. You are given the result...
Given the following linear programming model, answer the questions that follow. You are given the result of a computer program. The results are Maximize 9 X1 + 12 X2 + 10 X3 Subject to: Machine Constraint:   3 X1 + 4 X2 + 3 X3 < 160 Labor Constraint:        6 X1 + 10 X2 + 4 X3 < 288 Materials Constraint: 2 X1 + 2 X2 + 7 X3 < 200 Product 2 Constraint: X1 < 16 OPTIMAL SOLUTION Objective Function Value...
Given the following linear programming model, answer the questions that follow. You are given the result...
Given the following linear programming model, answer the questions that follow. You are given the result of a computer program. The results are Maximize 9 X1 + 12 X2 + 10 X3 Subject to: Machine Constraint:   3 X1 + 4 X2 + 3 X3 < 160 Labor Constraint:        6 X1 + 10 X2 + 4 X3 < 288 Materials Constraint: 2 X1 + 2 X2 + 7 X3 < 200 Product 2 Constraint: X1 < 16 OPTIMAL SOLUTION Objective Function Value...
In a normal simple linear regression model you are given that the variance of the error...
In a normal simple linear regression model you are given that the variance of the error term is 4. Four observations are taken, in which the X values are X=-2,1,2,3. Calculate the variance in the estimate for the slope coefficient.
Explain why you choose multiple regression with dummy variables but not linear trend model and why...
Explain why you choose multiple regression with dummy variables but not linear trend model and why do you believe this technique is appropriate to forecast your data?
Assume you run a regression on two different models (sets of independent variables). The first model...
Assume you run a regression on two different models (sets of independent variables). The first model results in an ?2R2 of 0.60 and the second model results in an ?2R2 of 0.64. This would mean that Select one: a. The independent variables in the first model are not statistically significant, despite what the t-stats and p-values suggest. b. The second model has better overall explanatory power, but a better ?2R2 is only gained at the expense of other factors. The...
6) Suppose a multinomial regression model has two continuous explanatory variables ?1 and ?2 ,and they...
6) Suppose a multinomial regression model has two continuous explanatory variables ?1 and ?2 ,and they are represented in the model by their linear and interaction terms. a) For a ? unit increase in ?1, derive the corresponding odds ratio that compares a category ? response to a category 1 response. Show the form of the variance that would be used in a Wald confidence interval. b) Repeat this problem for a proportional odds regression model.
what variables are used for a regression model. in regards to finding out the outcome of...
what variables are used for a regression model. in regards to finding out the outcome of a loan application at a financial institution.
for a fixed sample size as the number of indeoendent variables in a regression model increases...
for a fixed sample size as the number of indeoendent variables in a regression model increases the power of the regression decreases. T or F The width of a 95% confidence interval around a relative risk increases as the sample size decreases. T or F
Assignment on Multiple Linear Regression The Excel file BankData shows the values of the following variables...
Assignment on Multiple Linear Regression The Excel file BankData shows the values of the following variables for randomly selected 93 employees of a bank. This real data set was used in a court lawsuit against discrimination. Let = starting monthly salary in dollars (SALARY), = years of schooling at the time of hire (EDUCAT), = number of months of previous work experience (EXPER), = number of months that the individual was hired (MONTHS), = dummy variable coded 1 for males...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT