Question

In: Statistics and Probability

Explain models with binary independent variables and possible issues with them.

Explain models with binary independent variables and possible issues with them.

Solutions

Expert Solution

A binary variable can assume only two values. Numerically, it is usually represented as 0 or 1.

Logistic regression is a statistical model that in its basic form uses a logistic function to model a binary dependent variable, although many more complex extensions exist. In regression analysis, logistic regression is estimating the parameters of a logistic model (a form of binary regression). Mathematically, a binary logistic model has a dependent variable with two possible values, such as pass/fail which is represented by an indicator variable, where the two values are labeled "0" and "1".

But there are some real world problems in implementing logistic regression discussed below,

1) Logistic Regression is also not one of the most powerful algorithms out there and can be easily outperformed by more complex ones.

Also, we can’t solve non-linear problems with logistic regression since it’s decision surface is linear.
2) Logistic regression will not perform well with independent variables that are not correlated to the target variable and are very similar or correlated to each other that is we can’t solve non-linear problems with logistic regression since it’s decision surface is linear. Just take a look at the example,

3) Your likelihood function won’t converge if there is full separation in the data.
4) You’re assuming a specific functional form, and in particular monotonicity.
5) There are complications with heteroskedastic and clustered standard errors.
6) If you’re using fixed effects in a panel model with a short time dimension, your estimates will be severely biased.
7) You can’t use it as the first stage of an IV without committing the sin of “forbidden regression.”
Most of these complaints are true of any nonlinear parametric model, and the logit does have some nice properties. For example, it is globally concave.

8) verfitting the modelL

9) limited to Linear relationships between variables

10) Sensitivity to Outliers
11) Large Sample Size


Related Solutions

what are the possible safety issues of smartwatch and how to tackle them ? ( at...
what are the possible safety issues of smartwatch and how to tackle them ? ( at least 5)
explain the telecommunication issues encountered by individuals with disabilities and the possible solutions for these issues...
explain the telecommunication issues encountered by individuals with disabilities and the possible solutions for these issues My major is rehabilitation services. the class is assistuve technology. the book is assistive technology for people with disabilities
Assume you run a regression on two different models (sets of independent variables). The first model...
Assume you run a regression on two different models (sets of independent variables). The first model results in an ?2R2 of 0.60 and the second model results in an ?2R2 of 0.64. This would mean that Select one: a. The independent variables in the first model are not statistically significant, despite what the t-stats and p-values suggest. b. The second model has better overall explanatory power, but a better ?2R2 is only gained at the expense of other factors. The...
Xn are independent random variables and each one of them has a normal distribution with mean...
Xn are independent random variables and each one of them has a normal distribution with mean 0 and variance 1 what is the distribution of X-bar if n=9 what is the probability of X-bar =<3 if n=9
Refresher on Variables: Classifying variables as independent or dependent: The independent variable is the one being...
Refresher on Variables: Classifying variables as independent or dependent: The independent variable is the one being manipulated or grouped in the study for comparison. A tip to remember this is to think that independent starts with “I” and I, the researcher, manipulate that variable or have control over who is grouped in the study. The way we will be using independent variables in this course is in terms of between group analysis, so our focus will be on groups being...
Explain a typical dynamic lag equation, issues with it and possible solutions.
Explain a typical dynamic lag equation, issues with it and possible solutions.
what are dependent and independent variables
what are dependent and independent variables
For each of the following biological issues explain: (a) possible ethical issues involved; (b) two sides...
For each of the following biological issues explain: (a) possible ethical issues involved; (b) two sides or views of the issue; (c) possible separation of scientific versus societal view and opinions. 1. Cloning 3. Rare and endangered species conservation 4. Sea level rise
Please analyze the possible revenue models of WeChat, and briefly explain your conclusion.
Please analyze the possible revenue models of WeChat, and briefly explain your conclusion.
Explain security and privacy issues in an organization and recommend on how to deal with them
Explain security and privacy issues in an organization and recommend on how to deal with them
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT