Question

In: Civil Engineering

a) What is the primary difference between the dependent variable of an OLS regression model and a logit model?

b) What is the primary difference between the model results of an OLS regression model and a logit model?

Solutions

Expert Solution

Answer:

a)

Definition of Linear Regression

Linear regression involves a continuous dependent variable; the independent variables can be continuous or discrete. It fits a best-fit straight line that relates the dependent variable (Y) to one or more independent variables (X). In other words, the model assumes a linear relationship between the independent and dependent variables.

The difference between simple and multiple linear regression is that simple linear regression contains only one independent variable, while multiple regression contains more than one. The best-fit line in linear regression is obtained by the least squares method.

The following equation is used to represent a linear regression model:

$Y = b_0 + b_1 X + e$

where $b_0$ is the intercept, $b_1$ is the slope of the line, and $e$ is the error. Here Y is the dependent variable and X is an independent variable.
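As a minimal sketch of the least squares method for the one-variable model Y = b0 + b1·X + e, the slope and intercept can be computed from the sample means. The function name `fit_simple_ols` and the toy data are assumptions made up for illustration; they do not come from the question.

```python
def fit_simple_ols(xs, ys):
    """Return (b0, b1) that minimise the sum of squared residuals."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Slope: co-variation of X and Y divided by the variation of X.
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    b1 = sxy / sxx
    # Intercept: forces the line through the point (x_mean, y_mean).
    b0 = y_mean - b1 * x_mean
    return b0, b1


# Hypothetical data generated from Y = 1 + 2X with no noise.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 5.0, 7.0, 9.0, 11.0]

b0, b1 = fit_simple_ols(xs, ys)
print(b0, b1)  # 1.0 2.0
```

Because the toy data lie exactly on a line, the fit recovers the intercept and slope exactly; with real, noisy data the residuals would be non-zero.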

The following graph can be used to show the linear regression model.

Definition of Logistic Regression

Logistic regression involves a dependent variable that takes binary values (0 or 1, true or false, yes or no), meaning the outcome can take only one of two forms. For example, it can be used to estimate the probability that an event succeeds or fails. The same linear formula is used, but it is passed through a sigmoid function, so the predicted value ranges from 0 to 1.

Logistic regression equation:

$P(Y=1) = \dfrac{1}{1 + e^{-(b_0 + b_1 X)}}$
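The effect of the sigmoid function can be sketched in a few lines: it takes the linear predictor b0 + b1·X, which can be any real number, and maps it to a value between 0 and 1. The function names and the coefficient values below are hypothetical, chosen only for illustration.

```python
import math


def sigmoid(z):
    """Map a real number z to a value between 0 and 1."""
    # Two branches avoid overflow in math.exp for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predicted_probability(x, b0, b1):
    """P(Y = 1) for a one-predictor logistic model."""
    return sigmoid(b0 + b1 * x)


# Hypothetical coefficients: the linear predictor is 0 at x = 2,
# so the predicted probability there is exactly one half.
print(predicted_probability(2.0, b0=-1.0, b1=0.5))  # 0.5
```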

The following graph can be used to show the logistic regression model.

Comparison Chart

| Basis for comparison | Linear regression | Logistic regression |
| --- | --- | --- |
| Basic | The data is modelled using a straight line. | The probability of an event is modelled as a function of a linear combination of predictor variables. |
| Linear relationship between dependent and independent variables | Required. | Not required (the model is linear in the log-odds instead). |
| Independent variables | May be correlated with each other (especially in multiple linear regression), although strong multicollinearity makes coefficient estimates unstable. | Should not be strongly correlated with each other (little or no multicollinearity). |

Difference Between Linear and Logistic Regression

Linear and logistic regression are the most basic and commonly used forms of regression. The essential difference between the two is that logistic regression is used when the dependent variable is binary, whereas linear regression is used when the dependent variable is continuous and the regression line is linear.

Regression is a technique used to predict the value of a response (dependent) variable from one or more predictor (independent) variables, where the variables are numeric. There are various forms of regression, such as linear, multiple, logistic, polynomial, and non-parametric.

Key Differences Between Linear and Logistic Regression

  1. Linear regression models a continuous numeric outcome, whereas logistic regression models a binary outcome.
  2. Linear regression requires a linear relationship between the dependent and independent variables. Logistic regression does not, although it assumes that the log-odds of the outcome are linear in the predictors.
  3. In linear regression the independent variables may be correlated with each other, whereas in logistic regression they should not be strongly correlated. In practice, strong multicollinearity makes coefficient estimates unstable in both models.
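The first difference in the list can be illustrated with a rough sketch that fits both models to the same binary outcome. The data are made up, and the logit coefficients are estimated with a plain Newton-Raphson loop written out by hand rather than with any particular statistics library, so treat this as an illustration under those assumptions rather than a recommended workflow.

```python
import numpy as np

# Hypothetical data: one predictor and a 0/1 outcome (not perfectly separable).
x = np.arange(10, dtype=float)
y = np.array([0, 0, 0, 1, 0, 0, 1, 1, 1, 1], dtype=float)
X = np.column_stack([np.ones_like(x), x])  # intercept column plus x

# OLS: least squares line fitted directly to the 0/1 values.
b_ols, *_ = np.linalg.lstsq(X, y, rcond=None)

# Logit: maximum likelihood estimate via Newton-Raphson iterations.
b_logit = np.zeros(2)
for _ in range(25):
    p = 1.0 / (1.0 + np.exp(-X @ b_logit))
    gradient = X.T @ (y - p)
    hessian = X.T @ (X * (p * (1.0 - p))[:, None])
    b_logit = b_logit + np.linalg.solve(hessian, gradient)

# Predict far outside the observed range of x.
x_new = np.array([-5.0, 20.0])
X_new = np.column_stack([np.ones_like(x_new), x_new])
ols_pred = X_new @ b_ols
logit_pred = 1.0 / (1.0 + np.exp(-X_new @ b_logit))

print("OLS predictions:  ", ols_pred)    # unbounded: below 0 and above 1 here
print("Logit predictions:", logit_pred)  # always inside (0, 1)
```

The OLS output is a predicted value of Y on an unbounded scale, while the logit output is a probability that Y = 1, which is why the two sets of model results are read differently.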

b) Difference between linear regression and logistic regression

Linear regression uses the general linear equation $Y = b_0 + \sum_i b_i X_i + \epsilon$, where $Y$ is a continuous dependent variable and the independent variables $X_i$ are usually continuous (but can also be binary, e.g. when the linear model is used in a t-test) or take values in other discrete domains. $\epsilon$ is a term for the variance that is not explained by the model and is usually just called "error". Individual dependent values, denoted by $Y_j$, can be obtained by modifying the equation a little: $Y_j = b_0 + \sum_i b_i X_{ij} + \epsilon_j$

Logistic regression is another generalized linear model (GLM) procedure using the same basic formula, but instead of the continuous $Y$, it regresses for the probability of a categorical outcome. In its simplest form, this means that we're considering just one outcome variable and two states of that variable: either 0 or 1.

The equation for the probability of $Y=1$ looks like this:

$P(Y=1) = \dfrac{1}{1 + e^{-(b_0 + \sum_i b_i X_i)}}$
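A short derivation, written here as a worked step rather than taken from the original answer, shows why this model is still "linear": rearranging the probability equation gives a log-odds that is a linear function of the predictors. The shorthand $z$ for the linear predictor is introduced only for this derivation.

```latex
\begin{align*}
z &= b_0 + \sum_i b_i X_i, \qquad P(Y=1) = \frac{1}{1 + e^{-z}} \\
1 - P(Y=1) &= \frac{e^{-z}}{1 + e^{-z}} \\
\frac{P(Y=1)}{1 - P(Y=1)} &= e^{z} \\
\ln\!\left(\frac{P(Y=1)}{1 - P(Y=1)}\right) &= b_0 + \sum_i b_i X_i
\end{align*}
```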

Your independent variables $X_i$ can be continuous or binary. The regression coefficients $b_i$ can be exponentiated to give you the change in odds of $Y$ per change in $X_i$, i.e., $\text{Odds} = \dfrac{P(Y=1)}{P(Y=0)} = \dfrac{P(Y=1)}{1 - P(Y=1)}$ and $\Delta\text{Odds} = e^{b_i}$. $\Delta\text{Odds}$ is called the odds ratio, $\dfrac{\text{Odds}(X_i + 1)}{\text{Odds}(X_i)}$. In English, you can say that the odds of $Y=1$ change by a factor of $e^{b_i}$ per unit change in $X_i$ (an increase when $b_i > 0$).
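The odds-ratio interpretation can be checked numerically. In the sketch below the coefficients `b0` and `b1` are hypothetical values, not estimates from any real data set; the point is only that the ratio of odds one unit apart equals e^{b1} regardless of where X starts.

```python
import math


def probability(x, b0, b1):
    """P(Y = 1) under a one-predictor logistic model."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))


def odds(x, b0, b1):
    """Odds of Y = 1, i.e. P(Y = 1) / P(Y = 0)."""
    p = probability(x, b0, b1)
    return p / (1.0 - p)


# Hypothetical coefficients, chosen only for illustration.
b0, b1 = -4.0, 0.1

ratio_low = odds(26.0, b0, b1) / odds(25.0, b0, b1)
ratio_high = odds(31.0, b0, b1) / odds(30.0, b0, b1)

# Both ratios match exp(b1) up to floating-point rounding.
print(ratio_low, ratio_high, math.exp(b1))
```

Note that the change in probability per unit of X is not constant, only the multiplicative change in odds is.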

Example: If you wanted to see how body mass index (BMI) predicts blood cholesterol (a continuous measure), you'd use linear regression as described at the top of my answer. If you wanted to see how BMI predicts the odds of being diabetic (a binary diagnosis), you'd use logistic regression.

