Question

In: Statistics and Probability

Consider a logistic regression model with a linear predictor that includes an interaction term, say x’β...

  1. Consider a logistic regression model with a linear predictor that includes an interaction term, say x’β = β0 + β1x1 +β2x2+ β12x1x2. Does the odds ratio associated with the regressor x2 have the same interpretation as in the case where the linear predictor does not have the interaction term? Provide mathematical development to support your answer.

Solutions

Expert Solution

In all other model the regression coefficient of a variable corresponds to the change in log odds and its exponentiated form corresponds to the odds ratio. This is only true when our model does not have any interaction terms. When a model has interaction term(s) of two predictor variables, it attempts to describe how the effect of a predictor variable depends on the level/value of another predictor variable. The interpretation of the regression coefficients become more involved.

Let’s take a simple example. (Don't get involved too much into the data like from where it is coming and all. Just go for the logic since I just put these to make you understand clear.)  

logit(p) = log(p/(1-p))= β0 + β1*female + β2*math + β3*female*math

Logistic regression                               Number of obs   =        200

                                                  LR chi2(3)      =      62.94

                                                  Prob > chi2     =     0.0000

Log likelihood = -79.883301                       Pseudo R2       =     0.2826

------------------------------------------------------------------------------

         hon |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]

-------------+----------------------------------------------------------------

      female | -2.899863   3.094186    -0.94   0.349    -8.964357    3.164631

        math |   .1293781   .0358834     3.61   0.000     .0590479    .1997082

femalexmath |   .0669951     .05346     1.25   0.210    -.0377846    .1717749

   intercept | -8.745841    2.12913    -4.11   0.000    -12.91886   -4.572823

------------------------------------------------------------------------------

In the presence of interaction term of female by math, we can no longer talk about the effect of female, holding all other variables at certain value, since it does not make sense to fix math and femalexmath at certain value and still allow female change from 0 to 1!

In this simple example where we examine the interaction of a binary variable and a continuous variable, we can think that we actually have two equations: one for males and one for females. For males (female=0), the equation is simply

logit(p) = log(p/(1-p))= β0 + β2*math.

For females, the equation is

logit(p) = log(p/(1-p))= (β0 + β1) + (β2 + β3 )*math.

Now we can map the logistic regression output to these two equations. So we can say that the coefficient for math is the effect of math when female = 0. More explicitly, we can say that for male students, a one-unit increase in math score yields a change in log odds of 0.13. On the other hand, for the female students, a one-unit increase in math score yields a change in log odds of (.13 + .067) = 0.197. In terms of odds ratios, we can say that for male students, the odds ratio is exp(.13) = 1.14 for a one-unit increase in math score and the odds ratio for female students is exp(.197) = 1.22 for a one-unit increase in math score. The ratio of these two odds ratios (female over male) turns out to be the exponentiated coefficient for the interaction term of female by math: 1.22/1.14 = exp(.067) = 1.07.


Related Solutions

define the logistic regression model.
define the logistic regression model.
What are the similarities and differences between multiple linear regression, and logistic regression? Please consider Typical...
What are the similarities and differences between multiple linear regression, and logistic regression? Please consider Typical Application (used when), assumptions needed, Data Type
How does a simple linear model using one continuous predictor change if we add an interaction...
How does a simple linear model using one continuous predictor change if we add an interaction term with an indicator variable, but don’t include the indicator variable on its own?
Multinomial logistic regression can be used on: a)Categorical predictor variables only. b)Both categorical and continuous predictor...
Multinomial logistic regression can be used on: a)Categorical predictor variables only. b)Both categorical and continuous predictor variables. c)Continuous predictor variables only. d)Ordinal predictor variables only.
If a dependent variable is binary, is it optimal to use linear regression or logistic regression?...
If a dependent variable is binary, is it optimal to use linear regression or logistic regression? Explain your answer and include the theoretical and practical concerns associated with each regression model. Provide a business-related example to illustrate your ideas.
Consider the simple linear regression model y = 10+25x+e where the random error term is normally...
Consider the simple linear regression model y = 10+25x+e where the random error term is normally and independently distributed with mean zero and standard deviation 2. Do NOT use software, generate a sample of eight observations, one each at the levels x = 10, 12, 14, 16, 18, 20, 22, and 24. DO NOT USE SOFTWARE! A.Fit the linear regression model by least squares and find the estimates of the slope and intercept. B.Find the estimate of s^2. C. Find...
Please find one medical dataset that is suitable for correlation, logistic regression and linear regression.
Please find one medical dataset that is suitable for correlation, logistic regression and linear regression.
(Regression Analysis) Consider the simple linear regression model (I) and the corresponding scatter plot with the...
(Regression Analysis) Consider the simple linear regression model (I) and the corresponding scatter plot with the covariates XX (x-axis) vs residuals (y-axis). What model assumption violations can be diagnosed by this scatter plot?
Consider the simple linear regression model for which the population regression equation can be written in...
Consider the simple linear regression model for which the population regression equation can be written in conventional notation as: yi= βxi+ui as 1- Derive the Ordinary Least Squares estimator (OLS) of β (i.e. ˆβ) include in your answer details of the proof. 2- Give an interpretation of ˆβ
Consider the simple linear regression model for which the population regression equation can be written in...
Consider the simple linear regression model for which the population regression equation can be written in conventional notation as: yi= βxi+ui as 1- Derive the Ordinary Least Squares estimator (OLS) of β0 (i.e. ˆβ0) include in your answer details of the proof. 2- Give an interpretation of ˆβ0
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT