Question

In: Statistics and Probability

When the response variable of interest is dichotomous rather than continuous, why is it not advisable...

When the response variable of interest is dichotomous rather than continuous, why is it not advisable to fit a standard linear regression model using the probability of success as the outcome?

Solutions

Expert Solution

Answer:

Command:

Statistics
Regression
Logistic regression
Description

Logistic regression is a statistical method for analyzing a dataset in which there are one or more independent variables that determine an outcome. The outcome is measured with a dichotomous variable (in which there are only two possible outcomes).

In logistic regression, the dependent variable is binary or dichotomous, i.e. it only contains data coded as 1 (TRUE, success, pregnant, etc. ) or 0 (FALSE, failure, non-pregnant, etc.).

The goal of logistic regression is to find the best fitting (yet biologically reasonable) model to describe the relationship between the dichotomous characteristic of interest (dependent variable = response or outcome variable) and a set of independent (predictor or explanatory) variables. Logistic regression generates the coefficients (and its standard errors and significance levels) of a formula to predict a logit transformation of the probability of presence of the characteristic of interest:

where p is the probability of presence of the characteristic of interest. The logit transformation is defined as the logged odds:

and

Rather than choosing parameters that minimize the sum of squared errors (like in ordinary regression), estimation in logistic regression chooses parameters that maximize the likelihood of observing the sample values.

Required input

The MedCalc dialog box for logistic regression is similar to the one for multiple regression. In the dialog box you first identify the dependent variable. Remember that the dependent variable must be binary or dichotomous, and it should only contains data coded as 0 or 1. Cases with values other than 0 or 1 for the dependent variable will be excluded from the analysis!

For the independent variables you enter the names of variables that you expect to influence the dependent variable.

You can click the button to obtain a list of variables. In this list you can select a variable by clicking the variable's name.

Options

Method: select the way independent variables are entered into the model.
Enter: enter all variables in the model in one single step, without checking
Forward: enter significant variables sequentially
Backward: first enter all variables into the model and next remove the non-significant variables sequentially
Stepwise: enter significant variables sequentially; after entering a variable in the model, check and possibly remove variables that became non-significant.
Enter variable if P<
A variable is entered into the model if its associated significance level is less than this P-value.

Remove variable if P>
A variable is removed from the model if its associated significance level is greater than this P-value.

Classification table cutoff value: a value between 0 and 1 which will be used as a cutoff value for a classification table. The classification table is a method to evaluate the logistic regression model. In this table the observed values for the dependent outcome and the predicted values (at the selected cut-off value) are cross-classified.
Categorical: click this button to identify nominal categorical variables.
Results

After you click the OK button, the following results are displayed:


Related Solutions

Why are the slabs for composite construction designed as simply supported rather than continuous?
Why are the slabs for composite construction designed as simply supported rather than continuous?
Why is the use of OLS regression inappropriate when the dependent variable is dichotomous? explain one...
Why is the use of OLS regression inappropriate when the dependent variable is dichotomous? explain one technique you can use instead, clearly stating how to interpret the estimated regression coefficients.   
• Define each type of variable: dichotomous, ordinal, categorical, continuous • Define the following study designs:...
• Define each type of variable: dichotomous, ordinal, categorical, continuous • Define the following study designs: Randomized controlled trial, prospective cohort study, case-control study, crossover study. • Define in dependent versus independent samples.
Why do firms borrow capital that has to be repaid with interest rather than finance a...
Why do firms borrow capital that has to be repaid with interest rather than finance a firm with 100% equity?
why we use absorption costing rather than variable costing? if absorbing costung method will allow for...
why we use absorption costing rather than variable costing? if absorbing costung method will allow for over producing then why does CAAP allow this methid? kindly please write at least longer than 3 paragraphs explaning everything carefully. please only do this if you can write longer than 3 paragraphs and explain it well. thank you ! please kindly also explain why absorption is GAAP and variable is not, and the advantages and disadventages of the two . i spelled GAAP...
What are the advantages and disadvantages of using ranks rather than continuous measurements to conduct tests...
What are the advantages and disadvantages of using ranks rather than continuous measurements to conduct tests of hypotheses?
What is a random variable? Why is it important? What is discrete random variable? Continuous random?...
What is a random variable? Why is it important? What is discrete random variable? Continuous random? What is an expected value?
Why are plasma membranes arranged as a bilayer rather than a monolayer?
Why are plasma membranes arranged as a bilayer rather than a monolayer?
1. When you try to transfer bacteria from one plate to another, why is it advisable...
1. When you try to transfer bacteria from one plate to another, why is it advisable to pick one single colony? 2. Explain how you would measure bacterial growth in agar plate and in broth culture (test tube). Polymerase chain reaction is not the answer 3. Catalase and cytochrome c oxidase are two enzymes that are used to type or identify bacteria. What type(s) of bacteria would test positive for these two enzymes?
when is it more convient to use que theory rather than simulations
when is it more convient to use que theory rather than simulations
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT