Question

In: Statistics and Probability

In this chapter, you learn four steps that should be used to evaluate a regression model....

In this chapter, you learn four steps that should be used to evaluate a regression model. What is the first step and why is it important? Explain the other three steps, indicating what you learn from each of those three steps.

Solutions

Expert Solution

The word 'Regression' means 'retrograde motion' or 'going back'.It was first used by the British biometrician Sir Francis Galton(1822-1911) who used this in his study of relation between heights of parents and children. Regression Analysis means  th predition done to check the relationship between two variables , one is the dependent and the other is the independent variable.

There are different kind of regression models;

  • Linear Regression
  • Logistic regression
  • Polynomial regression
  • Stepwise Regression
  • Ridge Regression
  • Lasso Regression

etc...

The type of regression depends on the intensity of the relatiionship between the variable. However the most commonly used regression model is Linear Regression in which the relationship can be easly picturised through a scatter diagram, and the line about which the points in the scatter diagram cluster may be represented by an equation of the form y = a+bx or x = c+dy.

The four main metrices that are used to evaluate regression models are as follows;

  • Mean Squared Error(MSE) : It is simply the average of the squared difference between th dependend value and the independent value in the regression model.As it squares the differences, it penalizes even a small error that leads to over estimation of how bad the model is. It is preferred more than other metrices because it is differentiable and hence can be optimized.
  • Root Mean squared Error: It is the average square root of the difference between the dependend value and the independent value in a regression model.This is preffered in some cases because the errors are squared before the averaging.
  • Mean absolute Error: It is the mean absolute value between the dependend value and independent valu.
  • R^2 or coefficient of determination

Related Solutions

When should a regression model be used to make a prediction? When should it not be...
When should a regression model be used to make a prediction? When should it not be used to make a prediction? What value tells us how good the prediction will be?
1. A multiple linear regression model should not be used if: A The variables are all...
1. A multiple linear regression model should not be used if: A The variables are all statistically significant. B The coefficient of determination R2 is large. C Both of the above. D Neither of the above. 2. Consider a multiple linear regression model where the output variable is a company's revenue for different months, and the purpose is to investigate how the revenue depends upon the company's advertising budget. The input variables can be time-lagged so that the first input...
outline at least four criteria that should be used to evaluate approaches to performance measurement, and...
outline at least four criteria that should be used to evaluate approaches to performance measurement, and critically evaluate one approach using the criteria you specified. Which of the four criteria you identified do you consider the most and least important? What are the benefits and limitations of the performance measurement approach you chose, according to these criteria?
An article used a multiple regression model with four independent variables to study accuracy in reading...
An article used a multiple regression model with four independent variables to study accuracy in reading liquid crystal displays. The variables were y = error percentage for subjects reading a four-digit liquid crystal display x1 = level of backlight (ranging from 0 to 122 cd/m2) x2 = character subtense (ranging from 0.025° to 1.34°) x3 = viewing angle (ranging from 0° to 60°) x4 = level of ambient light (ranging from 20 to 1500 lux) The model fit to data...
When fitting a multiple regression model, you should check for independence of observations and the absence...
When fitting a multiple regression model, you should check for independence of observations and the absence of multicollinearity. Discuss how you would check appropriate statistics and/or plots. (a) When we run a multiple regression, we hope to be able to generalize the sample model to the entire population. To do this, several assumptions must be met including:  No Multicollinearity  Homoscedasticity  Independent Errors  Normally-distributed Errors Explain what is meant by each of these assumptions and describe the...
4. How should a researcher specify a multiple linear regression model? In answering this question, you...
4. How should a researcher specify a multiple linear regression model? In answering this question, you should be sure to draw on your overall knowledge of econometrics in order to consider not only how a researcher should specify an initial model, but also how a researcher should assess the performance of an initial model and thereby iterate towards a final model.
When should logistic regression be used for data analysis? What is the assumption of logistic regression?...
When should logistic regression be used for data analysis? What is the assumption of logistic regression? How to explain odds ratio?
What steps are used to evaluate the evidence within clinical practice guidelines?
What steps are used to evaluate the evidence within clinical practice guidelines?
What is a fitted value for a multiple regression model and the data that is used...
What is a fitted value for a multiple regression model and the data that is used to create it? Select one. -It is the difference between the actual value of the response variable and the corresponding predicted value (regression error) using the multiple regression model. -It is a statistic that explains the relationship between response and predictor variables. -It is a statistic that is used to evaluate the significance of the multiple regression model. -It is the predicted value of...
Review Porter's Five-Forces Model in Chapter 3.  Evaluate each force in the model as either a weak,...
Review Porter's Five-Forces Model in Chapter 3.  Evaluate each force in the model as either a weak, moderate or strong competitive force based on the criteria in the text and PPs and within the context of your general knowledge of the world-wide automobile industry.  Based on your evaluation, give an overall assessment on the intensity of competition (either weak, moderate and strong) in the world-wide automobile industry. Hint: as your text indicates, the rivalery among competing sellers, in the center of the...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT