Question

In: Statistics and Probability

SLR/MLR model analysis topic: Is it always possible to get a model that is the best...

SLR/MLR model analysis topic:

Is it always possible to get a model that is the best one and has all the best properties? Why or why not?

Solutions

Expert Solution

Yes, it is always possible to get a model that is the best one and has all the best properties. It is only because of assumptions that it holds.

  • Assumption 1: Linearity

This is likely the most important assumption. If your model doesn't fit this assumption then it'll be very obvious that you need to go to a different model. This is the assumption that the relationship between X and Y is approximately linear in shape when plotted. This assumption wouldn't hold if the line was curved in any way, but if all other assumptions hold true then one might be able to plot a different regression line such as with a parabola.

  • Assumption 2: Independence of Errors

The errors in a model is the difference between the predicted linear regression line and the actual locations of each individual point that we're plotting. This assumption hold that the errors are not correlated and are instead independent of one another. This doesn't hold true if there's a correlation to the errors, which could be related to clusters of points instead of a line of distribution.

  • Assumption 3: Normality

The Normality assumption is that the errors follow a roughly Normal distribution with a mean of 0. This is to say that while there are outliers, the majority of the errors within the model can be found pretty near the line of regression. If we overlay a mini normal curve at different points along the linear regression line we can see the amount of distance we would expect most points to be. This is contrasted by the possibility of a uniform distribution of any kind in which all points are about the same distance from the line or if the distribution looked more like a rectangle or square.

  • Assumption 4: Equality of Variance (also known as): Homoscedasticity of Errors

This is the megaphone or cone assumption. The Homoscedasticity of Errors is stating that the errors along the line of regression have a consistent standard deviation. Usually this assumption will fair when as X and Y increase along the graph, the deviation of points grows larger as we move up the linear regression line. Thus leading to the points in the shape of a cone. As if megaphone is calling out to be payed attention to how dramatically this assumption is being broken. For MLR this is simply expanded for all lines or planes. If the linear regression version of this assumption focuses on the relationship of Xi, then the MLR focuses on the relationship of all Xi.

  • Assumption 5 just for MLR: Independence of Predictors

This is stating that the independent variables are not correlated. While this is skill likely to happen it is a good practice to avoid having too many Independent variables being correlated to one another as even in the best of situation it's likely that only one would be needed when performing a regression model. If you imagine all of the possible information that we could predict as a hot dog and each independent variable as a little hotdog eating creature, let us say that feature A takes a bit of the hot dog and is thus able to predict a portion of the data. If Feature A is heavily correlated to feature B then feature B can only take about as much of a bit out of the hotdog as feature A, however it was only ever able to take the same bit as feature A and is thus without hot dog. This is all to say that even if you can perform MLR with correlated features, it's kinda silly and pointless to do so.

Based on these assumptions that SLR/MLR hold, we can say that it's always possible to say that it gives the best model and have all the best properties.


Related Solutions

how one way of central tendency is not always the “best” way to get the one...
how one way of central tendency is not always the “best” way to get the one in the middle?
how one way of central tendency is not always the “best” way to get the one...
how one way of central tendency is not always the “best” way to get the one in the middle? thank you
The advertising agency promoting a new product is hoping to get the best possible exposure in...
The advertising agency promoting a new product is hoping to get the best possible exposure in terms of the number of people the advertising reaches. The agency will use a two-pronged approach: focused Internet advertising, which is estimated to reach 200,000 people for each burst of advertising, and print media, which is estimated to reach 80,000 people each time an ad is placed. The cost of each Internet burst is $3,000, as opposed to only $900 for each print media...
In ANOVA test, acceptance of the null hypothesis indicates that: The slope of the SLR model...
In ANOVA test, acceptance of the null hypothesis indicates that: The slope of the SLR model is null True                 b)   False         A manufacturer produces ball bearings, normally distributed, with unknown mean and standard deviation. A sample of 25 has a mean of 2.5cm. The 99% confidence interval has length 4cm (double-sided). Which statement is correct (use t-distribution): s2 = 23.42cm2        b) s2 = 12.82cm2              c)   s= 3.58cm                   d) s= 4.84cm      The 99% prediction interval measures (use...
Topic: exchange rate regime in developing countries (Russia and Kazakhstan) Is it possible that analysis of...
Topic: exchange rate regime in developing countries (Russia and Kazakhstan) Is it possible that analysis of country’s macroeconomics can be limited only at one country’s level? Why? Who are effected by your selected country’s macroeconomic indicator’s changes? How?
Topic: exchange rate regime in developing countries (Russia and Kazakhstan) Is it possible that analysis of...
Topic: exchange rate regime in developing countries (Russia and Kazakhstan) Is it possible that analysis of country’s macroeconomics can be limited only at one country’s level? Why?
A MLR model have LIFE (y) as the response variable, and MALE (x1), BIRTH (x2), DIVO...
A MLR model have LIFE (y) as the response variable, and MALE (x1), BIRTH (x2), DIVO (x3), BEDS (x4), EDUC (x5), and INCO (x6), as predictors. I know you can use first fit the model using lm(y~x) then use anova(model) to check the SSreg,my question is, what is the difference between  SSreg(β2|β0,β3) and SSreg(β1|β0,β3,β2)? What should you put as the argument of lm() function with respect to (β2|β0,β3) and (β1|β0,β3,β2)
Run a regression analysis and find a best model to predict White Speck count from cotton...
Run a regression analysis and find a best model to predict White Speck count from cotton fiber properties given to you. Make sure to show all your steps on how you came up with the best model. Word or text file. Harvdate date of cotton Cotton fiber Length Cotton fiber Strength Short fiber content Cotton fineness Immature fiber content Cotton trash count Cotton dust count Cotton nep count y=White Specks 1 1.06 31.8 21.8 196 6.36 75 404 253 17.8...
Growth for the sake of growth is not always in the best interest of the firm...
Growth for the sake of growth is not always in the best interest of the firm and shareholder value. What are some of the things the firm can do to increase its growth potential, and when might an action to increase growth be contrary to the interest of increasing the firm's value?
Growth for the sake of growth is not always in the best interest of the firm...
Growth for the sake of growth is not always in the best interest of the firm and shareholder value. What are some of the things the firm can do to increase its growth potential, and when might an action to increase growth be contrary to the interest of increasing the firm's value?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT