Question

In: Statistics and Probability

Please write the answer in word form so i can copy What is the purpose of...

Please write the answer in word form so i can copy

  1. What is the purpose of saying “all-else-equal” or “ceteris paribus” when it comes to multiple linear regression?
  2. Why is the F-test important in multiple linear regression more so than in simple linear regression?
  3. What is the difference between the AIC and BIC?
  4. Why is outlier analysis so important?
  5. What is the danger in using stepwise regression?
  6. What is the difference between Cook’s Distance and DFFITS?
  7. What can you do to remove multicollinearity?
  8. What is a VIF?
  9. If you have to create dummy variables for the seven continents of the world, how many columns do you create and why?
  10. Why do we standardize residuals?
  11. Describe a QQ plot and what it can tell us.

Solutions

Expert Solution

1. What is the purpose of saying “all-else-equal” or “ceteris paribus” when it comes to multiple linear regression?

Ceteris paribus is latin and means the others equal, i.e. all other variables fixed.  In context, this means: keeping all other variables fixed.

Multiple linear regression analysis is an extension of simple linear regression analysis, used to assess the association between two or more independent variables and a single continuous dependent variable. The multiple linear regression equation is as follows:

,

whereis the predicted or expected value of the dependent variable, X1 through Xp are p distinct independent or predictor variables, b0 is the value of Y when all of the independent variables (X1 through Xp) are equal to zero, and b1 through bp are the estimated regression coefficients. Each regression coefficient represents the change in Y relative to a one unit change in the respective independent variable. In the multiple regression situation, b1, for example, is the change in Y relative to a one unit change in X1, holding all other independent variables constant (i.e., when the remaining independent variables are held at the same value or are fixed).

2. Why is the F-test important in multiple linear regression more so than in simple linear regression?

F-test in regression compares the fits of different linear models. Unlike t-tests that can assess only one regression coefficient at a time, the F-test can assess multiple coefficients simultaneously. The F-test of the overall significance is a specific form of the F-test. It compares a model with no predictors to the model that you specify.

3. What is the difference between the AIC and BIC?

AIC and BIC are widely used in model selection criteria. AIC means Akaike’s Information Criteria and BIC means Bayesian Information Criteria. Though these two terms address model selection, they are not the same. One can come across may difference between the two approaches of model selection.The AIC can be termed as a mesaure of the goodness of fit of any estimated statistical model. The BIC is a type of model selection among a class of parametric models with different numbers of parameters.

Akaike’s Information Criteria generally tries to find unknown model that has high dimensional reality. This means the models are not true models in AIC. On the other hand, the Bayesian Information Criteria comes across only True models. It can also be said that Bayesian Information Criteria is consistent whereas Akaike’s Information Criteria is not so. When Akaike’s Information Criteria will present the danger that it would outfit. the Bayesian Information Criteria will present the danger that it would underfit. Though BIC is more tolerant when compared to AIC, it shows less tolerance at higher numbers.

4. Why is outlier analysis so important?

Outliers can range from being unimportant to being really important.

  • Outliers are unimportant if they capture inaccurate information, and/or if they carry little weight in the analysis.
  • Outliers are really important if they carry a lot of weight, and/or if they give you important information that the more “normal” data don’t.

If it can be determined that an outlying point is in fact erroneous, then the outlying value should be deleted from the analysis (or corrected if possible).

In some cases, it may not be possible to determine if an outlying point is bad data. Outliers may be due to random variation or may indicate something scientifically interesting. In any event, we should not simply delete the outlying observation before a through investigation. In running experimdnts , we may repeat the experiment. If the data contains significant outliers, we may need to consider the use of robust statistical techniques.

Please raise separate requests for other questions.


Related Solutions

Please write using computer/phone's keyboard, so I can copy the answer directly. Showing image is optional....
Please write using computer/phone's keyboard, so I can copy the answer directly. Showing image is optional. I need plagiarism free answers pls. Will upvote for sure Answer the following questions about PWM (Pulse-Width Modulation) and write around 6-7 lines per question.    Do a research about PWM and its applications. Choose one of PWM applications and write about it in more details.           
please I want it to step by step and in word posted so I can read...
please I want it to step by step and in word posted so I can read them. Q: If the average realized return of a portfolio is 27.5% per year, the standard deviation of returns is 50%, the portfolio beta is 1.25, the average return of Treasury bills over the same period is 2.5% per year, and the average return on the market is 12.5% per year Calculate i) the Sharpe; ii) Treynor and iii) Jensen    
please answer with explanation using Microsoft word and copying and pasting the answer here so i...
please answer with explanation using Microsoft word and copying and pasting the answer here so i can easily copy and paste on my word document. thank you 9) A researcher has developed the following regression equation to predict the prices of luxurious Oceanside condominium units, = 40 + 0.15Size + 50View, where Price is the price of a unit (in $1,000s), Size is the square footage (in sq. feet), and View is a dummy variable taking on 1 for an...
Can you please write this in python and comment as well so I can understand what...
Can you please write this in python and comment as well so I can understand what yo are doing. Thank you. 1)Loop through list A, at each iteration, show the square root result of that element. Add proper text to your print function. A = [-4, 1, -16, 36, -49, 64, -128] 2)Create a counter variable, name it i and initialize it to 0. Using a for loop, count how many numbers are divisible by 3 in range of 1...
Please explain answer so I can understand. 1. Write a C program that prints out its...
Please explain answer so I can understand. 1. Write a C program that prints out its command line arguments, one to a line. Discussion: A C program starts at a function called main. main has two arguments, conventionally named argc and argv. argc is of type int; it’s the argument count, an integer that contains the number of command line arguments (including the program name). argv is the argument vector; it’s an array of pointers to characters, In C this...
Please answer do not use picture and handwriting, please typing word. copy other answer 2.State and...
Please answer do not use picture and handwriting, please typing word. copy other answer 2.State and Local Government face three fundamental fiscal choices . Please list these choices , and using an example in a specified government program , explain how these choices interact with each other , if at all 1 ) The correspondence principle provides an understanding of how the services ( or public goods ) of government may be organized and delivered . Please describe this principle...
I NEED 2 PARAGRAPHS FOR THIS> TYPING SO I CAN COPY IT Why is an investment...
I NEED 2 PARAGRAPHS FOR THIS> TYPING SO I CAN COPY IT Why is an investment more attractive to management if it has a shorter payback period? Identify two disadvantages of using the payback period for comparing investments.
Hey can you please answer this question in detail and explanation in your own word i...
Hey can you please answer this question in detail and explanation in your own word i need to post in discussion board Perform a Internet search upon the term and concept black box as it pertains to computer science. Why is this an important concept within software design and development? Why is this concept important within the context of code reuse? Be sure to give thought to each assigned question before posting.
Hey can you please answer this question in detail and explanation in your own word i...
Hey can you please answer this question in detail and explanation in your own word i need to post in discussion board Are there ethical issues to consider when designing a database? For example, should highly sensitive personal data (such as patient medical information) be stored in the same database that manages and contains office supply inventories? Why or why not? Suggest three situations where referential integrity and relationships, inherent in a relational database management system (RDBMS), avoids data problems....
Hey can you please answer this question in detail and explanation in your own word i...
Hey can you please answer this question in detail and explanation in your own word i need to post in discussion board If you were to perform the analysis and develop a logical model for the college's course registration system, would it be better to use a top-down approach or a bottom-up strategy (or another approach...be sure to describe). What are the reasons for your approach decision? Use a scenario from everyday life to create a basic decision table with...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT