Question

In: Statistics and Probability

I have to submit a term paper which involves conducting a regression and correlation analysis on...

I have to submit a term paper which involves conducting a regression and correlation analysis on any topic of my choosing. The paper must be based on yearly data for any economic or business variable, for a period of at least 20 years. The following also must be included in the paper: • The term paper should distinguish between dependent and independent variables; determine the regression equation by the least squares method; plot the regression line on a scatter diagram; interpret the meaning of regression coefficients; use the regression equation to predict values of the dependent variable for selected values of the independent variable and construct forecast intervals and calculate the standard error of estimate, coefficients of determination (r2) and correlation (r) and interpret the meaning of the coefficients (r2) and (r). Your regression and correlation analysis must: 1. Graph the data (scatter diagram) 2. Use the method of least squares to derive a trend equation and trend values
 3. Use check column to verify computations ∑ (Y-Yc)=0
 4. Superimpose trend equation on scatter diagram.
 5. Use your model to predict the movement of the variable for the next year.
 6. Compare your predictions with the actual behavior of the variable during the 21styear . I am stumped on what topic to choose and that is where I am looking for guidance. Any help would be greatly appreciated.

Solutions

Expert Solution

A good way to start might be to model the relationship with a linear equation, Y = mX + b, where m is the slope of the line and b is the Y-intercept, as you learned in high school algebra. The question now is to determine the best way to estimate m and be given your pairs of X's and Y's.

It turns out that the best way to do this is to use least squares regression. We call it least squares regression because the line that we choose will be the one for which the sum of the squares of the differences between predicted and observed values is as small as possible.

1. Scatter Plot of Y and X1

Scatter plot of sales and calls shows that there can be a linear trend between the both. The trendline indicates that it looks like higher the number of calls, higher will be sales

2. Best fit line

Using the Regression option in Excel Data analysis menu, we obtaain the following output

From this, bestfit line equation is Sales=Intercept+Coefficient of Calls *Calls

i.e., Sales = 22.52 + 0.1237 * Calls

3. Coefficient of Correlation

It denotes the strength of association between two variables. The sign denotes the direction of association.

In Exel, we calculate Correlation coefficient as Correl(X1array,Yarray)

We get the value as 0.318

This means that calls and sales are slightly positively associated. With increase in one quantiity, the other is also showing an increasing trend. Please note that this does not imply causation, i.e.,we CANNOT say that the rise or fall in one is causing the change in other.

4. Coefficient of Determination

It is more commonly known as R squared value. It gives the measure of how close the data points are to the best fit line. In other words, it gives the proportion of variability in dependent variable that can be explained by the independent variable. Higher the Rsquared value, better the model is.

From Excel regression output, we get R squared value or Coefficient of Determination as 0.101

~10% of variability in sales is explained by calls.

5. Utility of Regression model

F test can be used to test the utility of the model.

Null Hypothesis: Beta coefficient of call = 0; i.e., Calls is NOT linearly associated with sales

Alternate Hypothesis: Beta coefficient of call 0; Calls is linearly associated with sales

Let us choose significance level, = 0.05.

From the regression ANOVA output, we get p value (or significance value) of F test as 0.0012 (<0.05) for the given degrees of freedom (highlighted)

Since p value < , we can reject Null hypothesis, thereby concluding that with the given data it can be said that calls is linearly associated with sales.

6. Based on the above findings, it can be said that calls is a good and important variable in predicting sales volume. It has been proved that calls and sales have a positive linear association between them. From the best fit line (Sales = 22.52 + 0.1237 * Calls), we can say that with every call, sales increases by 0.1237units (interpretation of coefficient of calls).

7. 95% Confidence Interval

The 95% confidence interval for the coefficient of Calls (1) is [0.0498, 0.1976]

Interpretation: 95% confidence interval means that if this regression analysis is to be repeated for other samples from population, 95% of the intervals will contain the true value of 1. In simpler terms, we can say that we are 95% confident that the true value of 1 is in our interval...


Related Solutions

Objective of this Assignment: To conduct a regression and correlation analysis Instructions: The term paper involves...
Objective of this Assignment: To conduct a regression and correlation analysis Instructions: The term paper involves conducting a regression and correlation analysis on any topic of your choosing. The paper must be based on yearly data for any economic or business variable, for a period of at least 20 years (Note: the source of the data must be given. Data which is not properly documented as to source is unacceptable and paper will be graded F. Textbook examples are unacceptable)....
Objective of this Assignment:    To conduct a regression and correlation analysis Instructions: The term paper involves...
Objective of this Assignment:    To conduct a regression and correlation analysis Instructions: The term paper involves conducting a regression and correlation analysis on any topic of your choosing. The paper must be based on yearly data for any economic or business variable, for a period of at least 20 years (Note: the source of the data must be given. Data which is not properly documented as to source is unacceptable and paper will be graded F. Textbook examples are unacceptable)....
I have a question which involves the use of stata what regression would I have to...
I have a question which involves the use of stata what regression would I have to run to answer this question? I have WEEKPAY – Gross weekly pay in the respondent’s main job GENDER – The respondent’s reported gender MON – The month the respondent started her current job YEAR– The year the respondent started her current job What is the average size of the gender pay gap after the implementation (2018) of the regulation? Run a regression to estimate...
What is the real difference between between correlation and regression analysis? I see that correlation allows...
What is the real difference between between correlation and regression analysis? I see that correlation allows us to examine whether two variables exhibit some kind of association with each other. Does correlation mean means that one variable is dependent on another? Is this an example of correlation - Tips are dependent on the actual restaurant bill? Can you give other examples of correlation and regression? Thanks.
Intro Discussion and Questions- Regression and Correlation: How does correlation analysis differ from regression analysis? What...
Intro Discussion and Questions- Regression and Correlation: How does correlation analysis differ from regression analysis? What are the goals of each? What does the correlation coefficient r measure? Be specific. What values can it take on? What do the values indicate? What values can the coefficient of determination take on? What does it measure? What are some of the limitations of simple linear regression? Describe the equation for multiple linear regression; be sure to clearly define all of its parts....
What impact does regression have on correlation analysis? Explain the two ways correlation can be displayed...
What impact does regression have on correlation analysis? Explain the two ways correlation can be displayed mathematically. Does one way have an advantage over the other? Please explain your thinking.
How is a linear regression analysis is different from a correlation analysis?
How is a linear regression analysis is different from a correlation analysis?
I have a term paper that I have to write for my microeconomics class. Economics is...
I have a term paper that I have to write for my microeconomics class. Economics is super new to me, this is the first class I've ever taken related to it. My subject is the economic impacts of the opioid epidemic. My outline is due next week and I have no idea what to even do research on or which direction to go. Where do I even start?
The subject is the principle of nutrition. Homework question is to submit term paper for dietary...
The subject is the principle of nutrition. Homework question is to submit term paper for dietary analysis assignment. I have never submitted a term paper and have no idea how to write in APA format, what to include in the term paper.? My professor provides some guidelines that it should be 10 pages, 10,000 words and use supertraker from choosemyplate.gov to print a three-day report.
By the end of the week, submit a topic proposal for your Negotiation Analysis Paper. The...
By the end of the week, submit a topic proposal for your Negotiation Analysis Paper. The proposal (one page) should describe the focus of the paper and your method. The negotiation can be one in which you were a participant or one in which you have been an active observer. Some examples of applicable negotiations are: workplace negotiation, such as a complex contract, new position, or new salary (preferred); a complex business transaction, such as a merger or acquisition; a...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT