Question

In: Economics

CASE STUDY 2: Correlation and Regression are investigating the relationship between two continuous variables such as...

CASE STUDY 2: Correlation and Regression are investigating the relationship between two continuous variables such as height and weight, time and speed or the concentration of an injected drug and heart rate.

a) In your opinion, discuss the importance of Correlation and Regression as a tools for analysis purposes.

b) Find any correlation and regression that have been applied in business from the online platform. From the data you required to:

i. State the Independent and Dependent Variable

ii. Draw the Scatter plot using Microsoft Excel.

iii. Find the Pearson Product Moment Correlation Coefficient

iv. Find the Regression line

v. Interpret the slope from (iv).

c) Discuss and determine the conclusion that you have from above (b) result.

Objective:

1) To determine the problem solving using the real situation

2) To have the practical knowledge of theoretical and application of statistics

Plagiarism must be below 30 %

Solutions

Expert Solution

a) Any kind of analysis, whether qualitative or quantitative across different fields of study; scientific, social science, business and humanities, requires investigating relation between different variables. The dependent variable is the variable being tested while the independent variables are the ones that capture the factors that are responsible for any change in dependent variable under study. Example: while studying the gender-wage gap, we need to find whether gender plays a significant role in deciding the wage rate and therefore gender is an independent variable while wage rate is the dependent variable.

Correlation and Regression are most commonly used analysis tools for studying the relationship between two variables. Former quantifies the strength of the linear relationship between a pair of variables, whereas latter expresses the relationship in the form of an equation. While correlation coefficient is same whether we study correlation of X on Y or Y on X but the same is not true in case of regression where the value of regression coefficient depend on the choice of dependent and independent variable.

b) Follwing table is data for the distance of a business from a city center (X) versus the amount of product sold per person (Y) and we need to check if there is a correlation between the two variables. Also, we will check whether the distance from a city centre is a significant variable that effects the amount of product sold per person.

Sakau Market distance/km (x) mean cups per person (y)
Upon the river 3 5.18
Try me first 13.5 3.93
At the bend 14 3.19
Falling down 15.5 2.62

i) Dependent Variable : Mean cups per person; Independent Variable : DIstance (KM)

ii)

iii) The Pearson product-moment correlation coefficient r tells us how well the data fits a straight line and can be calculated using following command in a spreadsheet:

=CORREL(y-values,x-values)

Pearson product-moment correlation coefficient r -0.93

iv) Regression line can be drawn in spreadsheet using the line graph under insert tab

Regression output:

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.93221259
R Square 0.86902032
Adjusted R Square 0.80353047
Standard Error 0.48999883
Observations 4
ANOVA
df SS MS F Significance F
Regression 1 3.18600228 3.18600228 13.2695437 0.06778741
Residual 2 0.48019772 0.24009886
Total 3 3.6662
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 5.79824873 0.61837767 9.37654928 0.01118357 3.13758434 8.45891312 3.13758434 8.45891312
Distance -0.1798477 0.04937157 -3.6427385 0.06778741 -0.3922764 0.032581 -0.3922764 0.032581
RESIDUAL OUTPUT
Observation Predicted Y Residuals
1 5.25870558 -0.0787056
2 3.37030457 0.55969543
3 3.28038071 -0.0903807
4 3.01060914 -0.3906091

The slope intercept indicates a negative relationship between the distance of business from city centre and the amount of product sold implying that as the distance from the city centre increases, the mean cups sold per person falls.


Related Solutions

1. If the coefficient of determination is 25%, the correlation between two continuous variables is a)...
1. If the coefficient of determination is 25%, the correlation between two continuous variables is a) -5 b) 5 c) -.25 d) .25 e) a or b f) none of the above 2. To assess the correlation between height and weight, one should use a) spearman correlation b) regression equation c. pearson correlation d) point biserial correlation 3. For a computed r = -0.547, given a dataset of n = 16, alpha = .05, and two-tailed significance, one should fail...
For an inverse relationship between two variables, the sign of the correlation coefficient is "+" TRUE...
For an inverse relationship between two variables, the sign of the correlation coefficient is "+" TRUE OR FALSE
What are the differences between results that demonstrate a correlation between two variables and results where a regression is run using two variables?
What are the differences between results that demonstrate a correlation between two variables and results where a regression is run using two variables? Think about your future clinical role and provide a clinical example of variables that you may want a correlation analysis run and explain. Think about your future clinical role and provide a clinical example of variables that you may want a regression analysis run and explain.
Regression methods were used to analyze the data from a study investigating the relationship between roadway...
Regression methods were used to analyze the data from a study investigating the relationship between roadway surface temperature (x) and pavement deflection (y). Summary quantities were n = 20, ∑?! = 12.75, ∑?!" = 8.86, ∑?! = 1478, ∑?!" = 143215.8, ∑?!?! = 1083.67. Give a 95% confidence interval for the mean response of pavement deflection given that temperature is 90 F.
This week examines how to use correlation and simple linear regression to test the relationship of two variables.
  Discussion 1: Searching for Causes This week examines how to use correlation and simple linear regression to test the relationship of two variables. In both of these tests you can use the data points in a scatterplot to draw a line of best fit; the closer to the line the points are the stronger the association between variables. It is important to recognize, however, that even the strongest correlation cannot prove causation. For this Discussion, review this week’s Learning...
Checking Your Progress – Correlation & Regression Researchers investigated the relationship between amount of study time...
Checking Your Progress – Correlation & Regression Researchers investigated the relationship between amount of study time statistics class and mid-semester quiz scores. The data appear below: 1 28 95 2 25 95 3 3 58 4 10 75 5 0 44 6 15 83 7 20 91 8 24 87 9 7 65 10 8 70 Find the correlation between hours of study and quiz scores, and test it for significance. Then complete a simple linear regression analysis using hours...
Project 2. Choose a topic of study that will include a simple correlation between 2 variables....
Project 2. Choose a topic of study that will include a simple correlation between 2 variables. 1. Collect 2 variables from at least 8 individuals. (you will have a sample of 8 pair of values) 2. Identify the dependent variable and the independent variable. 3. What is the regression line? 4. Test the significance of the correlation between the variables 5. Choose an independent variables\ to make a prediction of the dependent variable associated with it. 6. What are the...
5. A Pearson correlation statistic is only valid when the relationship between the two quantitative (continuous)...
5. A Pearson correlation statistic is only valid when the relationship between the two quantitative (continuous) variables is ____________. Explain why it is true that the slope of a line is related to the Pearson correlation statistic, r. Create a scatterplot to investigate the association between the amount of fluoride in domestic water (ppm) and the number of dental caries in permanent teeth per 100 children for 21 cities. The data are below. Create the scatterplot Describe the association you...
Perform one correlation between two independent variables, such as Age and Relationship with Coworkers. Perform the...
Perform one correlation between two independent variables, such as Age and Relationship with Coworkers. Perform the second correlation on an independent variable (such as Relationship with Direct Supervisor) and the dependent variable (such as Workplace Happiness Rating. Gender Age Supervisor Telecommute Coworkers Happiness Engagement Overall Rating 1 29 1 1 2 7 8 15 2 32 4 1 3 9 10 19 1 39 1 1 1 4 5 8 1 25 2 1 2 5 8 13 1 27...
One of the major misconceptions about correlation is that a relationship between two variables means causation;...
One of the major misconceptions about correlation is that a relationship between two variables means causation; that is, one variable causes changes in the other variable. There is a particular tendency to make this causal error when the two variables seem to be related to each other. What is one instance where you have seen correlation misinterpreted as causation? Please describe.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT