In: Statistics and Probability
The following data for the dependent variable, y, and the independent variable, x, have been collected using simple random sampling:
X |
Y |
10 |
120 |
14 |
130 |
16 |
170 |
12 |
150 |
20 |
200 |
18 |
180 |
16 |
190 |
14 |
150 |
16 |
160 |
18 |
200 |
a) Based on the given data the scatter plot is plotted as:
Based on the scatter plot data distribution we can say that there is a strong positive correlation between the variables.
b) To calculate the correlation coefficient following table calculation is cone as:
X Values
∑ = 154
Mean = 15.4
∑(X - Mx)2 = SSx = 80.4
Y Values
∑ = 1650
Mean = 165
∑(Y - My)2 = SSy = 7050
X and Y Combined
N = 10
∑(X - Mx)(Y - My) = 670
Where,
X: X Values
Y: Y Values
Mx: Mean of X Values
My: Mean of Y Values
X - Mx & Y -
My: Deviation scores
(X - Mx)2 & (Y -
My)2: Deviation
Squared
(X - Mx)(Y -
My): Product of Deviation Scores
Now the correlation is calculated as:
R Calculation
r = ∑((X - My)(Y - Mx)) /
√((SSx)(SSy))
r = 670 / √((80.4)(7050)) = 0.8899
The 0.89 correlation explains that there is strong correlation between the two variables.