In: Statistics and Probability
In baseball, is there a linear correlation between batting average and home run percentage? Let x represent the batting average of a professional baseball player, and let y represent the player's home run percentage (number of home runs per 100 times at bat). A random sample of n = 7 professional baseball players gave the following information.
x 0.255 0.251 0.286 0.263 0.268 0.339 0.299
y 1.5 3.9 5.5 3.8 3.5 7.3 5.0
(a) Make a scatter diagram of the data. Then visualize the line you think best fits the data.
(b) Use a calculator to verify that Σx = 1.961, Σx2 = 0.555, Σy = 30.5, Σy2 = 152.69 and Σxy = 8.842. Compute r. (Round to 3 decimal places.)
As x increases, does the value of r imply that y should tend to increase or decrease? Explain your answer.
Given our value of r, we can not draw any conclusions for the behavior of y as x increases.
Given our value of r, y should tend to remain constant as x increases.
Given our value of r, y should tend to increase as x increases.
Given our value of r, y should tend to decrease as x increases.
Solution:
Part a
Required scatter diagram is given as below:

From above scatter diagram, it is observed that there is a positive linear relationship exists between the given two variables.
Part b
The formula for correlation coefficient is given as below:
Correlation coefficient = r = [n∑xy - ∑x∑y]/sqrt[(n∑x^2 – (∑x)^2)*(n∑y^2 – (∑y)^2)]
Required calculation table is given as below:
| 
 No.  | 
 x  | 
 y  | 
 x^2  | 
 y^2  | 
 xy  | 
| 
 1  | 
 0.255  | 
 1.5  | 
 0.065025  | 
 2.25  | 
 0.3825  | 
| 
 2  | 
 0.251  | 
 3.9  | 
 0.063001  | 
 15.21  | 
 0.9789  | 
| 
 3  | 
 0.286  | 
 5.5  | 
 0.081796  | 
 30.25  | 
 1.573  | 
| 
 4  | 
 0.263  | 
 3.8  | 
 0.069169  | 
 14.44  | 
 0.9994  | 
| 
 5  | 
 0.268  | 
 3.5  | 
 0.071824  | 
 12.25  | 
 0.938  | 
| 
 6  | 
 0.339  | 
 7.3  | 
 0.114921  | 
 53.29  | 
 2.4747  | 
| 
 7  | 
 0.299  | 
 5  | 
 0.089401  | 
 25  | 
 1.495  | 
| 
 Total  | 
 1.961  | 
 30.5  | 
 0.555137  | 
 152.69  | 
 8.8415  | 
Σx = 1.961, Σx2 = 0.555, Σy = 30.5, Σy2 = 152.69 and Σxy = 8.842
r = [n∑xy - ∑x∑y]/sqrt[(n∑x^2 – (∑x)^2)*(n∑y^2 – (∑y)^2)]
r = [7*8.842- 1.961*30.5]/sqrt[(7*0.555 – (1.961)^2)*(7*152.69 – (30.5)^2)]
r = 0.878655
r = 0.879
There is a strong positive linear correlation or association exists between the given two variables.
Given our value of r, y should tend to increase as x increases.