Question

In: Statistics and Probability

A researcher conducts a long-term study of the correlation between the number of children a family...

A researcher conducts a long-term study of the correlation between the number of children a family has (X) and the number of pets they have 20 years later (Y). He finds the following results:

Children (X)                                   Pets 20 years later (Y)

2                                                       4

4                                                       6                

3                                                       1

0                                                       2

1                                                       2

First, the researcher wants to calculate the correlation between the two variables. Using this dataset, calculate r. (3 pts)

Next, the researcher wants to use his knowledge about the correlation to be able to predict future pet ownership based on current family size. Using the information from the original test, calculate the linear regression equation for this dataset (2 pts)

Finally, use the regression equation to predict the number of future pets owned by a family that currently has 3 children. Use it again to predict the number of future pets owned by a family with 1 child. Make sure to label your answers clearly (1 pt each)

Solutions

Expert Solution

Answer to 1st Question:
The following table shows the calculations –

Children(X)

Pets 20 years later(Y)

X^2

Y^2

XY

2

4

4

16

8

4

6

16

36

24

3

1

9

1

3

0

2

0

4

0

1

2

1

4

2

Total

10

15

30

61

37

Total number of observations, n = 5

Mean of X, = 10/5 = 2

Mean of Y, = 15/5 = 3

Standard Deviation of X, Sx = {((X^2) / n) - (^2)}^0.5 = {(30/5) – (2^2)}^0.5 = 1.4142

Standard Deviation of Y, Sy = {((Y^2) / n) - (^2)}^0.5   = {(61/5) – (3^2)}^0.5 = 1.7888

Covariance between X and Y, Cov.(X,Y) = ((XY) / n) - () = (37/5) – (2 x 3) = 1.4

Correlation Coefficient, r = Cov.(X, Y) / (Sx.Sy) = 1.4 / (1.4142 x 1.7888) = 0.5534

(Here, all measures are rounded up to 4 decimal places)

Answer to 2nd Question:
The general way of obtaining a least – squares regression equation for two variables is given below –

Where b is the slope of the regression equation and a is the Y – Intercept

Therefore,

b = (r.Sy) / Sx = (0.5534 x 1.7888) / 1.4142 = 0.6999 = 0.7 (approximately)

a = - b = 3 – (0.7 x 2) = 1.6

The regression equation is -

(predicted value) = 1.6 + 0.7X

Answer to 3rd Question:
When the family has 3 children, that is, when X = 3

= 1.6 + (0.7 x 3) = 4 (rounded to the nearest whole number)

Therefore, when the family has 3 children, the number of future pets owned by the family is 4

When the family has 1 child, that is, when X = 1

= 1.6 + (0.7 x 1) = 2 (rounded to the nearest whole number)

Therefore, when the family has 1 child, the number of future pets owned by the family is 2


Related Solutions

A state conducts a study that shows strong negative correlation between residents' life expectancy and number...
A state conducts a study that shows strong negative correlation between residents' life expectancy and number of breweries within 2 miles of the residents. The state concludes that if they mandate a decrease in the number of breweries life expectancy will go up. Select all true statements and explain why. A. The state's conclusion is correct as long as the correlation measures strong linear association with no distorting outliers. B. We can use this study to conclude that proximity to...
If a researcher found that there was a correlation of r = -0.67 between the number...
If a researcher found that there was a correlation of r = -0.67 between the number of siblings a person has and introversion, and the researcher sampled n = 30 people, what conclusions can you make about this relationship
A researcher is interested in determining whether there is a correlation between number of packs of...
A researcher is interested in determining whether there is a correlation between number of packs of cigarettes smoked # packs of cigarettes smoked (X) (Y) 0 80 0 70 1 72 1 70 2 68 2 65 3 69 3 60 4 58 4 55 day and longevity (in years). n=10.
A researcher is interested in determining whether there is a correlation between number of packs of...
A researcher is interested in determining whether there is a correlation between number of packs of cigarettes smoked per day and longevity (in years). n=10. Longevity # packs of cigarettes smoked (X) (Y) 0 80 0 70 1 72 1 70 2 68 2 65 3 69 3 60 4 58 4 55
A family researcher is interested whether there is an association between the number of siblings a...
A family researcher is interested whether there is an association between the number of siblings a person has and the number of children they have. She interviews six older adults and finds the following information. Number of Siblings Number of Children 0 1 1 1 2 1 2 0 3 3 4 3 Mean Number of Siblings = 2 Mean Number of Children = 1.5 Standard Deviation = 2 Standard Deviation = 1.5 Explain which variable more naturally plays the...
. A study was conducted to examine the correlation between number of study hours and students...
. A study was conducted to examine the correlation between number of study hours and students grads in exam. Study hours for students: 2, 3, 5, 6, 8, 10, 10, 2, 5, 6, 5, 3, 7, 6, 2, 7, 6, 8, 2, 5 Grads in exam for students: 3, 4, 6, 7, 8, 10, 9, 8, 3, 6, 5, 4, 6, 6, 3, 7, 6, 3, 4, 5 1- as ungrouped data find the frequency, accumulated relative frequency, and accumulative...
. A study was conducted to examine the correlation between number of study hours and students...
. A study was conducted to examine the correlation between number of study hours and students grads in exam. Study hors for students: 2, 3, 5, 6, 8, 10, 10, 2, 5, 6, 5, 3, 7, 6, 2, 7, 6, 8, 2, 5 Grads in exam for students: 3, 4, 6, 7, 8, 10, 9, 8, 3, 6, 5, 4, 6, 6, 3, 7, 6, 3, 4, 5 1- as ungrouped data find the frequency, accumulated relative frequency, and accumulative...
1)A researcher believes that there is a correlation between the number of cigarettes smoked per day...
1)A researcher believes that there is a correlation between the number of cigarettes smoked per day and intelligence.  The following data were collected on 15 smokers.  Find Pearson's r. Number of cigarettes (X) IQ score (Y) 7    10 49 6 41    15 38   5 37 12 19    4 35 19 40   11 1 3 10 3 18 22 21 17 15 12 7    9 38    13 n=15 Sum of X= 376          Sum of Y=161   Sum...
A researcher conducts a study on the effects of amount of sleep on creativity. The creativity...
A researcher conducts a study on the effects of amount of sleep on creativity. The creativity scores for four levels of sleep (2 hours, 4 hours, 6 hours, and 8 hours) are presented below:             2 Hours of Sleep         4 Hours of Sleep         6 Hours of Sleep         8 Hours of Sleep                         3                                  4                                  10                                10                         5                                  7                                  11                                13                         6                                  8                                  13                                10                         4                                  3                                  9                                9                         2                                  2                                  10                                10 Pretend that you have...
A study was conducted to investigate for any possible correlation that may exist between the number...
A study was conducted to investigate for any possible correlation that may exist between the number of tardiness and the final grade (%) of the students in statistics class. Based on this data, Kuya, who is enrolling in the class next trimester, may anticipate his grade from being tardy every other week (6 times). What would his estimated grade be? STUDENT TARDY GRADE 1 1 89 2 2 94 3 0 97 4 0 87 5 1 94 6 0...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT