Question

In: Statistics and Probability

The ministry of Health wishes to determine if there is a relationship between the number of...

  1. The ministry of Health wishes to determine if there is a relationship between the number of cigarettes smoked daily and life time health care costs The result of six sample smokers is recorded below.

Number of Cigarettes

Health Costs (in thousands$)

30

43

40

45

50

54

60

53

70

56

80

63

  1. Calculate the correlation coefficient
  2. Test is the correlation coefficient is different from zero (.01 level)
  3. Create the Ordinary Least Squares regression line
  4. Discuss the assumptions that you make in order to use the OLS line as a predictor.

Solutions

Expert Solution

  1. Calculate the correlation coefficient

The correlation coefficient is 0.958.

  1. Test is the correlation coefficient is different from zero (.01 level)

The hypothesis being tested is:

H0: ρ = 0

Ha: ρ ≠ 0

The t-statistic is 6.664.

The p-value is 0.0026.

Since the p-value (0.0026) is less than the significance level (0.05), we can reject the null hypothesis.

Therefore, we can conclude that the correlation coefficient is different from zero.

  1. Create the Ordinary Least Squares regression line

The Ordinary Least Squares regression line is:

y = 31.5905 + 0.3771*x

  1. Discuss the assumptions that you make in order to use the OLS line as a predictor.

There are four assumptions associated with a linear regression model:

  1. Linearity: The relationship between X and the mean of Y is linear.
  2. Homoscedasticity: The variance of residual is the same for any value of X.
  3. Independence: Observations are independent of each other.
  4. Normality: For any fixed value of X, Y is normally distributed.
Number of Cigarettes Health Costs (in thousands$)
30 43
40 45
50 54
60 53
70 56
80 63
0.917
r   0.958
Std. Error   2.367
n   6
k   1
Dep. Var. Health Costs (in thousands$)
ANOVA table
Source SS   df   MS F p-value
Regression 248.914 1   248.9143 44.41 .0026
Residual 22.4190 4   5.6048
Total 271.333 5  
Regression output confidence interval
variables coefficients std. error    t (df=4) p-value 95% lower 95% upper
Intercept 31.5905
Number of Cigarettes 0.3771 0.0566 6.664 .0026 0.2200 0.5343

Please give me a thumbs-up if this helps you out. Thank you!


Related Solutions

manager wishes to determine the relationship between the number of miles traveled (in hundreds of miles)...
manager wishes to determine the relationship between the number of miles traveled (in hundreds of miles) by her sales representatives and their amount of sales (in thousands of dollars) per month. Complete the linear regression test below in order to predict the amount of sales if the sales representative traveled 1,300 miles? By visual observation of the scatter plot, does the data appear linear? Null Hypothesis: Alternative Hypothesis: What is the value of r? What are the critical values for...
A trainer wishes to find out if there is a relationship between the number of miles...
A trainer wishes to find out if there is a relationship between the number of miles walked per week and the weight of a person. The data for the sample are shown: Miles Walked, x: 15 20 3 12 17 8 Weight (lbs.), y: 147 106 210 160 122 165 a) Compute the value of the correlation coefficient. b) Find the equation of the regression line. c) Find y’ (the weight) when x = 11. (Someone walks 11 miles).
A researcher wishes to determine whether there is a relationship between the Campus at which SACAP...
A researcher wishes to determine whether there is a relationship between the Campus at which SACAP students are based and their statistics anxiety level. The results of the subsequent survey are given in the following table: Statistics anxiety level Campus Low Medium High Total Online 13 30 67 Cape Town 58 70 22 Johannesburg 26 34 30 Total Perform an appropriate hypothesis test at α = 0.01. If a significant result is obtained, determine the strength of the relationship. Show...
An emergency service wishes to determine whether a relationship exists between the outside temperature and the...
An emergency service wishes to determine whether a relationship exists between the outside temperature and the number of emergency calls it receives for a 7-hour period. The data are shown. (25 points) Temperature x 25 10 27 30 33 No. of calls y 7 4 8 10 11 a) Find the correlation coefficient r (If you use any software or online calculator, please take the screenshot. You will get extra points if you do by hand.) b) Find the regression...
You want to determine if there is a linear relationship between mental health and physical health....
You want to determine if there is a linear relationship between mental health and physical health. You have several people assessed by a psychologist and a medical doctor. The psychologist assess mental health (ment) on a scale of 1-30 (higher numbers mean better mental health) and the medical doctors rated their overall physical health (phys) from 1-30 (higher numbers mean better health). You obtain the data below. (note: there is more information than you need) Person   MENT   PHYS   + -...
An environmentalist wants to determine the relationship between the number of fires, in thousands, and the...
An environmentalist wants to determine the relationship between the number of fires, in thousands, and the number of acres burned, in hundreds of thousands. Based on this data, decide if the correlation is significant at alpha = 0.05. Number of fires x 73 74 58 48 80 65 54 49 Number of acres burned y 64 46 22 23 51 12 29 10 5. When x = 61 what is y and what does it mean I this context. 6....
A study is conducted to determine the relationship between a driver's age and the number of...
A study is conducted to determine the relationship between a driver's age and the number of accidents he or she has over a 1-year period. The data are shown here. If there is a significant relationship, predict the number of accidents of a driver who is 28. Driver's No. of Age x accidents y 16 3 24 2 18 5 17 2 23 0 27 1 32 1 Predict y' for a specific value of x, if the relationship is...
A study is conducted to determine the relationship between a driver's age and the number of...
A study is conducted to determine the relationship between a driver's age and the number of accidents he or she has over a 1-year period. The data are shown here. If there is a significant relationship, predict the number of accidents of a driver who is 28. Driver's No. of Age x accidents y 16 3 24 2 18 5 17 2 23 0 27 1 32 1 For steps 4 and 5 of the hypothesis testing, what is your...
A study is conducted to determine the relationship between a driver's age and the number of...
A study is conducted to determine the relationship between a driver's age and the number of accidents that he or she has had over a 1-year period. The data from this study is in the table: Age 16 24 18 17 23 27 32 Accidents 3 2 5 2 0 1 1 The correlation coefficient for this problem is There is/ is not evidence of significant correlation at the 5% significance level The best estimate for the number of accidents...
A study is conducted to determine the relationship between a driver's age and the number of...
A study is conducted to determine the relationship between a driver's age and the number of accidents he or she has over a 1-year period. The data are shown here. If there is a significant relationship, predict the number of accidents of a driver who is 28. Driver's No. of Age x accidents y 16 3 24 2 18 5 17 2 23 0 27 1 32 1 1. What is your alternative hypothesis (H1)? 2. Compute the value of...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT