In: Statistics and Probability
The following data represent a company's yearly sales volume and its advertising expenditure over a period of 8 years.
|
Sales in |
Advertising |
|
11 |
32 |
|
16 |
33 |
|
18 |
35 |
|
17 |
34 |
|
16 |
36 |
|
19 |
37 |
|
19 |
39 |
|
24 |
42 |
| a. | Which variable is the dependent variable? |
| b. | Using the Excel spreadsheet, run the least squares model and write the summary output here. |
| c. | If the company's advertising expenditure is $400,000, what are the predicted sales? Give the answer in dollars. |
| d. | What does the slope of the estimated regression line indicate? |
| e. | Compute the coefficient of determination and fully interpret its meaning. |
| f. | Use the F test to determine whether or not the regression model is significant at a = 0.05. |
| g. | Use the t test to determine whether the slope of the regression model is significant at a = 0.05. |
| h. | Develop a 95% confidence interval for predicting the average sales for the years when $400,000 was spent on advertising. |
| i. | Compute the correlation coefficient. |
a) Sales are depends on Advertising.
So here Sales in millions of dollar is dependent variable.
b) Excel output is given by,
Data tab > Data analysis > regression.
Insert ranges of x and y . So we get the output as :
| SUMMARY OUTPUT | ||||||
| Regression Statistics | ||||||
| Multiple R | 0.899172 | |||||
| R Square | 0.808511 | |||||
| Adjusted R Square | 0.776596 | |||||
| Standard Error | 1.732051 | |||||
| Observations | 8 | |||||
| ANOVA | ||||||
| df | SS | MS | F | Significance F | ||
| Regression | 1 | 76 | 76 | 25.33333 | 0.002373 | |
| Residual | 6 | 18 | 3 | |||
| Total | 7 | 94 | ||||
| Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | |
| Intercept | -18.5 | 7.178642 | -2.57709 | 0.041933 | -36.0655 | -0.9345 |
| Advertising in($10000) | 1 | 0.19868 | 5.033223 | 0.002373 | 0.513848 | 1.486152 |
c) From output we get :
Slope = b= 1 , Intercept = a = -18.5
Hence the regression equation is given by,
y = a + b * x
y = -18.5 + 1 * x
Here expenditure = x = 400,000
y = -18.5 + 1 * 40= 21.5 millions of dollar.
d) Here from output : Slope = b =1
So this tells us that, as expenditure in advertising increases by 1 unit , sales will increase by b= 1 units.
e) Coefficient of determination = R2 = 0.8085
80.85% of the variations in Advertising are explained by regression equation.
f) The hypothesis are :
H0: The model is not significant. v/s H1 : The model is significant.
From output we get :
The test statistic,
F =
25.333
P value = 0.0024
Here p value <
( 0.05 ).
Hence we reject null hypothesis.
We conclude that model is significant.
g)
The hypothesis are :
H0:
= 0 v/s H1 :
0
From output we get,
The test statistics is,
t = 5.033
P value = 0.0024
Here p value <
( 0.05 )
Hence we reject null hypothesis.
Conclusion : Slope of regression model is significant.
h) x = 40,
= 21.5
The 95% confidence interval is given by,

Here c = 0.95,
= 1-c = 1-0.95 = 0.05, df = n-2= 8-2 = 6
-----( Using excel formula "=t.inv.2t(0.05,6) " )
From output we get MSerror = 3, SSx = 76
n= 8,
So the confidence interval is ,

21.5
2.46
( 19.04, 23.96 )
i) From output we get,
correlation coefficient = r = 0.899