In: Statistics and Probability
A regional express delivery service company recently conducted a study to investigate the relationship between the cost of shipping a package ($), the package weight (in pound) and the distance shipped (in miles). Twenty packages were randomly selected from among the large number received for shipment, and a detailed analysis of the shipping cost was conducted for each package. The data for this sample observations are given in the file Assignment 4 S1 2020.XLS.
a. Estimate a simple linear regression model involving shipping cost and package weight. Interpret the slope coefficient of the least squares line as well as the computed value of ? 2 . [4 marks]
b. Add another explanatory variable–distance shipped–to the regression model in part a. Estimate and interpret this expanded model. How does the ? 2value for this multiple regression model compare to that of the simple regression model estimated in part a? [5 marks]
c. Use the F test to determine the overall significance of the regression relationship for the expanded model. What is the conclusion at the 0.01 level of significance? [4 marks]
d. Use the t test to determine the significance of each independent variable. What is the conclusion for each test at the 0.01 level of significance? [4 marks]
Data
| Cost of Shipment and Potentially Relevant Data | ||
| Cost_of_Shipment | Package_Weight | Distance_Shipped | 
| $3.30 | 4.10 | 95 | 
| $2.00 | 0.30 | 160 | 
| $11.00 | 5.10 | 240 | 
| $2.60 | 5.90 | 47 | 
| $1.90 | 4.50 | 53 | 
| $8.00 | 3.50 | 250 | 
| $15.50 | 7.00 | 260 | 
| $5.00 | 2.40 | 209 | 
| $1.00 | 0.60 | 100 | 
| $4.40 | 0.75 | 280 | 
| $6.00 | 6.20 | 115 | 
| $1.70 | 1.10 | 90 | 
| $14.50 | 6.50 | 240 | 
| $14.00 | 7.50 | 190 | 
| $9.20 | 6.60 | 160 | 
| $1.10 | 2.70 | 45 | 
| $12.10 | 8.10 | 160 | 
| $1.50 | 0.70 | 80 | 
| $8.00 | 4.40 | 202 | 
| $3.90 | 3.20 | 145 | 
| $4.40 | 0.75 | 280 | 
| $16.50 | 7.20 | 280 | 
| $15.50 | 7.00 | 250 | 
| $14.00 | 7.50 | 190 | 
| $3.30 | 4.10 | 95 | 
| $2.20 | 1.50 | 160 | 
| $11.00 | 5.10 | 240 | 
| $1 | 0.6 | 100 | 
| $4 | 0.75 | 280 | 
| $2 | 0.7 | 80 | 
| $8 | 4.4 | 202 | 
| $2 | 4.5 | 52 | 
| $8.00 | 3.2 | 240 | 
| $15.50 | 7.6 | 270 | 
| $5.00 | 2.5 | 211 | 
| $1.00 | 7 | 98 | 
| $8.00 | 4.4 | 202 | 
| $3.90 | 3.2 | 145 | 
| $4.40 | 0.75 | 280 | 
| $5.00 | 2.4 | 209 | 
using excel>data>data analysis>Regression
we have
| SUMMARY OUTPUT | ||||||
| Regression Statistics | ||||||
| Multiple R | 0.742802 | |||||
| R Square | 0.551755 | |||||
| Adjusted R Square | 0.539959 | |||||
| Standard Error | 3.354405 | |||||
| Observations | 40 | |||||
| ANOVA | ||||||
| df | SS | MS | F | Significance F | ||
| Regression | 1 | 526.3139 | 526.3139 | 46.77501 | 4.06E-08 | |
| Residual | 38 | 427.5771 | 11.25203 | |||
| Total | 39 | 953.891 | ||||
| Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | |
| Intercept | 0.781701 | 0.994461 | 0.786055 | 0.43671 | -1.23148 | 2.794882 | 
| Package weight | 1.472373 | 0.215284 | 6.839226 | 4.06E-08 | 1.036554 | 1.908192 | 
a simple linear regression model involving shipping cost and package weight is
cost = 0.7817+1.4724 *Package weight
For every one one unit increase in weight , cost of shipment will increase by $1.4724
the ? 2value = 0.5518 , about 55.18 % variation in shipment cost is explained by Package weight
using excel>data>data analsysis>Regression
we have
| SUMMARY OUTPUT | ||||||
| Regression Statistics | ||||||
| Multiple R | 0.931476 | |||||
| R Square | 0.867647 | |||||
| Adjusted R Square | 0.860493 | |||||
| Standard Error | 1.847203 | |||||
| Observations | 40 | |||||
| ANOVA | ||||||
| df | SS | MS | F | Significance F | ||
| Regression | 2 | 827.6411 | 413.8206 | 121.2782 | 5.65E-17 | |
| Residual | 37 | 126.2499 | 3.412159 | |||
| Total | 39 | 953.891 | ||||
| Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | |
| Intercept | -5.08357 | 0.830332 | -6.12233 | 4.29E-07 | -6.76598 | -3.40116 | 
| Package weight | 1.347949 | 0.119289 | 11.29982 | 1.48E-13 | 1.106246 | 1.589653 | 
| Distance speed | 0.036372 | 0.00387 | 9.397333 | 2.43E-11 | 0.02853 | 0.044214 | 
b. the new regression model is
cost = -5.08357 +1.34795 *Package weight + 0.0364* Distance speed
For every one one unit increase in speed , cost of shipment will increase by $\0.0364
the ? 2value = 0.8677, about 86.67 % variation in shipment cost is explained by Package weight and distance speed .
for this multiple regression model the value of R2 is more in compare to that of the simple regression model estimated in part a
c ) the value of F stat for overall model is 121.278
and p value is 0.0000 which is less than 0.01 so we conclude that model is significant to use .
d ) the explanatory variables package placed and distance speed are signficant because theri p value is less than 0.01