Question


A regional express delivery service company recently conducted a study to investigate the relationship between the cost of shipping a package ($), the package weight (in pounds), and the distance shipped (in miles). Twenty packages were randomly selected from among the large number received for shipment, and a detailed analysis of the shipping cost was conducted for each package. The data for these sample observations are given in the file Assignment 4 S1 2020.XLS.

a. Estimate a simple linear regression model involving shipping cost and package weight. Interpret the slope coefficient of the least squares line as well as the computed value of R². [4 marks]

b. Add another explanatory variable, distance shipped, to the regression model in part a. Estimate and interpret this expanded model. How does the R² value for this multiple regression model compare to that of the simple regression model estimated in part a? [5 marks]

c. Use the F test to determine the overall significance of the regression relationship for the expanded model. What is the conclusion at the 0.01 level of significance? [4 marks]

d. Use the t test to determine the significance of each independent variable. What is the conclusion for each test at the 0.01 level of significance? [4 marks]

Data

Cost of Shipment and Potentially Relevant Data
Cost_of_Shipment Package_Weight Distance_Shipped
$3.30 4.10 95
$2.00 0.30 160
$11.00 5.10 240
$2.60 5.90 47
$1.90 4.50 53
$8.00 3.50 250
$15.50 7.00 260
$5.00 2.40 209
$1.00 0.60 100
$4.40 0.75 280
$6.00 6.20 115
$1.70 1.10 90
$14.50 6.50 240
$14.00 7.50 190
$9.20 6.60 160
$1.10 2.70 45
$12.10 8.10 160
$1.50 0.70 80
$8.00 4.40 202
$3.90 3.20 145
$4.40 0.75 280
$16.50 7.20 280
$15.50 7.00 250
$14.00 7.50 190
$3.30 4.10 95
$2.20 1.50 160
$11.00 5.10 240
$1.00 0.60 100
$4.00 0.75 280
$2.00 0.70 80
$8.00 4.40 202
$2.00 4.50 52
$8.00 3.20 240
$15.50 7.60 270
$5.00 2.50 211
$1.00 7.00 98
$8.00 4.40 202
$3.90 3.20 145
$4.40 0.75 280
$5.00 2.40 209

Solutions

Expert Solution

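Using Excel (Data > Data Analysis > Regression) with shipping cost as the dependent variable and package weight as the independent variable, we obtain: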

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.742802
R Square            0.551755
Adjusted R Square   0.539959
Standard Error      3.354405
Observations        40

ANOVA
             df   SS         MS         F          Significance F
Regression    1   526.3139   526.3139   46.77501   4.06E-08
Residual     38   427.5771   11.25203
Total        39   953.891

                 Coefficients   Standard Error   t Stat     P-value    Lower 95%   Upper 95%
Intercept        0.781701       0.994461         0.786055   0.43671    -1.23148    2.794882
Package weight   1.472373       0.215284         6.839226   4.06E-08   1.036554    1.908192

a. The estimated simple linear regression model for shipping cost on package weight is

cost = 0.7817 + 1.4724 * Package weight

For each one-pound increase in package weight, the estimated shipping cost increases by about $1.47.

R² = 0.5518, so about 55.18% of the variation in shipping cost is explained by package weight.
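For readers without Excel, the least-squares line and R² in part a can be reproduced with the closed-form formulas. The following is a minimal sketch using only the Python standard library, not the method the solution itself used. The function name `fit_simple` is hypothetical, and the demo uses only the first five rows of the data table, so its output will not match the 40-observation Excel result unless all rows are supplied.

```python
def fit_simple(x, y):
    """Return (intercept, slope, r_squared) for the least-squares line y = b0 + b1*x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of squares and cross-products around the means
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # R^2 = 1 - SSE/SST
    sst = sum((yi - y_bar) ** 2 for yi in y)
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    return intercept, slope, 1 - sse / sst


# Illustrative subset: first five rows of the data table (weight in lb, cost in $).
weight = [4.10, 0.30, 5.10, 5.90, 4.50]
cost = [3.30, 2.00, 11.00, 2.60, 1.90]

b0, b1, r2 = fit_simple(weight, cost)
print(f"cost = {b0:.4f} + {b1:.4f} * weight, R^2 = {r2:.4f}")
```

Passing the full weight and cost columns instead of the subset should give coefficients comparable to the Excel output, assuming the listed rows are the ones the solution used.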

Using Excel (Data > Data Analysis > Regression) again, now with both package weight and distance shipped as independent variables, we obtain:

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.931476
R Square            0.867647
Adjusted R Square   0.860493
Standard Error      1.847203
Observations        40

ANOVA
             df   SS         MS         F          Significance F
Regression    2   827.6411   413.8206   121.2782   5.65E-17
Residual     37   126.2499   3.412159
Total        39   953.891

                   Coefficients   Standard Error   t Stat     P-value    Lower 95%   Upper 95%
Intercept          -5.08357       0.830332         -6.12233   4.29E-07   -6.76598    -3.40116
Package weight     1.347949       0.119289         11.29982   1.48E-13   1.106246    1.589653
Distance shipped   0.036372       0.00387          9.397333   2.43E-11   0.02853     0.044214

b. The estimated multiple regression model is

cost = -5.0836 + 1.3479 * Package weight + 0.0364 * Distance shipped
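A two-predictor model like this one can also be estimated without Excel by solving the normal equations (X'X)b = X'y. The sketch below is one way to do that with the standard library. It is an assumption-laden illustration, not the procedure Excel uses internally. The function name `fit_ols` is hypothetical, and the demo uses only the first six rows of the data table, so its coefficients will differ from the 40-observation output.

```python
def fit_ols(rows, y):
    """Return [b0, b1, ..., bk] solving the normal equations (X'X) b = X'y."""
    X = [[1.0] + [float(v) for v in r] for r in rows]  # prepend intercept column
    p = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [
        [sum(xi[i] * xi[j] for xi in X) for j in range(p)]
        + [sum(xi[i] * yi for xi, yi in zip(X, y))]
        for i in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][p] / A[i][i] for i in range(p)]


# Illustrative subset: first six rows (weight in lb, distance in miles) and cost in $.
predictors = [(4.10, 95), (0.30, 160), (5.10, 240), (5.90, 47), (4.50, 53), (3.50, 250)]
cost = [3.30, 2.00, 11.00, 2.60, 1.90, 8.00]

b0, b_weight, b_distance = fit_ols(predictors, cost)
print(f"cost = {b0:.4f} + {b_weight:.4f} * weight + {b_distance:.4f} * distance")
```

Solving the normal equations directly is adequate for a small, well-conditioned problem such as this; for larger or nearly collinear data, a QR-based routine from a numerical library is the safer choice.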

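Holding package weight constant, each additional mile shipped increases the estimated shipping cost by about $0.0364. Holding distance constant, each additional pound of package weight increases the estimated cost by about $1.35.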
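To see how the fitted equation is used, the sketch below plugs the first row of the data table (4.10 lb, 95 miles, actual cost $3.30) into the estimated coefficients from the Excel output. The function name `predict_cost` is hypothetical and introduced only for illustration.

```python
def predict_cost(weight_lb, distance_miles):
    """Fitted shipping cost ($) from the estimated multiple regression model."""
    return -5.08357 + 1.347949 * weight_lb + 0.036372 * distance_miles


actual = 3.30                      # first row of the data table
fitted = predict_cost(4.10, 95)    # about 3.90
residual = actual - fitted         # about -0.60

print(f"fitted = {fitted:.2f}, residual = {residual:.2f}")
```

The negative residual indicates that the model overestimates the cost of this particular package by roughly 60 cents.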

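R² = 0.8676, so about 86.76% of the variation in shipping cost is explained by package weight and distance shipped together.

The R² of this multiple regression model (0.8676) is substantially higher than that of the simple regression model in part a (0.5518). The adjusted R² also rises, from 0.5400 to 0.8605, so the improvement is not merely an artifact of adding a variable.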
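The R² and adjusted R² values for both models can be checked directly from the ANOVA sums of squares reported in the two Excel outputs, using R² = SSR/SST and adjusted R² = 1 - (1 - R²)(n - 1)/(n - k - 1). The helper name `r2_and_adjusted` below is hypothetical; the numbers are copied from the outputs.

```python
def r2_and_adjusted(ssr, sst, n, k):
    """Return (R^2, adjusted R^2) given regression SS, total SS, n observations, k predictors."""
    r2 = ssr / sst
    adj = 1 - (1 - r2) * (n - 1) / (n - k - 1)
    return r2, adj


sst = 953.891   # total sum of squares (same for both models)
n = 40          # observations reported by Excel

r2_simple, adj_simple = r2_and_adjusted(526.3139, sst, n, k=1)
r2_multi, adj_multi = r2_and_adjusted(827.6411, sst, n, k=2)

print(f"simple:   R^2 = {r2_simple:.4f}, adjusted = {adj_simple:.4f}")
print(f"multiple: R^2 = {r2_multi:.4f}, adjusted = {adj_multi:.4f}")
```

Because R² can never decrease when a predictor is added, the adjusted figure is the fairer basis for comparing the two models.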

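c. The F statistic for the overall model is 121.278, with a p-value (Significance F) of 5.65E-17.

Because this p-value is far below 0.01, we reject the null hypothesis that both slope coefficients are zero and conclude that the overall regression relationship is significant at the 0.01 level.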
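The F statistic is the ratio of the regression mean square to the residual mean square. The short sketch below recomputes it from the ANOVA table of the expanded model and applies the p-value decision rule. The p-value is taken from the Excel output rather than recomputed, since the standard library has no F distribution.

```python
# Values copied from the ANOVA table of the expanded model
ssr, df_reg = 827.6411, 2     # regression sum of squares and degrees of freedom
sse, df_res = 126.2499, 37    # residual sum of squares and degrees of freedom

msr = ssr / df_reg            # mean square regression
mse = sse / df_res            # mean square error
f_stat = msr / mse

significance_f = 5.65e-17     # p-value reported by Excel
alpha = 0.01
reject_h0 = significance_f < alpha

print(f"F = {f_stat:.3f}, reject H0 at alpha = {alpha}: {reject_h0}")
```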

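d. Both explanatory variables are significant at the 0.01 level because their p-values are less than 0.01: package weight (t = 11.30, p-value = 1.48E-13) and distance shipped (t = 9.40, p-value = 2.43E-11). In each case we reject the null hypothesis that the corresponding coefficient is zero.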
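Each t statistic is the coefficient divided by its standard error. The sketch below recomputes them from the coefficient table and applies the same p-value rule to every term. The p-values are again copied from the Excel output, and small differences from the reported t statistics come from rounding in the printed standard errors.

```python
# (coefficient, standard error, p-value) copied from the expanded model's output
terms = {
    "Intercept": (-5.08357, 0.830332, 4.29e-07),
    "Package weight": (1.347949, 0.119289, 1.48e-13),
    "Distance shipped": (0.036372, 0.00387, 2.43e-11),
}
alpha = 0.01

t_stats = {name: coef / se for name, (coef, se, _) in terms.items()}
significant = {name: p < alpha for name, (_, _, p) in terms.items()}

for name in terms:
    print(f"{name}: t = {t_stats[name]:.3f}, significant at {alpha}: {significant[name]}")
```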

