Question

In: Math

You are part of a team investigating the identifying motor vehicle accidents. A multiple regression model...

You are part of a team investigating the identifying motor vehicle accidents. A multiple regression model is to be constructed to predict the number of motor vehicle accidents in a town per year based upon the population of the town, the number of recorded traffic offenses per year and the average annual temperature in the town.

Data has been collected on 30 randomly selected towns:

Number of motor vehicle
accidents per year
Population
(× 1000)
No. of recorded
traffic offences
(× 100)
Average temperature
°F
355 181 29 78
490 257 56 82
597 441 34 81
475 50 95 81
922 495 102 82
736 38 165 81
305 167 25 84
1,128 378 191 78
745 369 86 76
476 237 63 84
143 100 4 84
203 118 21 79
909 489 106 78
410 210 39 77
642 138 131 81
847 308 138 82
604 418 40 77
719 194 132 78
350 319 8 84
327 70 61 76
1,038 259 192 78
756 299 115 81
635 440 40 79
796 283 131 85
301 64 56 81
135 26 26 79
639 31 150 81
325 210 13 77
441 43 98 79
522 370 26 82

a)Find the multiple regression equation using all three explanatory variables. Assume that X1 is population, X2 is number of recorded traffic offenses per year and X3 is average annual temperature. Give your answers to 3 decimal places.

y^ =  + population + no. traffic offences + average temp

b)At a level of significance of 0.05, the result of the F test for this model is that the null hypothesis isis not rejected.

For parts c) and d), using the data, separately calculate the correlations between the response variable and each of the three explanatory variables.

c)The explanatory variable that is most correlated with number of motor vehicle accidents per year is:

population
number of traffic offenses
average annual temperature

d)The explanatory variable that is least correlated with number of motor vehicle accidents per year is:

population
number of traffic offenses
average annual temperature

e)The value of R2 for this model, to 2 decimal places, is equal to

f)The value of se for this model, to 3 decimal places, is equal to

g)Construct a new multiple regression model by removing the variable average annual temperature. Give your answers to 3 decimal places.

The new regression model equation is:

y^ =  + population + no. traffic offences

h)In the new model compared to the previous one, the value of R2 (to 2 decimal places) is:

increased
decreased
unchanged

i)In the new model compared to the previous one, the value of se (to 3 decimal places) is:

increased
decreased
unchanged

Solutions

Expert Solution

All analysis is carried out in excel software.

a) The multiple regression equation for three explanatory variables is describe as

where y is number of motor vehicle accidents, x1 is population, x2 is number of recorded traffic offenses per year and x3 is average annual temperature.

The resulted outcome is using OLS method under excel command,

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.997319
R Square 0.994646
Adjusted R Square 0.994028
Standard Error 20.08388
Observations 30
ANOVA
df SS MS F Significance F
Regression 3 1948343 649447.6 1610.085 1.24E-29
Residual 26 10487.42 403.3623
Total 29 1958830
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 236.3423 116.8627 2.022394 0.053535 -3.87235 476.5569
x1 0.989568 0.025325 39.07436 1.27E-24 0.937512 1.041625
x2 3.800536 0.066973 56.74689 8.68E-29 3.66287 3.938202
x3 -2.52262 1.44612 -1.74441 0.092901 -5.49516 0.449922

b) At a level of significance of 0.05, the result of the F test for this model is 1610.085 and p-value is less than 0.05. We concluded that the null hypothesis is rejected.

c)

Corr(y,x1) = 0.5697

Corr(y,x2) = 0.8234

Corr(y,x3) = -0.1057

The explanatory variable that is most correlated with number of motor vehicle accidents per year is number of traffic offenses .

d) The explanatory variable that is least correlated with number of motor vehicle accidents per year is average annual temperature

e)

e)The value of R2 for this model is equal to 0.9946

f)The value of se for this model is equal to 20.083

g)Construct a new multiple regression model by removing the variable average annual temperature.

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.997005
R Square 0.994019
Adjusted R Square 0.993576
Standard Error 20.82985
Observations 30
ANOVA
df SS MS F Significance F
Regression 2 1947115 973557.7 2243.827 9.68E-31
Residual 27 11714.83 433.8827
Total 29 1958830
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 33.052 9.02213 3.663436 0.00107 14.54011 51.56388
x1 0.991293 0.026246 37.76948 6.56E-25 0.937441 1.045145
x2 3.808852 0.069285 54.97381 2.96E-29 3.666691 3.951013

The new regression model equation is:

i) In the new model compared to the previous one, the value of se increased.


Related Solutions

The multiple regression model is estimated in Excel and part of the output is provided below....
The multiple regression model is estimated in Excel and part of the output is provided below. ANOVA df SS MS F Significance F Regression 3 3.39E+08 1.13E+08 1.327997 0.27152899 Residual 76 6.46E+09 85052151 Total 79 6.8E+09 Question 8 (1 point) Use the information from the ANOVA table to complete the following statement. To test the overall significance of this estimated regression model, the hypotheses would state there is    between attendance and the group of all explanatory variables, jointly. there is...
(12 pts) Imagine that you are part of an international medical team. You are investigating a...
(12 pts) Imagine that you are part of an international medical team. You are investigating a strange virus that has broken out in the US. You have samples of the virus from January when it was first detected and samples taken from patients in May. The virus uses a specific protein to attach to a receptor on lung cells and infect the people. You notice the that the attachment protein in the January samples has a different amino acid sequence...
Discuss the underlying assumptions of a simple linear regression model; multiple regression model; and polynomial regression.
Discuss the underlying assumptions of a simple linear regression model; multiple regression model; and polynomial regression.
Use the regression equation to predict the number of motor vehicle fatalities in a western state...
Use the regression equation to predict the number of motor vehicle fatalities in a western state with a population of 4 million people (40 hundred thousand people). Does this answer seem reasonable based on the given data? Why or why not? Population (hundreds of thousands) = 3.3277772 + 0.054110317 Motor Vehicle Fatalities Motor Vehicle Fatalities Population (hundreds of thousands) 87 7 1117 50 742 47 119 13 264 14 270 9 381 24 449 19 436 36 328 25 659...
1. Distinguish between a bivariate regression model and a multiple regression.
1. Distinguish between a bivariate regression model and a multiple regression.
A multiple regression model is to be constructed to model the time spent using the internet...
A multiple regression model is to be constructed to model the time spent using the internet per week among internet users. The explanatory variables are age, hours spent working per week and annual income. Data has been collected on 30 randomly selected individuals: Time using internet (minutes) Age Hours working per week Annual income ('000) 140 56 39 28 257 35 31 79 163 35 35 34 115 33 52 27 182 45 36 37 214 51 57 80 187...
The Environmental Protection agency requires that the exhaust of each model of motor vehicle be tested...
The Environmental Protection agency requires that the exhaust of each model of motor vehicle be tested for the level of several pollutants. The level of oxides of nitrogen (NOX) in the exhaust of one light truck model was found to vary among individually trucks according to a Normal distribution with mean 1.45 grams per mile driven and standard deviation 0.40 grams per mile. (a) What is the 30th percentile for NOX exhaust, rounded to four decimal places? (b) Find the...
The Environmental Protection agency requires that the exhaust of each model of motor vehicle be tested...
The Environmental Protection agency requires that the exhaust of each model of motor vehicle be tested for the level of several pollutants. The level of oxides of nitrogen (NOX) in the exhaust of one light truck model was found to vary among individually trucks according to a Normal distribution with mean 1.45 grams per mile driven and standard deviation 0.40 grams per mile. (a) What is the 66th percentile for NOX exhaust, rounded to four decimal places? (b) Find the...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears that the calculation of the coefficient of determination, R2, for this model can be accomplished by using the squared sample correlation coefficient between the original values and the predicted values of the dependent variable of this model. Is this statement true? If yes, why? If not, why not? Please use either matrix algebra or algebra to support your reasoning.
Which of the following is NOT a required assumption for the multiple regression model? a The...
Which of the following is NOT a required assumption for the multiple regression model? a The error/randomness in attendance is independent from one game to the next. b The error term has a constant variance for all possible values of Temp, Win%, and OpWin%. c The relationship between Attendance and the slope/intercept parameters is linear. d The variable Temp has a normal distribution.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT