Question

In: Statistics and Probability

LAB9B.DAT 1 10 0.802 2 12 2.176 3 7 0.261 4 13 1.618 5 13 2.033...

LAB9B.DAT

1 10 0.802
2 12 2.176
3 7 0.261
4 13 1.618
5 13 2.033
6 15 4.094
7 15 2.201
8 8 0.902
9 8 1.185
10 12 1.734
11 3 0
12 10 1.477
13 19 3.801
14 13 1.732
15 14 2.747
16 10 0.304
17 7 1.627
18 11 1.726
19 0 0
20 12 2.222
21 14 2.908
22 12 2.261
23 11 0.972
24 13 1.779
25 10 1.194
26 15 3.447
27 14 2.496
28 8 1.015
29 17 4.558
30 6 0
31 8 2.081
32 10

1.758

■ Regression analysis, where one variable depends on another, can be used to predict levels of a dependent variable for specified levels of an independent variable. Use the EXCEL REGRESSION command to calculate the intercept and slope of the leastsquares line, as well as the analysis of variance associated with that line. Fill in the following table and use the results to answer the next few questions. Carefully choose your independent and dependent variables and input them correctly using EXCEL’s regression command. In this example, the percentage of drivers under the age of 21 affects the number of Fatals/1000 licenses.

The regression equation (leastsquares line) is

Fatals/1000 licenses = + % under 21

(intercept) (slope)

10. What is the estimated increase in number of fatal accidents per 1000 licenses due to a one percent increase in the percentage of drivers under 21 (i.e. the slope)?

11. What is the standard deviation of the estimated slope?

12. What is the estimated number of fatal accidents per 1000 licenses if there were no drivers under the age of 21 (i.e. the y intercept)?

13. What percentage of the variation in accident fatalities can be explained by the linear relationship with drivers under 21 (i.e. 100 ◊ the unadjusted coefficient of determination)?

Note about data:

In a study of the role of young drivers in automobile accidents, data on percentage of licensed drivers under the age of 21 and the number of fatal accidents per 1000 licenses were determined for 32 cities. The data are stored in Table B. The first column contains a number as the city code, the second column contains the percentage of drivers who are under 21, and the third column contains the number of fatal accidents per 1000 drivers. The primary interest is whether or not the number of fatal accidents is dependent upon the proportion of licensed drivers that are under 21.

I keep putting the values in Excel and my answers are wrong.

Solutions

Expert Solution

Hey!

firstly check you have data analysis add-in in your excel to do this, hence with your data, select the column 2 as the range of "y" and select column 3 as the range for x.

the following will the excel output

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.946472519
R Square 0.895810229
Adjusted R Square 0.863552165
Standard Error 3.804246998
Observations 32
ANOVA
df SS MS F Significance F
Regression 1 3857.358848 3857.358848 266.5340078 1.80762E-16
Residual 31 448.6411519 14.47229522
Total 32 4306
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 0 #N/A #N/A #N/A #N/A #N/A #N/A #N/A
X Variable 1 5.188255281 0.317793509 16.32586928 8.9417E-17 4.540111147 5.836399415 4.540111147 5.836399415

now to answer the questions

10) 5.188255281 is the number of increase in accidents for 1% increase in the number of drivers under 21

11) Standard error is defined as the standard deviation of the statistic and here statistic is the coefficient so hence standard deviation of your coefficient is 0.317793509

12) intercept is 0. i.e no accidents occur naturally.

13) The % of variation in "Y" explained by the given "X" is given by Rsquare value. hence here 89.581% of variation in "Y" is being explained by "X"

Y here is the no of fatal accidents every 1000 liecenses

X is the percentage of drivers under age 21.


Related Solutions

5.) For the data set 2 4 4 5 7 8 9 10 12 12 13...
5.) For the data set 2 4 4 5 7 8 9 10 12 12 13 13 16 16 16 16 17 19 19 20 23 24 24 24 25 26 26 27 28 28 29 31 32 34 34 36 37 38 42 44 45 46 47 47 48 50 52 53 53 54 55 56 56 57 58 (a) Find the 80th percentile. The 80t percentile is =    (a) Find the 42nd percentile. The 42nd percentile is...
Match No. 1 2 3 4 5 6 7 8 9 10 11 12 13 14...
Match No. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Player A 8 42 56 68 91 123 12 46 57 137 5 80 14 10 19 Player B 38 44 46 59 57 61 48 42 51 39 58 41 55 45 68 1. For the given data set representing the runs scored by two players in last 15 matches, conduct the following analysis: i. Which average you will use to summarize...
Consider the data. xi 1 2 3 4 5 yi 3 7 5 10 13 (a)...
Consider the data. xi 1 2 3 4 5 yi 3 7 5 10 13 (a) Compute the mean square error using equation s2 = MSE = SSE n − 2  . (Round your answer to two decimal places.) (b) Compute the standard error of the estimate using equation s = MSE = SSE n − 2  . (Round your answer to three decimal places.) (c) Compute the estimated standard deviation of b1 using equation sb1 = s Σ(xi −...
Day 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15...
Day 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Number of Aides Absent 5 8 11 15 4 2 7 1 4 6 14 19 3 5 8 In which of the following ranges you can find the Upper Control Limit of the control chart? 0.1427 0.1536 0.1677 Not computable with information available In which of the following ranges you can find the Lower Control Limit of the control chart? Does not exit...
student 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15...
student 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Test score 67 67 87 89 87 77 73 74 68 72 58 98 98 70 77 Above we have the final averages of the last stats and I want to know if the class average falls within the boundaries of all my statistics classes for the past 20 years. Find the sample size, mean, and standard deviation of the data above (Table 1)....
Company ID 1 2 3 4 5 6 7 8 9 10 11 12 13 14...
Company ID 1 2 3 4 5 6 7 8 9 10 11 12 13 14 1 5330 6352 6340 3399 6566 7376 6882 1842 7362 8205 6222 7695 1681 2283 2 7972 5941 2861 3290 1019 7399 6442 8003 5092 4634 7569 2538 2619 7932 3 8545 6667 2121 2339 4770 7296 3215 7919 2176 9129 1321 695 7945 1673 4 4578 466 4142 8350 7439 6085 8381 5872 2703 6974 702 5846 7176 9179 5 7302 2727 3103...
Consider a sample with 10 observations of 2, 3, 10, 13, 12, 5, –1, 10, 2,...
Consider a sample with 10 observations of 2, 3, 10, 13, 12, 5, –1, 10, 2, and 12. Use z-scores to determine if there are any outliers in the data; assume a bell-shaped distribution. (Round your answers to 2 decimal places. Negative values should be indicated by a minus sign.) The z-score for the smallest observation The z-score for the largest observation There are in the data. Consider the following data for two investments, A and B: Investment A: x¯x¯...
Promotional expenses(x) Sales(y 7 12 10 14 9 13 4 5 11 15 5 7 3...
Promotional expenses(x) Sales(y 7 12 10 14 9 13 4 5 11 15 5 7 3 4 a) draw the scatter plot and draw the line of best fit b) Calculate and interpret the correlation between promotional expenses and sales C) Calculate the regression equation( calculate the slope and intercept of the regression line d)Interpret the slop coefficient of regression equation e)Using the regression equation calculate the sales volume with respect to promotional expense of 4. f) Obtain the coefficient...
A=[ 7 8 -2 -6 7 4 1 ; 2 4 -4 -13 9 9 -12...
A=[ 7 8 -2 -6 7 4 1 ; 2 4 -4 -13 9 9 -12 ; 6 6 0 -9 8 9 -4 ; 1 8 -14 -22 5 8 -1 ; 4 9 -10 -14 7 4 -1] B=[ 19 4 4 14 -3 -7 -5 ; 21 -6 -5 10 14 -2 4 ; 22 -4 5 13 5 -6 4 ; 41 20 0 26 11 -1 -27 ; 29 14 -2 20 3 -4 -19]...
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 13 (a)...
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 13 (a) Compute the mean square error using equation  s2 = MSE = SSE n − 2  . (Round your answer to two decimal places.) (b) Compute the standard error of the estimate using equation s = MSE = SSE n − 2  . (Round your answer to three decimal places.) (c) Compute the estimated standard deviation of b1 using equation sb1 = s Σ(xi − x)2...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT