Question

In: Statistics and Probability

Use the CO2 data and via Multiple regression select the two variables that predict the CO2...

Use the CO2 data and via Multiple regression select the two variables that predict the CO2 level with the best P-value. Make another table with these two variables and answer the questions. Numerical answers are rounded so choose the answer that matches the best:

Hour CO Traffic Wind

1 2.4 50 -0.2

2 1.7 26 0.0

3 1.4 16 0.0

4 1.2 10 0.0

5 1.2 12 0.1

6 2.0 41 -0.1

7 3.4 157 -0.1

8 5.8 276 -0.2

9 6.8 282 0.2

10 6.6 242 1.0

11 6.3 200 2.3

12 5.8 186 3.8

13 5.5 179 4.6

14 5.9  178 5.4

15 6.8 203 5.9

16 7.0 264 5.9

17 7.4 289 5.6

18 7.4 308 4.9

19 6.4 267 3.8

20 5.0 190 2.5

21 3.8  125 1.4

22 3.5 120 0.6

23 3.3 116 0.4

24 3.1 87 0.1

Answer the questions for Assessment:

13. What are the two selected variables? a. Hour and Traffic b. Wind and Traffic c. Hour and Wind

14. Which of the variables has a better P-value and what is this P-value? (Note: numbers are truncated.) a. Traffic; 0.018 b. Traffic; 6.85E-12 c. Wind; 0.0056 d. Wind; 0.174

15. Based on the table, how would you characterize the Regression fit? a. Poor b. Good c. Excellent

16. What is another name for the coefficient 1.274461 and what is its interpretation based on the data? a. The X-intercept; when the average weekday traffic density and the perpendicular wind-speed component are zero.

b. The slope of an average summer weekday's CO2 concentration. It is how much the CO2 concentration will increase when both the average weekday traffic density and the perpendicular wind-speed component increase by 1 unit.

c. The Y-intercept; It is how much the CO2 concentration will increase when both the average weekday traffic density and the perpendicular wind-speed component are zero.

d. The Y-intercept; it is the average summer weekday CO2 concentration when the average weekday traffic density and the perpendicular wind-speed component are zero.

e. None of these

Solutions

Expert Solution

I used R software to solve this question.

R codes and output:

d=read.table('co2.csv',header=TRUE, sep=',')
> head(d)
Hour CO Traffic Wind
1 1 2.4 50 -0.2
2 2 1.7 26 0.0
3 3 1.4 16 0.0
4 4 1.2 10 0.0
5 5 1.2 12 0.1
6 6 2.0 41 -0.1
> attach(d)
> fit=lm(CO~Hour+Traffic+Wind)
> summary(fit)

Call:
lm(formula = CO ~ Hour + Traffic + Wind)

Residuals:
Min 1Q Median 3Q Max
-0.67461 -0.31714 -0.04293 0.26546 0.91662

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 1.305220 0.206791 6.312 3.68e-06 ***
Hour -0.005669 0.014552 -0.390 0.700976
Traffic 0.018040 0.001205 14.976 2.48e-12 ***
Wind 0.231560 0.050749 4.563 0.000189 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 0.4346 on 20 degrees of freedom
Multiple R-squared: 0.9648, Adjusted R-squared: 0.9595
F-statistic: 182.9 on 3 and 20 DF, p-value: 1.056e-14

Que.13

Wind and Traffic

Because their p-values are less than 0.05.

Que.14

Traffic; 6.85E-12

Que.15

Excellent

Because test for overall significance is F-statistic: 182.9 on 3 and 20 DF, p-value: 1.056e-14

The p-value is less than 0.05, hence model is statistically significant. and it will explain 96.48% variation in CO2.

Que.16

The Y-intercept; it is the average summer weekday CO2 concentration when the average weekday traffic density and the perpendicular wind-speed component are zero.


Related Solutions

Use the HousePrice data and via multiple regression select the two variables that predict the house...
Use the HousePrice data and via multiple regression select the two variables that predict the house selling price the best. Make another table with these two variables and answer the questions. Numerical answers are rounded so choose the answer that matches the best: 9. Identify the negative coefficient. What is its value and what is the interpretation of this number? (Choose the most appropriate answer. Note: numbers are truncated.) a. -0.037; This is a negative number and serves no statistical...
Use the following data to develop a multiple regression model to predict from and . Discuss...
Use the following data to develop a multiple regression model to predict from and . Discuss the output, including comments about the overall strength of the model, the significance of the regression coefficients, and other indicators of model fit. y x1 x2 198 29 1.64 214 71 2.81 211 54 2.22 219 73 2.70 184 67 1.57 167 32 1.63 201 47 1.99 204 43 2.14 190 60 2.04 222 32 2.93 197 34 2.15 Appendix A Statistical Tables *(Round...
Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data.
Quarter Year 1 Year 2 Year 3 1 3 6 8 2 2 4 8 3 4 7 9 4 6 9 11 (b) Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data. Qtr1 = 1 if Quarter 1, 0 otherwise; Qtr2 = 1 if Quarter 2, 0 otherwise; Qtr3 = 1 if Quarter 3, 0 otherwise. If required, round your answers to three decimal places. For...
Use Excel to develop a multiple regression model to predict Cost of Materials by Number of...
Use Excel to develop a multiple regression model to predict Cost of Materials by Number of Employees, New Capital Expenditures, Value Added by Manufacture, and End-of-Year Inventories. Locate the observed value that is in Industrial Group 12 and has 7 employees. Based on the model and the multiple regression output, what is the corresponding residual of this observation? Write your answer as a number, round to 2 decimal places. SIC Code No. Emp. No. Prod. Wkrs. Value Added by Mfg....
b) Use a multiple regression model with dummy variables as follows to develop an equation to...
b) Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data: Qtr1 = 1 if Quarter 1, 0 otherwise; Qtr2 = 1 if Quarter 2, 0 otherwise; Qtr3 = 1 if Quarter 3, 0 otherwise. If required, round your answers to three decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank (Example: -300). If the constant...
Using the International Stock Market data in below, conduct a stepwise multiple regression procedure to predict...
Using the International Stock Market data in below, conduct a stepwise multiple regression procedure to predict DJIA by the Nasdaq, the S&P 500, the Nikkei, the Hang Seng, the FTSE 100, and the IPC. Develop a correlation matrix and discuss the correlations among all variables. ( please show step by step about doing correlation in excel and discuss) Perform a stepwise regression procedure (use α = 0.1) to select predictor variables for the model. Summarize the results (with t-values and...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales (Millions) Trend 2014 1 480.0 1 2014 Q2 864.0 2 2014 Q3 942.0 3 2014 Q4 1,100.0 4 2015 Q1 1,200.0 5 2015 Q2 1,900.0 6 2015 Q3 1,900.0 7 2015 Q4 1,300.0 8 2016 Q1 1,200.0 9 2016 Q2 1,500.0 10 2016 Q3 1,200.0 11 2016 Q4 500.0 12 2017 Q1 356.0 13 2017 Q2 1,300.0 14 2017 Q3 1,000.0 15 2017 Q4...
The standard project is to use multiple regression analysis to analyze a data set. The data...
The standard project is to use multiple regression analysis to analyze a data set. The data set is a study of student persistent enrolling in the next semester based on Gender, Age, GPA, a 22 questionnaire on self-efficacy, and student enrollment status. The educational researcher wants to study the relationship between student enrollment status as it relates to gender, age, GPA, and the total response to a 22 questionnaire survey. a. The estimated multiple regression analysis equation. b. Does the...
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. 1. If an incoming student has Letters = 7 and SAT = 1000 what would his predicted College GPA be? a. 1.99 b. 2.01 c. ​​2.18 d. 2.39 2. What is the approximate error of this prediction? a. 0.58 b. ​​0.61 c....
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. C ollege GPA HighSchl GPA SAT Letters 2.04 2.01 1070 5 2.56 3.4 1254 6 3.75 3.68 1466 6 1.1 1.54 706 4 3 3.32 1160 5 0.05 0.33 756 3 1.38 0.36 1058 2 1.5 1.97 1008 7 1.38 2.03 1104...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT