Question

In: Statistics and Probability

Use the HousePrice data and via multiple regression select the two variables that predict the house...

Use the HousePrice data and via multiple regression select the two variables that predict the house selling price the best. Make another table with these two variables and answer the questions. Numerical answers are rounded so choose the answer that matches the best:

9. Identify the negative coefficient. What is its value and what is the interpretation of this number? (Choose the most appropriate answer. Note: numbers are truncated.)

  • a. -0.037; This is a negative number and serves no statistical interpretation in this case.
  • b. -0.037; This is how much the selling price of the house will decrease in hundred thousand as the house age by a year.
  • c. -0.077; This show how much selling price correlates with the age of the house
  • d. -1.92; For each additional bathroom the selling price of the house decrease by 1.92 hundred thousand dollars.

10. Which of the two variables has better P-value and what is this P-value? (Note: numbers are truncated.)

  • a. Age has the best P-value; P-value = 0.067
  • b. The higher the P-value, the better; so a P-value = 1.08E-09 is very good.
  • c. #Bathroooms has the best P-value; P-value = 1.08E-09
  • d. All P-values are poor since they are allhigher 5%.

11. Using this second table predict the selling price of a housethat is 10 years old, has 2 bathrooms and 3 rooms.

  • a. 10.5 hundred thousands
  • b. 12.57 hundred thousands
  • c. 9.06 hundred thousands
  • d. 11.58 hundred thousands

12. Based on the table would you characterize the Regression fit and the prediction as Poor, Good, Very Good, or Excellent?

  • a. Poor
  • b. Good
  • c. Very Good
  • d. Excellent
Age     #Bathrooms      #Rooms  #BedRooms       #FirePlaces     sellingPrice in $100000
42      1       7       4       0       4.9176
62      1       7       4       0       5.0208
40      1       6       3       0       4.5429
54      1       6       3       0       4.5573
42      1       6       3       0       5.0597
56      1       6       3       0       3.891
51      1       7       3       1       5.898
32      1       6       3       0       5.6039
32      1       6       3       0       5.8282
30      1       6       3       0       5.3003
30      1       5       2       0       6.2712
32      1       6       3       0       5.9592
32      1       6       3       0       5.6039
50      1.5     8       4       0       8.2464
17      1.5     6       3       0       7.7841
23      1       7       3       0       9.0384
22      1.5     6       3       0       7.5422
44      1.5     6       3       0       6.0931
3       1       7       3       0       8.14
31      1.5     8       4       0       9.1416
42      2.5     10      5       1       16.4202
14      2.5     9       5       1       14.4598
46      1       5       2       1       5.05
22      1.5     7       3       1       6.6969
40      1       6       3       1       5.9894
50      1.5     8       4       1       8.7951
48      1.5     8       4       1       8.3607
30      1.5     6       3       1       12

Solutions

Expert Solution

Regression with all the variables.
Put the data in excel and using the regression under data analysis tab, we input the values


The output is as follows.


Regression with the best two variable
We select the variable based on the pvalue.

For each beta coefficient we test the following hypothesis.

Next we check the pvalue for the variable in the regression output and check if the pvalue is less than 0.05, if it is less than 0.05, then we reject the null hypothesis and conclude that the variable is significant.

We find that only Age and #Bathroom are significant with a pvalue less than 0.05
Hence only these two are select in the next model.

Screenshots of the inputs and the output are shown below.


9. Identify the negative coefficient. What is its value and what is the interpretation of this number? (Choose the most appropriate answer. Note: numbers are truncated.)
b. -0.037; This is how much the selling price of the house will decrease in hundred thousand as the house age by a year.

10. Which of the two variables has better P-value and what is this P-value? (Note: numbers are truncated.)
c. #Bathroooms has the best P-value; P-value = 1.08E-09

11. Using this second table predict the selling price of a housethat is 10 years old, has 2 bathrooms and 3 rooms.
b. 12.57 hundred thousands
Explaination :
Regression equation
selling price = 0.98872665-0.0369(Age)+ 5.9745(#Bathrooms)
selling price = 0.98872665-0.0369(10)+ 5.9745(2) =12.5684


12. Based on the table would you characterize the Regression fit and the prediction as Poor, Good, Very Good, or Excellent?
c. Good
Fit of the regression line is determined by the coefficient of determination.

Coefficient of determination(rsqaure) = 0.805635082

It is the measure of the amount of variability in y explained by x. Its value lies between 0 and 1. Greater the value, better is the model. In this case, it 80.56%, hence the model is good


Related Solutions

Use the CO2 data and via Multiple regression select the two variables that predict the CO2...
Use the CO2 data and via Multiple regression select the two variables that predict the CO2 level with the best P-value. Make another table with these two variables and answer the questions. Numerical answers are rounded so choose the answer that matches the best: Hour CO Traffic Wind 1 2.4 50 -0.2 2 1.7 26 0.0 3 1.4 16 0.0 4 1.2 10 0.0 5 1.2 12 0.1 6 2.0 41 -0.1 7 3.4 157 -0.1 8 5.8 276 -0.2...
Use the following data to develop a multiple regression model to predict from and . Discuss...
Use the following data to develop a multiple regression model to predict from and . Discuss the output, including comments about the overall strength of the model, the significance of the regression coefficients, and other indicators of model fit. y x1 x2 198 29 1.64 214 71 2.81 211 54 2.22 219 73 2.70 184 67 1.57 167 32 1.63 201 47 1.99 204 43 2.14 190 60 2.04 222 32 2.93 197 34 2.15 Appendix A Statistical Tables *(Round...
1.Develop a multiple linear regression model to predict the price of a house using the square...
1.Develop a multiple linear regression model to predict the price of a house using the square feet of living area, number of bedrooms, and number of bathrooms as the predictor variables     Write the reqression equation.      Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for each independent variable using the appropriate regression statistics at a 95% level of confidence....
Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data.
Quarter Year 1 Year 2 Year 3 1 3 6 8 2 2 4 8 3 4 7 9 4 6 9 11 (b) Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data. Qtr1 = 1 if Quarter 1, 0 otherwise; Qtr2 = 1 if Quarter 2, 0 otherwise; Qtr3 = 1 if Quarter 3, 0 otherwise. If required, round your answers to three decimal places. For...
Use Excel to develop a multiple regression model to predict Cost of Materials by Number of...
Use Excel to develop a multiple regression model to predict Cost of Materials by Number of Employees, New Capital Expenditures, Value Added by Manufacture, and End-of-Year Inventories. Locate the observed value that is in Industrial Group 12 and has 7 employees. Based on the model and the multiple regression output, what is the corresponding residual of this observation? Write your answer as a number, round to 2 decimal places. SIC Code No. Emp. No. Prod. Wkrs. Value Added by Mfg....
b) Use a multiple regression model with dummy variables as follows to develop an equation to...
b) Use a multiple regression model with dummy variables as follows to develop an equation to account for seasonal effects in the data: Qtr1 = 1 if Quarter 1, 0 otherwise; Qtr2 = 1 if Quarter 2, 0 otherwise; Qtr3 = 1 if Quarter 3, 0 otherwise. If required, round your answers to three decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank (Example: -300). If the constant...
Using the International Stock Market data in below, conduct a stepwise multiple regression procedure to predict...
Using the International Stock Market data in below, conduct a stepwise multiple regression procedure to predict DJIA by the Nasdaq, the S&P 500, the Nikkei, the Hang Seng, the FTSE 100, and the IPC. Develop a correlation matrix and discuss the correlations among all variables. ( please show step by step about doing correlation in excel and discuss) Perform a stepwise regression procedure (use α = 0.1) to select predictor variables for the model. Summarize the results (with t-values and...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales (Millions) Trend 2014 1 480.0 1 2014 Q2 864.0 2 2014 Q3 942.0 3 2014 Q4 1,100.0 4 2015 Q1 1,200.0 5 2015 Q2 1,900.0 6 2015 Q3 1,900.0 7 2015 Q4 1,300.0 8 2016 Q1 1,200.0 9 2016 Q2 1,500.0 10 2016 Q3 1,200.0 11 2016 Q4 500.0 12 2017 Q1 356.0 13 2017 Q2 1,300.0 14 2017 Q3 1,000.0 15 2017 Q4...
The standard project is to use multiple regression analysis to analyze a data set. The data...
The standard project is to use multiple regression analysis to analyze a data set. The data set is a study of student persistent enrolling in the next semester based on Gender, Age, GPA, a 22 questionnaire on self-efficacy, and student enrollment status. The educational researcher wants to study the relationship between student enrollment status as it relates to gender, age, GPA, and the total response to a 22 questionnaire survey. a. The estimated multiple regression analysis equation. b. Does the...
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. 1. If an incoming student has Letters = 7 and SAT = 1000 what would his predicted College GPA be? a. 1.99 b. 2.01 c. ​​2.18 d. 2.39 2. What is the approximate error of this prediction? a. 0.58 b. ​​0.61 c....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT