Question

In: Statistics and Probability

Price Rooms Neighborhood 309.6 7 0 307.4 8 0 340.3 9 0 346.5 12 0 298.2...

Price Rooms Neighborhood
309.6 7 0
307.4 8 0
340.3 9 0
346.5 12 0
298.2 6 0
337.8 9 0
324.1 10 0
313.2 8 0
327.8 9 0
325.3 8 0
308.5 6 1
381.3 13 1
337.4 10 1
346.2 10 1
342.4 9 1
323.7 8 1
329.6 8 1
343.6 9 1
360.7 11 1
348.3 9 1
  1. Predict the selling price for a house with nine rooms that is located in the east-side neighborhood. Construct a 95% confidence interval estimate and 95% prediction interval.
  2. Perform a residual analysis on the results and determine whether the regression assumptions are valid.
  3. Is there a significant relationship between selling price and the two independent variables (rooms and neighborhood) at the 0.05 level of significance?
  4. At the 0.05 level of significance, determine whether each independent variable makes a contribution to the regression model. Indicate the most appropriate regression model for this set of data.
  5. Construct and interpret a 95% confidence interval estimate of the population slope for the relationship between selling price and number of rooms.
  6. Construct and interpret a 95% confidence interval estimate of the population slope for the relationship between selling price and neighborhood.
  7. Compute and interpret the adjusted r2.
  8. (omit…)
  9. What assumption do you need to make about the slope of selling price with number of rooms.
  10. Add an interaction term to the model and, at the 0.05 level of significance, determine whether it makes a significant contribution to the model.
  11. On the basis of the results of (f) and (l), which model is most appropriate? Explain.

Solutions

Expert Solution

[The variable "Neighborhood" has not been properly defined. I am assuming that Neighborhood = 1 for East-side.]

(a)

0.867
Adjusted R² 0.851 n   20
R   0.931 k   2
Std. Error   7.739 Dep. Var. Price
ANOVA table
Source SS   df   MS F p-value
Regression 6,635.4808 2   3,317.7404 55.39 3.58E-08
Residual 1,018.2487 17   59.8970
Total 7,653.7295 19  
Regression output confidence interval
variables coefficients std. error    t (df=17) p-value 95% lower 95% upper std. coeff.
Intercept 243.7371 0.000
Rooms 9.2189 1.0296 8.954 7.62E-08 7.0466 11.3913 0.809
Neighborhood 12.6967 3.5354 3.591 .0023 5.2378 20.1557 0.325
Predicted values for: Price
95% Confidence Interval 95% Prediction Interval
Rooms Neighborhood Predicted lower upper lower upper Leverage
9 1 339.4043 334.1998 344.6088 322.2664 356.5422 0.102

Predicted price when Rooms = 9 and Neighborhood = 1 is 339.4043

The 95% confidence interval is [334.1998, 344.6088]

The 95% prediction interval is [322.2664, 356.5422]

(b)

The residuals show a linear trend so the normality condition is valid.

(c)

The global p- values as well as the p- values for the two independent variables separately are < 0.05. So, there a significant relationship between selling price and the two independent variables (rooms and neighborhood) at the 0.05 level of significance.

(d)

Yes, bot independent variables are significant in this regression, so both should be retained.

The best-fit model equation is Price = 243.7371 + 9.2189 Rooms + 12.6967 Neighborhood.


Related Solutions

(9) Diagonalizing 4 0 1 -7 -5 5. -12 -6 7
(9) Diagonalizing 4 0 1 -7 -5 5. -12 -6 7
A=[ 7 8 -2 -6 7 4 1 ; 2 4 -4 -13 9 9 -12...
A=[ 7 8 -2 -6 7 4 1 ; 2 4 -4 -13 9 9 -12 ; 6 6 0 -9 8 9 -4 ; 1 8 -14 -22 5 8 -1 ; 4 9 -10 -14 7 4 -1] B=[ 19 4 4 14 -3 -7 -5 ; 21 -6 -5 10 14 -2 4 ; 22 -4 5 13 5 -6 4 ; 41 20 0 26 11 -1 -27 ; 29 14 -2 20 3 -4 -19]...
Table: x: 6, 2, 15, 9, 12, 5, 8 y: 8, 9, 4, 7, 6, 9,...
Table: x: 6, 2, 15, 9, 12, 5, 8 y: 8, 9, 4, 7, 6, 9, 8 1) Make the scatter diagram. 2) Find the equation of the regression line using the formulas given in class. They must appear the intermediate steps of substitutions. Round the answers to three decimal places. 3) Find the correlation coefficient using the formulas given in class. Steps should appear intermediates of substitutions. Round the answer to three decimal places. 4) Find the value predicted...
The following sample was taken: 1, 9, 11, 3, 12, 15, 16, 8, 7, 8, 12,...
The following sample was taken: 1, 9, 11, 3, 12, 15, 16, 8, 7, 8, 12, 3, 2, 6, 6, 7, 8, 10, 9, 5, 6, 7, 8, 6, 8, 7 a. Calculate the average b. Estimate the variance c. Build a 95% confidence interval for the standard deviation. Please, give your answer by hand. Thanks in advance.
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Revenue $             -   $            -   $            -   $        -   $        -   $         -   $    2,500 $    2,875 $    3,306 $    3,802 $    4,373 $    5,028 $    5,783 $    6,650 $    7,648 $      8,795 $   10,114 $   11,631 $   13,376 $   15,382 $   17,689 $   20,343 $   23,394 $   26,903 Monthly Revenue Growth...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12...
Input Data Month 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Revenue $             -   $            -   $            -   $        -   $        -   $         -   $    2,500 $    2,875 $    3,306 $    3,802 $    4,373 $    5,028 $    5,783 $    6,650 $    7,648 $      8,795 $   10,114 $   11,631 $   13,376 $   15,382 $   17,689 $   20,343 $   23,394 $   26,903 Monthly Revenue Growth...
a = [3, -4, 7] b = [-6, 9, 8] c = [4, 0, 8] d...
a = [3, -4, 7] b = [-6, 9, 8] c = [4, 0, 8] d =[7, 1, 7] e = [3, -5, 2, 1] f =[5, -7, -3, 6] g = [3, -4, 4, 3] P = Projection of ex. C = |g|(gf/gf) C = gf/|f| ex. P g --> f = Cgf = C(gf/f) (1/|f|) (f) =( gf/ff)(f) Find a. Pg --> f b. Pa --> 3b + e Find (cross multiply) a. ||a X b|| b. ||g...
Data: 7,-5, -8, 7, 9, 15, 0, 2, 13, 8, 6, -2, 4 (a) Mean= Mode=...
Data: 7,-5, -8, 7, 9, 15, 0, 2, 13, 8, 6, -2, 4 (a) Mean= Mode= median= (b) Variance= Standard deviation= (c) Range= IQR(Interquartilerange)= (d) Mid-Range= Mid-Hinge=
In 10 test runs, a truck operated for 8, 10, 10, 7, 9, 12, 10, 8,...
In 10 test runs, a truck operated for 8, 10, 10, 7, 9, 12, 10, 8, 7, and 9 miles with one gallon of a certain gasoline. Is this evidence at the 0.05 level of signifcance that the truck is not operating at an average of 11.5 miles per gallon with this gasoline? What assumptions must be satised in order for the procedure you used to analyze these data to be valid?
Consider the following set of vectors in R6 S={[-9 7 -8 3 0 -5], [1 -7...
Consider the following set of vectors in R6 S={[-9 7 -8 3 0 -5], [1 -7 3 2 -8 -8], [-6 -14 1 9 -23 -29], [11 -21 14 1 -16 -11], [8 16 -8 8 10 1], [17 -7 13 -8 8 18] (a) (2 points) Demonstrate that S is not a basis for R6. (b) (4 points) Let H = Span S. Find a basis for H and determine its dimension. (c) (2 points) Determine whether v= [1,1,1,−1,−1,−1]...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT