Question

In: Statistics and Probability

A statistical program is recommended. Spring is a peak time for selling houses. Suppose the data...

A statistical program is recommended.

Spring is a peak time for selling houses. Suppose the data below contains the selling price, number of bathrooms, square footage, and number of bedrooms of 26 homes sold in Ft. Thomas, Kentucky, in spring 2018.

Selling Price Baths Sq Ft Beds
160,000 1.5 1,786 3
170,000 2 1,768 3
178,000 1 1,219 3
182,500 1 1,578 2
195,100 1.5 1,125 4
212,500 2 1,196 2
245,900 2 2,128 3
250,000 3 1,280 3
255,000 2 1,596 3
258,000 2.5 2,374 4
267,000 2.5 2,439 3
268,000 2 1,470 4
275,000 2 1,688 4
Selling Price Baths Sq Ft Beds
295,000 2.5 1,860 3
325,000 3 2,056 4
325,000 3.5 2,776 4
328,400 2 1,408 4
331,000 1.5 1,972 3
344,500 2.5 1,736 3
365,000 2.5 1,990 4
385,000 2.5 3,640 4
395,000 2.5 1,928 4
399,000 2 2,108 3
430,000 2 2,462 4
430,000 2 2,615 4
454,000 3.5 3,700 4

Consider the estimated regression equation we developed that can be used to predict the selling price given the number of bathrooms, square footage, and number of bedrooms in the house.

(x1 denotes number of bathrooms, x2 denotes square footage, x3 denotes number of bedrooms, and y denotes the selling price.)

ŷ = −1770.46 + 18130.69x1 + 60.00x2 + 40706.14x3

(a)

Does the estimated regression equation provide a good fit to the data? Explain. (Round your answer to two decimal places.)

Since the adjusted R2= _____

(b)

Consider the estimated regression equation that was developed which predicts selling price given the square footage and number of bedrooms.

(x2 denotes square footage,  x3 denotes number of bedrooms, and y denotes the selling price.)

ŷ = 7679.47 + 67.88x2 + 44959.40x3

Compare the fit for this simpler model to that of the model that also includes number of bathrooms as an independent variable. (Round your answer to Three decimal places.)

The adjusted R2 for the simpler model is:_____

Solutions

Expert Solution

a) With X1, X2, X3

Excel > Data > Data Analysis > Regression

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.731959402
R Square 0.535764566
Adjusted R Square 0.472459735
Standard Error 63051.30679
Observations 26
ANOVA
df SS MS F Significance F
Regression 3 1.00936E+11 33645370273 8.463249182 0.000632515
Residual 22 87460280335 3975467288
Total 25 1.88396E+11
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept -1770.455048 70253.81669 -0.025200838 0.980121962 -147467.9534 143927.0433 -147467.9534 143927.0433
Baths 18130.69307 24766.70749 0.732059079 0.471859809 -33232.31458 69493.70072 -33232.31458 69493.70072
Sq Ft 59.99529962 23.63299424 2.538624561 0.018715141 10.98346935 109.0071299 10.98346935 109.0071299
Beds 40706.13783 22399.92992 1.817243981 0.082827556 -5748.473551 87160.74921 -5748.473551 87160.74921

ŷ = −1770.46 + 18130.69x1 + 60.00x2 + 40706.14x3

x1 denotes number of bathrooms,

x2 denotes square footage,

x3 denotes number of bedrooms,

and y denotes the selling price.

Adjusted R^2 = 0.47, it is close to 0.5, which means not a good fit, it is moderate fit

b) With X2, X3

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.724193339
R Square 0.524455992
Adjusted R Square 0.483104339
Standard Error 62411.94666
Observations 26
ANOVA
df SS MS F Significance F
Regression 2 98805616177 49402808088 12.68283019 0.000193952
Residual 23 89590774977 3895251086
Total 25 1.88396E+11
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 7679.466794 68357.42325 0.112342836 0.911526323 -133728.637 149087.5706 -133728.637 149087.5706
Sq Ft 67.88020883 20.82253767 3.259939298 0.003446639 24.80550782 110.9549098 24.80550782 110.9549098
Beds 44959.40208 21413.95612 2.099537415 0.046942019 661.2587841 89257.54538 661.2587841 89257.54538

ŷ = 7679.47 + 67.88x2 + 44959.40x3

x2 denotes square footage,

x3 denotes number of bedrooms,

and y denotes the selling price.

Adjusted R^2 = 0.48

Adjusted R^2 simpler model > Adjusted R^2 from part a, So model from part b is preferred  

----------------------------

DEAR STUDENT,

IF YOU HAVE ANY QUERY ASK ME IN THE COMMENT BOX,I AM HERE TO HELPS YOU.PLEASE GIVE ME POSITIVE RATINGS

*****************THANK YOU***************


Related Solutions

A statistical program is recommended. Spring is a peak time for selling houses. Suppose the data...
A statistical program is recommended. Spring is a peak time for selling houses. Suppose the data below contains the selling price, number of bathrooms, square footage, and number of bedrooms of 26 homes sold in Ft. Thomas, Kentucky, in spring 2018. Selling Price Baths Sq Ft Beds 160,000 1.5 1,786 3 170,000 2 1,768 3 178,000 1 1,219 3 182,500 1 1,578 2 195,100 1.5 1,125 4 212,500 2 1,196 2 245,900 2 2,128 3 250,000 3 1,280 3 255,000...
A statistical program is recommended. Suppose that the magnitude of earthquakes recorded in a region of...
A statistical program is recommended. Suppose that the magnitude of earthquakes recorded in a region of North America can be modeled as having a gamma distribution with α = 0.8 and β = 2.6. (b) What is the probability that the magnitude of an earthquake striking the region will exceed 5 on the Richter scale? (Round your answer to four decimal places.) (c) Using an exponential distribution with mean 2.6 to model the magnitude of an earthquake striking the region,...
A statistical program is recommended. Consider the following data for a dependent variable y and two...
A statistical program is recommended. Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 94 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 211 (a) Develop an estimated regression equation relating y to x1. (Round your numerical values to one decimal place.)ŷ =   Predict y if x1 = 43. (Round...
A statistical program is recommended. Data showing the values of several pitching statistics for a random...
A statistical program is recommended. Data showing the values of several pitching statistics for a random sample of 20 pitchers from the American League of Major League Baseball is provided. Player Team W L ERA SO/IP HR/IP R/IP Verlander, J DET 24 5 2.40 1.00 0.10 0.29 Beckett, J BOS 13 7 2.89 0.91 0.11 0.34 Wilson, C TEX 16 7 2.94 0.92 0.07 0.40 Sabathia, C NYY 19 8 3.00 0.97 0.07 0.37 Haren, D LAA 16 10 3.17...
A statistical program is recommended. Data showing the values of several pitching statistics for a random...
A statistical program is recommended. Data showing the values of several pitching statistics for a random sample of 20 pitchers from the American League of Major League Baseball is provided. Player Team W L ERA SO/IP HR/IP R/IP Verlander, J DET 24 5 2.40 1.00 0.10 0.29 Beckett, J BOS 13 7 2.89 0.91 0.11 0.34 Wilson, C TEX 16 7 2.94 0.92 0.07 0.40 Sabathia, C NYY 19 8 3.00 0.97 0.07 0.37 Haren, D LAA 16 10 3.17...
A statistical program is recommended. Data showing the values of several pitching statistics for a random...
A statistical program is recommended. Data showing the values of several pitching statistics for a random sample of 20 pitchers from the American League of Major League Baseball is provided. Player Team W L ERA SO/IP HR/IP R/IP Verlander, J DET 24 5 2.40 1.00 0.10 0.29 Beckett, J BOS 13 7 2.89 0.91 0.11 0.34 Wilson, C TEX 16 7 2.94 0.92 0.07 0.40 Sabathia, C NYY 19 8 3.00 0.97 0.07 0.37 Haren, D LAA 16 10 3.17...
A statistical program is recommended. Data showing the values of several pitching statistics for a random...
A statistical program is recommended. Data showing the values of several pitching statistics for a random sample of 20 pitchers from the American League of Major League Baseball is provided. Player Team W L ERA SO/IP HR/IP R/IP Verlander, J DET 24 5 2.40 1.00 0.10 0.29 Beckett, J BOS 13 7 2.89 0.91 0.11 0.34 Wilson, C TEX 16 7 2.94 0.92 0.07 0.40 Sabathia, C NYY 19 8 3.00 0.97 0.07 0.37 Haren, D LAA 16 10 3.17...
A statistical program is recommended. The National Football League (NFL) records a variety of performance data...
A statistical program is recommended. The National Football League (NFL) records a variety of performance data for individuals and teams. To investigate the importance of passing on the percentage of games won by a team, the following data show the conference (Conf), average number of passing yards per attempt (Yds/Att), the number of interceptions thrown per attempt (Int/Att), and the percentage of games won (Win%) for a random sample of 16 NFL teams for one full season. Team Conf Yds/Att...
A statistical program is recommended. Consider the following data for two variables, x and y. x...
A statistical program is recommended. Consider the following data for two variables, x and y. x 22 24 26 30 35 40 y 12 20 33 35 40 36 (a) Develop an estimated regression equation for the data of the form ŷ = b0 + b1x.  (Round b0 to one decimal place and b1 to three decimal places. ŷ = (b) Use the results from part (a) to test for a significant relationship between x and y. Use α = 0.05....
A statistical program is recommended. Consider the following data for two variables, x and y. x...
A statistical program is recommended. Consider the following data for two variables, x and y. x 9 32 18 15 26 y 11 20 22 17 22 (a) Develop an estimated regression equation for the data of the form ŷ = b0 + b1x. (Round b0 to two decimal places and b1 to three decimal places.) ŷ = Comment on the adequacy of this equation for predicting y. (Use α = 0.05.) The high p-value and low coefficient of determination...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT