Question

In: Math

            Develop a simple linear regression model to predict the price of a house based upon...

  1.             Develop a simple linear regression model to predict the price of a house based upon the living area (square feet) using a 95% level of confidence.
  1.             Write the reqression equation
  2.             Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.
  3.              Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  4.             Interpret the coefficient for the independent variable.
  5.             What percentage of the observed variation in housing prices is explained by the model?
  6.              Predict the value of a house with 3,000 square feet of living area.
  1.             Develop a simple linear regression model to predict the price of a house based upon the number of bedrooms using a 95% level of confidence.
  1.             Write the reqression equation
  2.             Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.
  3.              Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  4.             Interpret the coefficient for the independent variable.
  5.             What percentage of the observed variation in housing prices is explained by the model?
  6.              Predict the value of a house with 3 bedrooms.
  1.             Develop a simple linear regression model to predict the price of a house based upon the number of bathrooms using a 95% level of confidence.
  1.             Write the reqression equation
  2.             Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.
  3.              Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  4.             Interpret the coefficient for the independent variable.
  5.             What percentage of the observed variation in housing prices is explained by the model?
  6.              Predict the value of a house with 2.5 bathrooms.
  1.             Develop a simple linear regression model to predict the price of a house based upon its age using a 95% level of confidence.
  1.             Write the reqression equation
  2.             Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.
  3.              Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  4.             Interpret the coefficient for the independent variable.
  5.             What percentage of the observed variation in housing prices is explained by the model?
  6.              Predict the value of a house that is 50 years old.
  1.             Compare the preceding four simple linear regression models to determine which model is the preferred model. Use the Significance F values, p-values for independent variable coefficients, R-squared or Adjusted R-squared values (as appropriate), and standard errors to explain your selection.
  2.             Calculate the predicted sale price of a 50 year old house with 3,000 square feet of living area, 3 bedrooms, and 2.5 bathrooms using your preferred regression model from part 5.

Prepare a single Microsoft Excel file, using a separate worksheet for each regression model, to document your regression analyses. Prepare a single Microsoft Word document that outlines your responses for each portions of the case study.

Selling Price        Age (Years)         Living Area (Sq Feet)       No. Bathrooms No Bedrooms

$92,000                 18           1,527     2              4

$211,002              0              2,195     2.5          4

$115,000              14           1,480     1.5          3

$113,000              53           1,452     2              3

$216,300              0              2,360     2.5          4

$145,000              32           1,440     1              3

$114,000              14           1,480     2.5          2

$139,050              125         1,879     2.5          3

$104,000              14           1,480     1.5          3

$169,900              11           1,792     2.5          3

$177,900              2              1,386     2.5          3

$133,000              14           1,676     2              2

$185,000              0              768         2              4

$115,000              16           1,560     1.5          3

$100,000              91           1,000     1              3

$117,000              15           1,676     1.5          4

$150,000              11           1,656     1.5          3

$187,500              11           2,300     1.5          3

$107,000              25           1,712     1              3

$126,900              26           1,350     1.5          3

$147,000              15           1,676     2.5          3

$62,000                 103         1,317     1.5          3

$101,000              30           1,056     2              3

$143,500              13           912         1              3

$113,400              18           1,232     2              2

$112,000              36           1,280     1              3

$112,500              43           1,232     1              3

$97,000                 45           1,406     1.5          3

$121,000              6              1,164     2              3

$65,720                 123         1,198     1              3

$225,000              10           2,206     2.5          4

Solutions

Expert Solution

Develop a simple linear regression model to predict the price of a house based upon the living area (square feet) using a 95% level of confidence

Dependent variable = price
Independent variable = area
1. Put the values in excel as shown below.


2. We use the regression option under the Data analysis tab.


3. Input the data as shown below.


4. The output will be generated as follows


5. We formulate the regression equation using the output (highlighted in yellow).

Write the reqression equation
Price = 39438.54 + 61.8185 Area

Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.

For this we look that pvalue of the Anova. The Pvalue of anova (highlighted in green)
Since the pvalue is less than 0.05, the model is significant.


Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
For this, we look that pvalue of the coefficient . The Pvalue of the coefficient (highlighted in orange)
Since the pvalue is less than 0.05, hence the variable is significant in predicting the dependent variable.

Interpret the coefficient for the independent variable.
One unit increase in the area, increases the price by 61.81 dollars

What percentage of the observed variation in housing prices is explained by the model?
Variation explained by the model is given by the Rsquare (highlighted in blue) which 0.3444
Note - Higher the number, better the model is.


Predict the value of a house with 3,000 square feet of living area.
Price = 39438.54 + 61.8185 Area
Price = 39438.54 + 61.8185 (3000) =224894.30

---------------------------------------------------------

Follow the similar steps for the next variable. The screenshot are provided in case you get stuck.

Develop a simple linear regression model to predict the price of a house based upon the number of bedrooms using a 95% level of confidence.

Dependent variable = price
Independent variable = bedrooms

Write the reqression equation
Price = 30701.86 + 32988.68 Bedroom


Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.

For this we look that pvalue of the Anova. The Pvalue of anova (highlighted in green)
Since the pvalue is less than 0.05, the model is significant.


Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
For this we look that pvalue of the coefficient . The Pvalue of the coefficient (highlighted in orange)
Since the pvalue is less than 0.05, hence the variable is significant in predicting the dependent variable.

Interpret the coefficient for the independent variable.
One unit increase in the bedroom increases the price by 32988.68 dollars

What percentage of the observed variation in housing prices is explained by the model?
Variation explained by the model is given by the Rsquare (highlighted in blue) which 0.1873
Note - Higher the number, better the model is.


Predict the value of a house with 3 bedrooms.
Price = 30701.86 + 32988.68 Bedroom
Price = 30701.86 + 32988.68 (3)=129667.93

--------------------------
Develop a simple linear regression model to predict the price of a house based upon the number of bathrooms using a 95% level of confidence.

Dependent variable = price
Independent variable = bathroom

Write the reqression equation
Price = 63247.78 + 39596.15 bathroom


Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.

For this we look that pvalue of the Anova. The Pvalue of anova (highlighted in green)
Since the pvalue is less than 0.05, the model is significant.


Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
For this we look that pvalue of the coefficient . The Pvalue of the coefficient (highlighted in orange)
Since the pvalue is less than 0.05, hence the variable is significant in predicting the dependent variable.

Interpret the coefficient for the independent variable.
One unit increase in the bathroom increases the price by 39596.157 dollars

What percentage of the observed variation in housing prices is explained by the model?
Variation explained by the model is given by the Rsquare (highlighted in blue) which 0.2924
Note - Higher the number, better the model is.


Predict the value of a house with 2.5 bathrooms.
Price = 63247.78 + 39596.15 bathroom
Price = 63247.78 + 39596.15 (2.5)=162238.18

-------------------------

Develop a simple linear regression model to predict the price of a house based upon its age using a 95% level of confidence.

Dependent variable = price
Independent variable = age

Write the reqression equation
Price = 152772.22 -660.88 age


Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.

For this we look that pvalue of the Anova. The Pvalue of anova (highlighted in green)
Since the pvalue is less than 0.05, the model is significant.


Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
For this we look that pvalue of the coefficient . The Pvalue of the coefficient (highlighted in orange)
Since the pvalue is less than 0.05, hence the variable is significant in predicting the dependent variable.

Interpret the coefficient for the independent variable.
One  unit increase in the age decreases the price by 660.8855
dollars

What percentage of the observed variation in housing prices is explained by the model?
Variation explained by the model is given by the Rsquare (highlighted in blue) which 0.3066
Note - Higher the number, better the model is.


Predict the value of a house that is 50 years old.
Price = 152772.22 -660.88 age
Price = 152772.22 -660.88 (50)= 119727.95

Compare the preceding four simple linear regression models to determine which model is the preferred model. Use the Significance F values, p-values for independent variable coefficients, R-squared or Adjusted R-squared values (as appropriate), and standard errors to explain your selection.
Calculate the predicted sale price of a 50 year old house with 3,000 square feet of living area, 3 bedrooms, and 2.5 bathrooms using your preferred regression model from part 5.

This is a summary of all the model. We see that all the model are significant and the variable are significant predictors of the dependent variable.


However we see that the Rsquare for model with Area is higher compared to other. This indicates that the model is able to explain a higher percentage of the variability of y.

Hence I would prefer the model with Area.

Price = 39438.54 + 61.8185 Area
Price = 39438.54 + 61.8185 (3000) =224894.30


Related Solutions

. Develop a simple linear regression model to predict a person’s income (INCOME) based upon their...
. Develop a simple linear regression model to predict a person’s income (INCOME) based upon their years of education (EDUC) using a 95% level of confidence. a. Write the reqression equation. b. Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. c. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence. d. Interpret the coefficient...
Develop a simple linear regression model to predict the Cost of Living Index based upon Restaurant...
Develop a simple linear regression model to predict the Cost of Living Index based upon Restaurant Price Index using a 95% level of confidence. Write the reqression equation. Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence. Interpret the coefficient for the independent variable. What percentage...
1.Develop a multiple linear regression model to predict the price of a house using the square...
1.Develop a multiple linear regression model to predict the price of a house using the square feet of living area, number of bedrooms, and number of bathrooms as the predictor variables     Write the reqression equation.      Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for each independent variable using the appropriate regression statistics at a 95% level of confidence....
Develop a simple linear regression model to predict a person’s income (INCOME) based on their age...
Develop a simple linear regression model to predict a person’s income (INCOME) based on their age (AGE) using a 95% level of confidence. a. Write the regression equation. Discuss the statistical significance of the model as whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence. Interpret the coefficient for the independent variable. What percentage of...
Question 4 A simple linear regression model was used in order to predict y, duration of...
Question 4 A simple linear regression model was used in order to predict y, duration of relief from allergy, from x, dosage of medication. A total of n=10 subjects were given varying doses, and their recovery times noted. Here is the R output. summary(lmod4) ## ## Call: ## lm(formula = y ~ x) ## ## Residuals: ##     Min      1Q Median      3Q     Max ## -3.6180 -1.9901 -0.4798 2.2048 3.7385 ## ## Coefficients: ##             Estimate Std. Error t value Pr(>|t|)    ## (Intercept)...
A simple linear regression model based on 26 observations. The F-stat for the model is 6.45...
A simple linear regression model based on 26 observations. The F-stat for the model is 6.45 and the standard error for the coefficient of X is 0.2. MSR = 54.75 Complete an ANOVA table. Find the t-stat and the coefficient of X. Find R2.
I have conducted a linear regression model to predict student scores on an exam based on...
I have conducted a linear regression model to predict student scores on an exam based on the number of hours they studied. I get a coefficient (slope) of +2.5 for the variable of hours studied. The pvalue for this coefficient is 0.45 and the 95% confidence interval is [-2.5, +7]. Which of the following conclusions CANNOT be drawn from these results? At an alpha of 0.05, we can say that the effect of hours studied on exam score is significant...
1. In the iris data, build a linear regression model to predict Sepal.Length based on both...
1. In the iris data, build a linear regression model to predict Sepal.Length based on both Petal.Length and Species. a. Calculate the regression equation, including the interaction. b. From this equation, you should be able to find 3 regression lines (one for each Species). Interpret each of the 3 slopes of the lines in the context of the problem. Remember that both numerical variables are measured in centimeters. c. Plot the 3 regression lines in a scatterplot of Sepal.Length vs....
Using the Instructor ranking data conduct a simple linear regression to predict a student’s scores based...
Using the Instructor ranking data conduct a simple linear regression to predict a student’s scores based on the number of hours the student studies and answer the following questions: What is the value of the intercept of this regression equation? What is the value of the slope of this regression equation and what is its interpretation within the context of this problem? Write the equation for this logistic regression model R2 =    What does this number mean? F=     Explain how F (F...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears that the calculation of the coefficient of determination, R2, for this model can be accomplished by using the squared sample correlation coefficient between the original values and the predicted values of the dependent variable of this model. Is this statement true? If yes, why? If not, why not? Please use either matrix algebra or algebra to support your reasoning.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT