Question

In: Statistics and Probability

Develop a simple linear regression model to predict a person’s income (INCOME) based on their age...

  1. Develop a simple linear regression model to predict a person’s income (INCOME) based on their age (AGE) using a 95% level of confidence.

a. Write the regression equation.

  1. Discuss the statistical significance of the model as whole using the appropriate regression statistic at a 95% level of confidence.
  2. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  3. Interpret the coefficient for the independent variable.

What percentage of the observed variation in a person’s income is explained by the model?

  1. Predict the value of a person’s income who is 45 years old, using this regression model.
INCOME AGE EARNRS EDUC CHILDS HRS1
500 27 3 12 0 56
500 23 3 12 1 10
500 78 0 16 2 0
500 64 0 17 0 0
500 54 1 14 3 0
500 22 2 13 1 0
500 21 1 12 2 24
500 53 2 14 4 0
500 57 1 16 0 4
500 22 2 12 0 0
500 21 2 14 0 0
500 23 2 12 0 57
500 45 2 14 0 9
500 21 1 14 0 0
500 18 2 12 0 35
500 38 3 15 3 20
500 43 2 12 1 50
500 71 0 12 2 12

Solutions

Expert Solution

a) Regression Line -

Income = 500 - 0.00000 * Age

Output from R -

lm(formula = Income ~ Age)

Coefficients:
(Intercept) Age
5.000e+02 -8.709e-16

b) Following is the model significance

F-value = 17.34

p-value = 0.0007 < 0.05

Hence, model is not significant at 95% confidence level.

Call:
lm(formula = Income ~ Age)

Residuals:
Min 1Q Median 3Q Max
-4.504e-14 -4.155e-14 -2.544e-14 -1.172e-14 4.451e-13

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 5.000e+02 6.182e-14 8.088e+15 <2e-16 ***
Age -8.709e-16 1.424e-15 -6.110e-01 0.549
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 1.158e-13 on 16 degrees of freedom
Multiple R-squared: 0.5201, Adjusted R-squared: 0.4901
F-statistic: 17.34 on 1 and 16 DF, p-value: 0.0007319

Warning message:
In summary.lm(Lin_Mod) : essentially perfect fit: summary may be unreliable

c) siginificance of the independent variable

t-value for variable Age = -0.611

p-value = 0.549 > 0.05

Hence, independent variable Age is significant at 95% confidence level.

d) Interpretation of independent variables

average unit change in Age changes income by unit.

e) At Age = 45, income = 500 + 0.00000 * 45 = 500

At age 45, income = 500


Related Solutions

. Develop a simple linear regression model to predict a person’s income (INCOME) based upon their...
. Develop a simple linear regression model to predict a person’s income (INCOME) based upon their years of education (EDUC) using a 95% level of confidence. a. Write the reqression equation. b. Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. c. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence. d. Interpret the coefficient...
            Develop a simple linear regression model to predict the price of a house based upon...
            Develop a simple linear regression model to predict the price of a house based upon the living area (square feet) using a 95% level of confidence.             Write the reqression equation             Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence.              Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.             Interpret the...
Develop a simple linear regression model to predict the Cost of Living Index based upon Restaurant...
Develop a simple linear regression model to predict the Cost of Living Index based upon Restaurant Price Index using a 95% level of confidence. Write the reqression equation. Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence. Interpret the coefficient for the independent variable. What percentage...
1.Develop a multiple linear regression model to predict the price of a house using the square...
1.Develop a multiple linear regression model to predict the price of a house using the square feet of living area, number of bedrooms, and number of bathrooms as the predictor variables     Write the reqression equation.      Discuss the statistical significance of the model as a whole using the appropriate regression statistic at a 95% level of confidence. Discuss the statistical significance of the coefficient for each independent variable using the appropriate regression statistics at a 95% level of confidence....
Question 4 A simple linear regression model was used in order to predict y, duration of...
Question 4 A simple linear regression model was used in order to predict y, duration of relief from allergy, from x, dosage of medication. A total of n=10 subjects were given varying doses, and their recovery times noted. Here is the R output. summary(lmod4) ## ## Call: ## lm(formula = y ~ x) ## ## Residuals: ##     Min      1Q Median      3Q     Max ## -3.6180 -1.9901 -0.4798 2.2048 3.7385 ## ## Coefficients: ##             Estimate Std. Error t value Pr(>|t|)    ## (Intercept)...
A simple linear regression model based on 26 observations. The F-stat for the model is 6.45...
A simple linear regression model based on 26 observations. The F-stat for the model is 6.45 and the standard error for the coefficient of X is 0.2. MSR = 54.75 Complete an ANOVA table. Find the t-stat and the coefficient of X. Find R2.
I have conducted a linear regression model to predict student scores on an exam based on...
I have conducted a linear regression model to predict student scores on an exam based on the number of hours they studied. I get a coefficient (slope) of +2.5 for the variable of hours studied. The pvalue for this coefficient is 0.45 and the 95% confidence interval is [-2.5, +7]. Which of the following conclusions CANNOT be drawn from these results? At an alpha of 0.05, we can say that the effect of hours studied on exam score is significant...
1. In the iris data, build a linear regression model to predict Sepal.Length based on both...
1. In the iris data, build a linear regression model to predict Sepal.Length based on both Petal.Length and Species. a. Calculate the regression equation, including the interaction. b. From this equation, you should be able to find 3 regression lines (one for each Species). Interpret each of the 3 slopes of the lines in the context of the problem. Remember that both numerical variables are measured in centimeters. c. Plot the 3 regression lines in a scatterplot of Sepal.Length vs....
Using the Instructor ranking data conduct a simple linear regression to predict a student’s scores based...
Using the Instructor ranking data conduct a simple linear regression to predict a student’s scores based on the number of hours the student studies and answer the following questions: What is the value of the intercept of this regression equation? What is the value of the slope of this regression equation and what is its interpretation within the context of this problem? Write the equation for this logistic regression model R2 =    What does this number mean? F=     Explain how F (F...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears...
When we estimate a linear multiple regression model (including a linear simple regression model), it appears that the calculation of the coefficient of determination, R2, for this model can be accomplished by using the squared sample correlation coefficient between the original values and the predicted values of the dependent variable of this model. Is this statement true? If yes, why? If not, why not? Please use either matrix algebra or algebra to support your reasoning.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT