Question

In: Statistics and Probability

Parameter estimates: intercept: 14179.871 3298.484 4.30 0.0003* (estimate Std Error tRatio Prob>t ) InCostAid: 0.7768753 0.254022...

Parameter estimates:

intercept: 14179.871 3298.484 4.30 0.0003* (estimate Std Error tRatio Prob>t )

InCostAid: 0.7768753 0.254022 3.06 0,0058*   (estimate Std Error tRatio Prob>t )

Baruch College has substantially less average debt compared to the other schools with similar in-state costs. Figure 10.13 contains JMP output for the simple linear regression of AveDebt on InCostAid with this case removed.

a) State the least-squares regression line.

b) The University of North Florida is one school in this sample. It has an in-state cost of $11,421 and average debt of $17,617. What is the residual?

c) Construct a 95% confidence interval for the slope. What does this interval tell you about the change in average debt for a $1000 change in the in-state cost?

ans:

a)    y = 0.7768753InCostAid+14179.871

b)   0.7768*11421+14179.871 = 23052.5628

residual = 23052.5628 – 17617 = 5435.562801

c) 0.7768753+- t*0.254022 = 0.7768753+- 2.073873*0.254022 =

  (0.2500659328, 1.303684667)

Baruch College was removed from this analysis because it was deemed an outlier. Let’s investigate its impact on the fit.

A) Refit the model using the entire sample of 25 schools. Create a table that summarizes the model estimates with and without this case.

B) Describe the impact this observation has on the fit of the linear regression model.

C) If you were writing a report for publication, would you include the fit with or without this case? Explain your answer.

Need answer for A, B, C

Solutions

Expert Solution

Ans. Suppose in my 25 observation sample data I want to keep variable like incercept , IncostAid(x) , average debt (y) for my regression model. I want to do a regression analysis between x (incostaid) & y (average debt) on SPSS. Please look at my data first. Suppose Intercept is 14,180 for my case.

Now I want to update the image of caculations of the predicted value of average debt (ybar) & residuals in each data. please look at the data

A). ans.

From the above image you can easily get know that Error = Observed (y) - Predicted (y)

B) ans. Now impact of the observations on the model , to see it we need to understand the entire regression result. Please look at the image below :-

From the above result we easily see that from the first table that Rsquare = 0.44.

So, 44% of the fitted model is good.

From the anova table we can see that Total variation of the average debt is 6657172.5 and among that 2932067.36 are defined by Incostaid & rest are errors.

Our null hypothesis in this regression is H0 : Beta = 0

alternative Ha : Beta 0

By seeing the p-value from co-efficients table of regression result (sig. column) , which is 0.000 , we could decide to reject null that is Beta = 0. Also we can see it from the value of anaova table.

Lastly from the residual statistics table we get the entire descriptive statistics for the residuals.

C)ans. The model Rsquare = 0.44. So, only 44% is of Average Debt is defined by Incostaid. Now I would like to find some other predictor variables , which are caused for Average _Debt. Then run a regression. It would help me to get a high R values for more significant variable.


Related Solutions

1.-Interpret the following regression model Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) -7.819e+05 7.468e+04 -10.470...
1.-Interpret the following regression model Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) -7.819e+05 7.468e+04 -10.470 < 2e-16 *** Lot.Size -5.359e-01 1.163e-01 -4.610 4.67e-06 *** Square.Feet 1.108e+02 1.109e+01 9.986 < 2e-16 *** Num.Baths 2.985e+04 9.650e+03 3.094 0.00204 ** API.2011 1.226e+03 9.034e+01 13.568 < 2e-16 *** dis_coast -7.706e+00 2.550e+00 -3.022 0.00259 ** dis_fwy 1.617e+01 1.232e+01 1.312 0.18995 dis_down 5.364e+00 3.299e+00 1.626 0.10429 I(dis_fwy * dis_down) -4.414e-04 5.143e-04 -0.858 0.39098 Pool 1.044e+05 2.010e+04 5.194 2.59e-07 *** --- Signif. codes: 0 ‘***’ 0.001...
For a regression with log(bweight) as the dependent variable, we have Variable Estimate Standard Error Constant/Intercept...
For a regression with log(bweight) as the dependent variable, we have Variable Estimate Standard Error Constant/Intercept 4.66 0.029 Male 0.03 0.010 Motheduc 0.002 0.002 lfaminc 0.02 0.006 You expect an increase in family income to increase birth weight. Test this hypothesis at the 1% level of significance. Make sure you explicitly include all of the steps.
Question 2: Related-Samples t-Test Paired Samples Statistics Mean N Std. Deviation Std. Error Mean Pair 1...
Question 2: Related-Samples t-Test Paired Samples Statistics Mean N Std. Deviation Std. Error Mean Pair 1 freshman yr science score 51.8500 200 9.90089 .70010 senior yr science score 53.0450 200 12.10145 .85570 Paired Samples Correlations N Correlation Sig. Pair 1 freshman yr science score & senior yr science score 200 .878 .000 Paired Samples Test Paired Differences t df Sig. (2-tailed) Mean Std. Deviation Std. Error Mean 95% Confidence Interval of the Difference Lower Upper Pair 1 freshman yr science...
Question 2: Related-Samples t-Test Paired Samples Statistics Mean N Std. Deviation Std. Error Mean Pair 1...
Question 2: Related-Samples t-Test Paired Samples Statistics Mean N Std. Deviation Std. Error Mean Pair 1 freshman yr science score 51.8500 200 9.90089 .70010 senior yr science score 53.0450 200 12.10145 .85570 Paired Samples Correlations N Correlation Sig. Pair 1 freshman yr science score & senior yr science score 200 .878 .000 Paired Samples Test Paired Differences t df Sig. (2-tailed) Mean Std. Deviation Std. Error Mean 95% Confidence Interval of the Difference Lower Upper Pair 1 freshman yr science...
When I ran a bivariate regression, I got the following table Coefficients: Estimate Std. Error z...
When I ran a bivariate regression, I got the following table Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) 10.8681 2.8754 3.780 0.000157 *** ETHWAR -1.0170 0.4524 -2.248 0.024570 * When I ran a multivariate regression, I got the following table Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) 10.811 2.987 3.619 0.000296 *** ETHWAR -13.804 4844.876 -0.003 0.997727 CIVTOT 12.730 4844.877 0.003 0.997903 Why did the p-value for ETHWAR change? And why did it change so dramatically?
Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1 .941a...
Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1 .941a .885 .872 1.00528 a. Predictors: (Constant), SelfControl, NumStrains ANOVAa Model Sum of Squares df Mean Square F Sig. 1 Regression 132.570 2 66.285 65.590 .000b Residual 17.180 17 1.011 Total 149.750 19 a. Dependent Variable: AgeFirstArrest b. Predictors: (Constant), SelfControl, NumStrains Coefficientsa Model Unstandardized Coefficients Standardized Coefficients t Sig. 95.0% Confidence Interval for B Collinearity Statistics B Std. Error Beta Lower Bound Upper Bound...
Differentiate Standard error of regression (Sy) Standard error of slope (Sm) Standard error of y-intercept (Sb)
Differentiate Standard error of regression (Sy) Standard error of slope (Sm) Standard error of y-intercept (Sb)
r studio answer to these questions What is the intercept parameter (2dp) for the regression equation...
r studio answer to these questions What is the intercept parameter (2dp) for the regression equation of height (y) versus age (x) using the pine_growth.csv data? What is the total amount of variation explained by the regression model (SSr) of height (y) versus age (x) using the pine_growth.csv data? What is the residual error sum of squares (SSe) of a regression model of height (y) versus age (x) using the pine_growth.csv data. Use your regression equation fitted to the pine_growth.csv...
Coefficients Standard Error t-Stat    P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept Calories...
Coefficients Standard Error t-Stat    P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept Calories Calories (x) Fat Content (y) Predicted y value Residual 140 7 5.75 1.25 160 8 6.77 1.23 70 2 2.18 -0.18 120 3.5 4.73 -1.23 170 16 7.28 8.72 290 12 13.4 -1.4 210 17 9.32 7.68 190 12 8.3 3.7 210 1.5 9.32 -7.82 130 3.5 5.24 -1.74 150 4.5 6.26 -1.76 160 12 6.77 5.23 150 7 6.26 0.74 140 6 5.75...
true or fulse q1 In a regression model it is given that the estimate of intercept...
true or fulse q1 In a regression model it is given that the estimate of intercept is 5 and the estimate of slope is 4 , then the value of dependent variable y for x = 2 is 14. True False q2 When looking at the waiting line at SEU daam system, we can assume that calling population is limited. True False q3 A coffee machine can serve customers at the rate of 20 per hour . The customers arrive...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT