Question

In: Statistics and Probability

The standard project is to use multiple regression analysis to analyze a data set. The data...

The standard project is to use multiple regression analysis to analyze a data set. The data set is a study of student persistent enrolling in the next semester based on Gender, Age, GPA, a 22 questionnaire on self-efficacy, and student enrollment status.

The educational researcher wants to study the relationship between student enrollment status as it relates to gender, age, GPA, and the total response to a 22 questionnaire survey.

a. The estimated multiple regression analysis equation. b. Does the model work? Research this question, use the Significance F value and compare it using p-value. c. How well does the model work? Research this question using p-values and R Square. d. Which variables contribute to the model? Research this question using p-values. e. General interpretation of the data and the data analysis.

Regression Statistics
Multiple R 0.422451381
R Square 0.178465169
Adjusted R Square 0.148037953
Standard Error 0.431974891
Observations 113
ANOVA
df SS MS F Significance F
Regression 4 4.377924324 1.094481081 5.86531378 0.000260992
Residual 108 20.15304913 0.186602307
Total 112 24.53097345
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 0.327601267 0.33188857 0.9870821 0.325808653 -0.330259456 0.985461991 -0.330259456 0.985461991
Gender -0.30076921 0.083982457 -3.581333792 0.000513969 -0.467237009 -0.134301411 -0.467237009 -0.134301411
Age 0.006147102 0.09460487 0.064976589 0.94831276 -0.181376164 0.193670367 -0.181376164 0.193670367
GPA 0.165452103 0.054477398 3.037077944 0.002995123 0.05746845 0.273435756 0.05746845 0.273435756
Total Q 0.000365667 0.003539872 0.103299547 0.917916809 -0.006650973 0.007382307 -0.006650973 0.007382307

Solutions

Expert Solution

Part a )
The regression equation is :

Enrollment status = 0.3276-0.3008*Gender+0.006147*Age+0.1655*Gpa+0.000366*TotalQ

The coefficient values are obtained from the following part of the output:

.

part b)

The model got F value = 5.8653 and a P value = 0.000261

Since the P value 0.000261 < alpha 0.05 , this implies that the model has a significant effect in predicting the value of enrollment status

.

Part c)

The P value obtained suggest that the model is significant in predicting the values of the response variable

But the value of r square = 0.1785 ( rounded to 4 decimal values) is very small

A r square value 0.1785*100 = 17.85% implies that the model is able to explain on 17.85% of the variation in the value of the response variable

Hence in true sense though the model is statistically significant , it doesnot actually help is explaining the variation in the response variable student enrollment status

.

Part d)

The variables with P value < 0.05 contribute to the model significantly

Hence variables: Gender and GPA have got the respective P values < 0.05., hence they have significant influence on the model

The variables: Age and Total Q have respectively a larger p value > 0.05 , hence they so not have significant influence on the model

.

Part e)

Total 113 observations are taken to predict the relation of response variable enrollment status based on the independent variables, age, gender , GPA and Total Q. The P value obtained for model is 0.000261 with F value 5.8653 thus indicating that the model is significant. If we observe the variables individually, once can observe that P values for Gender and GPA are less than 0.05 thus indicating that these variables have significant influence on the model. The R square value of the model is : 17.85% which implies only 17.85% of the variation in the enrollment status is explained by this model.


Related Solutions

Research project Use econometrics analysis (analyze data by regression) to write a research on, "Do banks...
Research project Use econometrics analysis (analyze data by regression) to write a research on, "Do banks discriminate minorities when lending?"
Refer to the TV Revenue data set. Perform a complete multiple regression analysis that might be...
Refer to the TV Revenue data set. Perform a complete multiple regression analysis that might be used to predict net revenue using all provided explanatory variables (there are 4 explanatory variables). Complete all steps for the multiple regression as outlined in class and modify the original model if necessary. Use an alpha = .10 for all hypotheses tests. Make sure you show each required step for any hypothesis test. Provide all required Minitab output with your written responses. Obs NetRevenue...
13. Linear regression analysis was performed for a standard addition data set based on this experiment....
13. Linear regression analysis was performed for a standard addition data set based on this experiment. The analysis yielded the following results: m = 0.006110, b = 0.008170. What is the unknown's riboflavin concentration (ppm) in the standard addition solutions? a) 2.0 b) 13.37 c) 1.337 d) 1.67 15. The riboflavin concentration in the unknown powder solution is 58.0 ppm. The total volume of the powder solution is 250.00 mL. What is the concentration in the tablet (mg ribo/tablet) if...
i. Use MS Excel Data Analysis ToolPak to perform a multiple regression analysis using Quality as...
i. Use MS Excel Data Analysis ToolPak to perform a multiple regression analysis using Quality as the response variable and Helpfulness and Clarity as the explanatory variables. Write down the corresponding coefficient estimates and provide the regression output. j. Perform an F-test for the overall usefulness of the model in part i) using a 5% significance level. Make sure you follow all the steps for hypothesis testing indicated in the Instructions section and clearly state your conclusion. k. Test manually...
a) Run a regression analysis on the following bivariate set of data with y as the...
a) Run a regression analysis on the following bivariate set of data with y as the response variable. x y 10.7 81.6 13.7 81.5 36.7 56.5 4 72.1 50.7 23.2 47.6 -4.8 37.3 31.9 24.3 75.2 21.5 59.3 17.2 54.6 23.6 75.5 22.2 60.8 29.3 51 14 63.4 0.2 102.7 30.7 48.2 10.3 74.8 26.5 48.2 23.1 87 Verify that the correlation is significant at an ?=0.05?=0.05. If the correlation is indeed significant, predict what value (on average) for the...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales...
Use multiple regression with dummies, since the data is seasonal for the regression model. Year Sales (Millions) Trend 2014 1 480.0 1 2014 Q2 864.0 2 2014 Q3 942.0 3 2014 Q4 1,100.0 4 2015 Q1 1,200.0 5 2015 Q2 1,900.0 6 2015 Q3 1,900.0 7 2015 Q4 1,300.0 8 2016 Q1 1,200.0 9 2016 Q2 1,500.0 10 2016 Q3 1,200.0 11 2016 Q4 500.0 12 2017 Q1 356.0 13 2017 Q2 1,300.0 14 2017 Q3 1,000.0 15 2017 Q4...
(20 pts) Use the “Distance.sav” (SPSS) data set (located below) to perform a linear regression analysis....
(20 pts) Use the “Distance.sav” (SPSS) data set (located below) to perform a linear regression analysis. This dataset shows how far on average a person in Illinois drives each year. Write your findings using the format presented in the class slides. (2 pts) How much of the variation in the dependent variable is explained by the variation in the independent variable? What statistic did you use? (2 pts) Is the linear model significantly different than zero? Why or why not?...
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. 1. If an incoming student has Letters = 7 and SAT = 1000 what would his predicted College GPA be? a. 1.99 b. 2.01 c. ​​2.18 d. 2.39 2. What is the approximate error of this prediction? a. 0.58 b. ​​0.61 c....
Use the SAT data and create a multiple regression table but this time as input use...
Use the SAT data and create a multiple regression table but this time as input use ONLY two variables: Letters and SAT. Answer questions 1 to 4. Choose the best fitting answer. Note: numbers are truncated unless specified. C ollege GPA HighSchl GPA SAT Letters 2.04 2.01 1070 5 2.56 3.4 1254 6 3.75 3.68 1466 6 1.1 1.54 706 4 3 3.32 1160 5 0.05 0.33 756 3 1.38 0.36 1058 2 1.5 1.97 1008 7 1.38 2.03 1104...
Based on the below data what will be the value of standard error? Regression Statistics Multiple...
Based on the below data what will be the value of standard error? Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 8 ANOVA df SS MS F Regression 1 33 33.0 16.5 Residual 6 12 2.0 Total 7 Coefficients Standard Error t Stat P-value Intercept 9 31.274666 3.984284 0.007248 Advertising (thousands of $) 24 6.19330674 1.610802 0.158349 Submit Answer format: Number: Round to: 2 decimal places.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT