Question

In: Statistics and Probability

A multiple regression model is to be constructed to predict the final exam score of a...

A multiple regression model is to be constructed to predict the final exam score of a university student doing a particular course based upon their mid-term exam score, the average number of hours spent studying per week and the average number of hours spent watching television per week.

Data has been collected on 30 randomly selected individuals: hide data

Download the data

Final score Mid-term Score Hours studying
per week
Hours watching TV
per week
76 85 19 34
60 85 3 11
42 63 10 33
32 40 6 12
46 65 6 16
48 72 13 30
30 37 14 33
47 47 9 26
33 26 19 7
60 65 18 8
59 79 13 24
28 29 10 22
24 33 7 9
59 77 5 6
66 93 17 7
51 48 18 29
74 98 4 7
29 23 8 15
31 31 6 9
69 79 18 15
60 73 3 6
62 89 11 31
49 52 19 6
37 44 14 9
63 94 10 26
62 89 7 27
30 31 18 32
42 60 14 17
54 70 4 22
73 97 19 28

a)Find the multiple regression equation using all three explanatory variables. Assume that X1 is mid-term score, X2 is hours studying per week and X3 is hours watching television per week. Give your answers to 3 decimal places.

y^ =  + mid-term score + hours studying + hours watching television

b)At a level of significance of 0.05, the result of the F test for this model is that the null hypothesis isis not rejected.

For parts c) and d), using the data, separately calculate the correlations between the response variable and each of the three explanatory variables.

c)The explanatory variable that is most correlated with final score is:

mid-term score
hours studying per week
hours watching television per week

d)The explanatory variable that is least correlated with final score is:

mid-term score
hours studying per week
hours watching television per week

e)The value of R2 for this model, to 2 decimal places, is equal to

f)The value of se for this model, to 3 decimal places, is equal to

g)Construct a new multiple regression model by removing the variable average hours spent watching television per week. Give your answers to 3 decimal places.

The new regression model equation is:

y^ =  + mid-term score + hours studying

h)In the new model compared to the previous one, the value of R2 (to 2 decimal places) is:

increased
decreased
unchanged

i)In the new model compared to the previous one, the value of se (to 3 decimal places) is:

increased
decreased
unchanged

Solutions

Expert Solution

a. The regression line is

y= 8.205 + 0.617 X1 + 0.501 X2 - 0.141 X3

b. At a level of significance of 0.05, the result of the F test for this model is that the null hypothesis is rejected.

c. Since the correlation between Final scores and Mid-term score is highest(i,e 0.931), Thus The explanatory variable that is most correlated with the final score is mid-term scores.

d. Since the correlation between Final scores and the Hours watching TV per week is lowest(i,e 0.060), Thus The explanatory variable that is least correlated with the final score is Hours watching Tv per week

e. The value of R2 for this model, to 2 decimal places, is 0.90

f. The value of se for this model, to 3 decimal places, is 5.226

g. The required regression line is

y^ = 6.812+ 0.610 mid-term score + 0.435 hours studying

h. In the new model compared to the previous one, the value of R2 = 0.89 and  is decreased

i) In the new model compared to the previous one, the value of se = 5.320 and is increased

Thank You. Please, Give Thumbs Up.


Related Solutions

A multiple regression model is to be constructed to predict the heart rate in beats per...
A multiple regression model is to be constructed to predict the heart rate in beats per minute (bpm) of a person based upon their age, weight and height. Data has been collected on 30 randomly selected individuals: hide data Heart Rate (bpm) Age (yrs) Weight (lb) Height (in) 78 23 245 70 91 44 223 68 79 42 178 67 60 33 200 58 57 25 99 68 59 35 123 64 78 30 204 62 98 56 200 63...
A multiple regression model is to be constructed to model the time spent using the internet...
A multiple regression model is to be constructed to model the time spent using the internet per week among internet users. The explanatory variables are age, hours spent working per week and annual income. Data has been collected on 30 randomly selected individuals: Time using internet (minutes) Age Hours working per week Annual income ('000) 140 56 39 28 257 35 31 79 163 35 35 34 115 33 52 27 182 45 36 37 214 51 57 80 187...
A teacher wants to develop a model to predict a student’s grade on the final exam...
A teacher wants to develop a model to predict a student’s grade on the final exam from the number of hours spent studying for the final exam and the student’s GPA at the university. The data (for 22 students) follows below.       PREDICTOR               COEF           STDEV               P-VALUE       Constant                      -1.30             1.429                0.405       Hours                           .0793            .0759                0.344       GPA                             1.11             .7543                0.202       ANOVA       SOURCE                                SS                     DF                 MS                F                                        Regression                             5.0040         ...
A teacher wants to develop a model to predict a student’s grade on the final exam...
A teacher wants to develop a model to predict a student’s grade on the final exam from the number of hours spent studying for the final exam and the student’s GPA at the university. The data (for 22 students) follows below.        PREDICTOR                 COEF             STDEV                       P-VALUE        Constant                         -1.30               1.429                          0.405        Hours                              .0793              .0759                          0.344        GPA                                1.11               .7543                          0.202        ANOVA        SOURCE                                    SS                               DF                   MS                  F                                                      Regression                                 5.0040                                                                     Error                                            1.1548                    TOTAL     (a) What is the student’s expected grade if she has a 2.7 GPA and she studies 12 hours for this test? (b) Interpret the slope coefficient for the variable...
I have conducted a linear regression model to predict student scores on an exam based on...
I have conducted a linear regression model to predict student scores on an exam based on the number of hours they studied. I get a coefficient (slope) of +2.5 for the variable of hours studied. The pvalue for this coefficient is 0.45 and the 95% confidence interval is [-2.5, +7]. Which of the following conclusions CANNOT be drawn from these results? At an alpha of 0.05, we can say that the effect of hours studied on exam score is significant...
Suppose a bank would like to develop a regression model to predict a? person's credit score...
Suppose a bank would like to develop a regression model to predict a? person's credit score based on his or her? age, weekly?income, highest education level? (high school, bachelor? degree, graduate? degree), and whether or not he or she owns or rents his or her primary residence. The accompanying table provides these data for a random sample of customers. Complete parts a through d below Credit_Score   Income_($)      Age      Education        Residence 592                              1,383   55        Bachelor         Own 702                              1,707   65       ...
Use the following data to develop a multiple regression model to predict from and . Discuss...
Use the following data to develop a multiple regression model to predict from and . Discuss the output, including comments about the overall strength of the model, the significance of the regression coefficients, and other indicators of model fit. y x1 x2 198 29 1.64 214 71 2.81 211 54 2.22 219 73 2.70 184 67 1.57 167 32 1.63 201 47 1.99 204 43 2.14 190 60 2.04 222 32 2.93 197 34 2.15 Appendix A Statistical Tables *(Round...
a. Construct a regression model using all three independent variables. Let y be the final exam...
a. Construct a regression model using all three independent variables. Let y be the final exam scores,x1 be the GPAs, x2 be the number of hours spent studying, and x3be the number of absences during the semester. b.Calculate the multiple coefficient of determination c.Test the significance of the overall regression model using significance=0.10 d.Calculate the adjusted multiple coefficent of determination Score   GPA   Hours   Absences 67   2.53   3.0   0 68   2.25   4.0   3 69   2.60   2.5   1 71   3.11   0.5   0...
a. Construct a regression model using all three independent variables. Let y be the final exam...
a. Construct a regression model using all three independent variables. Let y be the final exam scores,x1 be the GPAs, x2 be the number of hours spent studying, and x3be the number of absences during the semester. b.Calculate the multiple coefficient of determination c.Test the significance of the overall regression model using significance=0.10 d.Calculate the adjusted multiple coefficent of determination Score   GPA   Hours   Absences 67   2.53   3.0   0 68   2.25   4.0   3 69   2.60   2.5   1 71   3.11   0.5   0...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars)...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars) in a city based on population (thousands of people), and income (thousands of dollars). Use the alpha level of 0.05. City Monthly Rent ($) 2018 Population (Thousands) 2010 Median Income (Thousands of Dollars) Denver, CO 998 586.158 45.438 Birmingham, AL 711 212.237 301.704 San Diego, CA 1414 1307.402 61.962 Gainesville, FL 741 124.354 28.653 Winston-Salem, NC 750 239.617 41.979 Memphis, TN 819 646.889 36.535...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT