Question

In: Statistics and Probability

Consider the following data for 15 subjects with two predictors. The dependent variable, MARK, is the...

Consider the following data for 15 subjects with two predictors. The dependent variable, MARK, is the total score for a subject on an examination. The first predictor, COMP, is the score for the subject on a so-called compulsory paper. The other predictor, CERTIF, is the score for the subject on a previous exam.

Student

MARK

COMP

CERTIF

1

476

111

68

2

457

92

46

3

540

90

50

4

551

107

59

5

575

98

50

6

698

150

66

7

545

118

54

8

574

110

51

9

645

117

59

10

556

94

97

11

634

130

57

12

637

118

51

13

390

91

44

14

562

118

61

15

560

109

66

a.Run a stepwise regression on the dataset

b.Does CERTIF add anything to predicting MARK, above and beyond that of COMP?

c. Write out the prediction equation

d. A statistician wishes to know the sample size needed in a multiple regression study. She has four predictors and can tolerate at most a .10 drop-off in predictive power. But she wants this to be the case with .95 probability. From previous related research, the estimated squared population multiple correlation is .62. How many subjects are needed?

Solutions

Expert Solution

a.Run a stepwise regression on the dataset

Load the data into Excel.

Go to Data>Megastat.

Select the option Correlation/Regression and go to Regression.

Select COMP and CERTIF as the independent variable(s), x.

Select MARK as the dependent variable, y.

Click OK.

The output will be as follows:

0.583
Adjusted R² 0.514 n   15
R   0.764 k   2
Std. Error   54.463 Dep. Var. MARK
ANOVA table
Source SS   df   MS F p-value
Regression 49,791.1563 2   24,895.5782 8.39 .0052
Residual 35,594.8437 12   2,966.2370
Total 85,386.0000 14  
Regression output confidence interval
variables coefficients std. error    t (df=12) p-value 95% lower 95% upper
Intercept 124.0641
COMP 3.5120 0.8975 3.913 .0021 1.5566 5.4674
CERTIF 0.8346 1.1346 0.736 .4761 -1.6375 3.3068

b.Does CERTIF add anything to predicting MARK, above and beyond that of COMP?

CERTIF add a value 0.8345 to predict MARK, which is beyond that of COMP.

c. Write out the prediction equation.

The prediction equation is:

MARK = 124.0641 + 3.5120*COMP + 0.8346*CERTIF

Or

The total score for a subject on an examination = 124.0641 + 3.5120*Score for the subject on a so-called compulsory paper + 0.8346*Score for the subject on a previous exam

d. A statistician wishes to know the sample size needed in a multiple regression study. She has four predictors and can tolerate at most a .10 drop-off in predictive power. But she wants this to be the case with .95 probability. From previous related research, the estimated squared population multiple correlation is .62. How many subjects are needed?

The formula for calculating sample size is:

n = p(1 - p)(z/E)2

We are given:

p = 0.62 and E = 0.1

The critical value at the 0.05 significance level is 1.96.

Subject needed = 0.62(1 - 0.62)(1.96/0.1)2

= 0.2356*(19.6)2

= 0.2356*384.16

= 90.5 = 91

Therefore, we need atleast 91 subjects.


Related Solutions

A statistical program is recommended. Consider the following data for a dependent variable y and two...
A statistical program is recommended. Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 94 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 211 (a) Develop an estimated regression equation relating y to x1. (Round your numerical values to one decimal place.)ŷ =   Predict y if x1 = 43. (Round...
Consider the following data for a dependent variable and two independent variables, and . x1= 30...
Consider the following data for a dependent variable and two independent variables, and . x1= 30 46 24 50 41 51 74 36 60 77. x2 = 13 11 17 17 6 20 8 12 13 16. y = 95 109 112 178 94 176 170 117 142 212 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating to y to x1 . Predict y if...
Consider the following data for a dependent variable y and two independent variables, x2 and x1....
Consider the following data for a dependent variable y and two independent variables, x2 and x1. x1 x2 y 30 12 94 46 10 109 24 17 113 50 17 179 41 5 94 51 19 175 75 8 171 36 12 118 59 13 143 77 17 212 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating y to x1 . Predict y if x1=45....
Consider the following data for a dependent variable y and two independent variables, x1 and x2.
You may need to use the appropriate technology to answer this question. Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 93 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 211 The estimated regression equation for these data is ŷ = −18.89 + 2.02x1 + 4.74x2. Here, SST = 15,276.0,...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 94 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 209 The estimated regression equation for these data is ŷ = −17.02 + 1.99x1 + 4.70x2. Here, SST = 14,902.9, SSR = 13,773.1, sb1 = 0.2470, and sb2 = 0.9480. (1a) Test...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 13 95 46 10 108 25 18 113 50 16 179 40 5 95 51 20 176 74 7 170 36 12 117 59 13 142 77 16 211 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating y to x1. ŷ =_________ +___________ x1 Predict...
Consider the following data for a dependent variable y and two independent variables, x1 and x2....
Consider the following data for a dependent variable y and two independent variables, x1 and x2. x1 x2 y 30 12 95 47 10 108 25 17 112 51 16 178 40 5 94 51 19 175 74 7 170 36 12 117 59 13 142 76 16 212 The estimated regression equation for these data is ŷ = −18.52 + 2.01x1 + 4.75x2. Here, SST = 15,234.1, SSR = 14,109.8, sb1 = 0.2464, and sb2 = 0.9457. (a)Test for...
Consider the following data for a dependent variable y and two independent variables, x1and x2. x...
Consider the following data for a dependent variable y and two independent variables, x1and x2. x 1 x 2 y 29 13 94 47 10 109 24 17 113 50 16 178 40 6 95 52 20 176 75 7 171 37 13 118 59 14 142 77 17 211 Round your all answers to two decimal places. Enter negative values as negative numbers, if necessary. a. Develop an estimated regression equation relating y to x1. ŷ =  +  x1 Predict y...
Consider the following regression results of B for time series data. The dependent variable is the...
Consider the following regression results of B for time series data. The dependent variable is the log of real consumption and the regressors are all lagged by one period. The coefficients are estimated by OLS method. *, **, and *** indicate that the coefficients are significant at the 10%, 5%, and 1% level, respectively. The F- test in the last row tests the joint hypothesis that all the coefficients except the constant are zero and their p-values are provided in...
15. In an experiment, an independent variable is _______ and a dependent variable is _______. Group...
15. In an experiment, an independent variable is _______ and a dependent variable is _______. Group of answer choices Manipulated, measured Measured, manipulated Discrete, summation Continuous, manipulated 16. Outliers are Group of answer choices The lowest and highest scores in a data set Extreme or unusual values All options present The lowest value in a data set 17. Assume that we have the following set of data:     Score                 11, 12, 17, 18, 19, 20, 21, 22, 23, 24,...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT