Question

In: Statistics and Probability

Problem 1 The following table lists data on the participation scores and midterm exam scores for...

Problem 1

The following table lists data on the participation scores and midterm exam scores for 23 students who took the Accounting Theory class in Spring 2019.

Student Number

Participation Score

Midterm Score

1

5

75

2

6.5

84

3

6.5

73

4

7

96

5

7.5

83

6

7.5

88

7

7.5

75

8

8

75

9

8.5

82.5

10

8.5

89

11

8.5

90

12

8.5

91

13

9

92

14

9

81.5

15

9

95

16

9

88

17

9

89.5

18

9

93

19

9.5

90

20

10

99

21

10

87.5

22

10

88

23

10

81

  1. Using the example above as a guide, construct a regression model that captures the relationship between participation scores and midterm scores.
  2. Using the coefficients from your model, calculate the expected midterm scores for students whose participation scores are 5, 7 and 8 respectively.
  3. Using the expected scores that you computed in (2) above, calculate the unexpected scores for student numbers 1, 4 and 8.
  4. Give three reasons why the expected midterm scores for these students does not equal their actual midterm scores.

Solutions

Expert Solution

Solution

Part (1)

NOTE: Final answers are given below. Back-up Theory and Details of calculations follow at the end.

Regression model:

Mid-term score (y) = 62.6070 + (2.8292 x Participation score) (x) Answer 1

Part (2)

Substituting x = 5, 7 and 8 in the above regression equation we get:

Expected midterm scores for students whose participation scores are 5 is: 76.75 Answer 2

Expected midterm scores for students whose participation scores are 7 is: 82.41 Answer 3

Expected midterm scores for students whose participation scores are 8 is: 85.24 Answer 4 Part (3)

Since participation scores for student numbers 1, 4 and 8 are 5, 7 and 8 respectively,

Expected mid-term scores for student numbers 1, 4 and 8 are respectively:

76.75, 82.41 and 85.24 Answer 5

Part (4)

Comparison

Student #

Expected score (E)

Actual score (A)

Difference E – A

1

76.75

75

1.75

4

82.41

96.

- 13.59

8

85.24

75

10.24

In two cases, E > A and in one case A > E.

The difference is quite significant for students 4 and 8.

Explanations for the difference

1. The model assumed (simple linear) is not adequate to predict. This is quite possible since the correlation coefficient is only 0.51.

2. Assumption made for the linear model such as error terms are iid N(0, σ2), independence of x with error term, etc are not fulfilled.

3. Data recording is not reliable.

Answer 6

Back-up Theory and Details of calculations

The linear regression model: Y = β0 + β1X + ε, ……………………..............................................................…………..(1)

where ε is the error term, which is assumed to be Normally distributed with mean 0 and variance σ2.

Estimated Regression of Y on X is given by: Yhat = β0hat + β1hatX, …………......................................................…….(2)

β1cap = Sxy/Sxx and β0cap = Ybar – β1cap.Xbar.............………………...............................................................….…..(3)

where

[Mean X = Xbar = (1/n) Σ(i = 1 to n)xi ; Mean Y = Ybar = (1/n) Σ(i = 1 to n)yi Sxx = Σ(i = 1 to n)(xi – Xbar)2

Syy = Σ(i = 1 to n)(yi – Ybar)2 ; Sxy = Σ(i = 1 to n){(xi – Xbar)(yi – Ybar)} ] ................................................................ (4)

n

23

Xbar

8.3913

ybar

86.3478

Sxx

36.9783

Syy

1127.2174

Sxy

104.6196

β1cap

2.8292

β0cap

62.6070

r

0.5124

DONE


Related Solutions

An investigator collected data on midterm exam scores and final exam scores of elementary school students;...
An investigator collected data on midterm exam scores and final exam scores of elementary school students; results can summarized as follows. Average SD -------------------------------------------------- Boys' midterm score 70 20 Boys' final score 65 23 girls' midterm score 75 20 girls' final score 80 23 The correlation coefficient between midterm score and final score for the boys was about 0.70; for the girls, it was about the same. If you take the boys and the girls together, the correlation between midterm...
1. The following lists midterm and final exam grades for randomly selected students: Midterm (x) 82...
1. The following lists midterm and final exam grades for randomly selected students: Midterm (x) 82 65 93 70 85 50 95 80 Final (y) 94 77 94 79 95 70 97 91 a). Plot the scatter diagram. b). find the value of the linear correlation coefficient r. c). find the equation of the regression line that best fits the data. d). If you received a 76 on the midterm, what grade could you expect on the final? e). If...
Below is a data set for the scores on a quantum physics midterm exam for five...
Below is a data set for the scores on a quantum physics midterm exam for five students vs. amount of time spent studying for it Time Spent Studying (Hours) Score 0.5 32 1 44 2 63 3.5 79 5 96 Find the equation of the best-fit line. You can do this through Excel or calculating by hand (slope and y-intercept). What does the slope physically represent? What does the y-intercept physically represent? Using your equation, if a student spent 4.3...
Imagine there are five students in a class. Their scores on the midterm exam are: 93,...
Imagine there are five students in a class. Their scores on the midterm exam are: 93, 85, 70, 78, and 64. Calculate the mean, the variance and the standard deviation and interpret the meaning of the variance and standard deviation.
The Math 122 Midterm Exam is coming up. Suppose the exam scores are normally distributed with...
The Math 122 Midterm Exam is coming up. Suppose the exam scores are normally distributed with a population mean of 73.8% and a standard deviation of 11.3%. Let's first create a simulation to observe the expected results for a class of Math 122 students. In Excel, create 25 random samples of 23 students each. This means you should have 23 entries in each column, and you should be using columns A - Y. If you need a refresher for creating...
Exercise 3 The data in the table represent the "Exam Scores" for two random samples of...
Exercise 3 The data in the table represent the "Exam Scores" for two random samples of students. The first group of = 6 students were under active-learning course, and the second group of = 6 students were under traditional lecturing. Note that the standard deviations in the Active group is = 3.43 and in the Traditional group is = 3.03. Active learning Traditional learning 0 7 5 0 7 8 8 2 0 4 3 3 Please answer the following...
Exercise 3 The data in the table represent the "Exam Scores" for two random samples of...
Exercise 3 The data in the table represent the "Exam Scores" for two random samples of students. The first group of n1 = 6 students were under active-learning course, and the second group of n2 = 6 students were under traditional lecturing. Note that the standard deviations in the Active group is s1= 3.43 and in the Traditional group is s2 = 3.03. Active learning Traditional learning 0 7 5 0 7 8 8 2 0 4 3 3 Please...
The following are the midterm exam grades (in %) of a simple random sample of 39...
The following are the midterm exam grades (in %) of a simple random sample of 39 statistical students: 85 64 45 77 53 72 99 59 68 92 48 75 51 93 67 78 89 56 83 71 49 94 63 77 79 88 42 65 92 69 73 56 81 69 61 75 58 67 81 a). make a 75% confidence statement about the mean grade of all statistical students on a similar midterm b). what is the sample...
1 (a). The following data set lists a series of ACT scores {8, 12, 15, 16,...
1 (a). The following data set lists a series of ACT scores {8, 12, 15, 16, 16, 17, 17, 18, 18, 18, 18, 19, 19, 19, 19, 20, 20, 20, 20, 20, 21, 21, 21, 23, 23, 24, 25, 36}. Write down the five number summary and construct a boxplot which includes the position of the upper and lower fences (b). At a tennis tournament a statistician keeps track of every serve. The statistician reported that the mean serve speed...
Problem 1: The following data is in the following of Table. Education Expenditure data, Assuming there...
Problem 1: The following data is in the following of Table. Education Expenditure data, Assuming there is unique variance in each region, using two stage approach weighted least square approach to estimate X1, X2 and X3 effect on Y. (sas programming) STATE Y X1 X2 X3 Region ME 189 2828 351 508 1 NH 169 3259 346 564 1 VT 230 3072 348 322 1 MA 168 3835 335 846 1 RI 180 3549 327 871 1 CT 193 4256...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT