Question

In: Statistics and Probability

A psychologist conducted an experiment analysing the relationship between student scores in an exam and the...

A psychologist conducted an experiment analysing the relationship between student scores in an exam and the amount of attention they paid in class. The latter was measured using a type of brain monitor. The Psychologist believed that scores would increase by 1 for every two unit increase in attention. The data are listed in the excel spreadsheet.

Estimate a linear regression between the score (Y) and the measure of attention(X).

(a) Write out the equation for Y in the form , but with coefficients. Show the estimated standard errors in parenthesis below the coefficients. What is the R2 of the regression? Calculate a 99 percent confidence interval for β.                                                               [5 pts]

(b) What are the mean and the estimated standard deviation of the estimated residuals? [2 pts]

Hint: the first answer is definitional and the second answer is easily seen from the output.

(c )Test the hypothesis that there is no relationship between the variables at the 90 percent significance level.                                                                                                    [3 pts]

(d) Test the hypothesis that the coefficient β=0.5 at the 99% significance level.               [3 pts]

(e) The Psychologist concluded from the experiment that test scores increase significantly if students pay attention in class. In one word, how would you describe the results of this experiment based on the data you have?                                                                                                       [2 pts]

DATA:

Regression data for Psychology Experiment
Attention Score
18 80
35 90
86 80
22 50
72 76
102 74
86 75
30 80
35 85
94 82
16 80
42 41
50 50
96 96
60 80
106 70
80 65
14 14
11 14
80 85
12 14
37 43
26 80
86 70
5 20
17 20
35 80
76 68
50 70
15 16
90 86
96 80
7 16
10 14
35 65
88 88
20 32
22 70
50 65
22 62
35 50
64 92
68 84
13 15
102 102
86 85
18 24
78 64
98 78
70 80
60 70
98 98
9 14
50 90
104 72
35 45
60 60
74 72
88 88
80 95
22 58
8 14
86 110
60 75
92 84
60 100
80 75
86 95
16 18
86 90
35 75
35 60
80 60
80 70
104 104
80 100
60 90
86 100
62 96
60 65
39 41
50 80
50 75
6 18
60 95
22 54
21 40
100 100
94 94
80 90
48 41
106 106
50 43
46 41
90 90
60 85
92 92
22 80
35 70
66 88
80 60
50 60
80 80
100 76
50 45
86 65
19 28
50 85
22 75
86 105

Solutions

Expert Solution

a)

ΣX ΣY Σ(x-x̅)² Σ(y-ȳ)² Σ(x-x̅)(y-ȳ)
total sum 6262 7445 100985.4182 74369.9 63083.45
mean 56.93 67.68 SSxx SSyy SSxy

sample size ,   n =   110          
here, x̅ = Σx / n=   56.93   ,     ȳ = Σy/n =   67.68  
                  
SSxx =    Σ(x-x̅)² =    100985.4182          
SSxy=   Σ(x-x̅)(y-ȳ) =   63083.5          
                  
estimated slope , ß1 = SSxy/SSxx =   63083.5   /   100985.418   =   0.6247
                  
intercept,   ß0 = y̅-ß1* x̄ =   32.1206          
                  
so, regression line is   Ŷ =   32.1206   +   0.6247   *x
(3.65) (0.057)
SSE=   (SSxx * SSyy - SS²xy)/SSxx =    34962.964          
                  
std error ,Se =    √(SSE/(n-2)) =    17.993          
                  
correlation coefficient ,    r = Sxy/√(Sx.Sy) =   0.7279          
                  
R² =    (Sxy)²/(Sx.Sy) =    0.5299   

α=   0.01              
t critical value=   t α/2 =    1.982   [excel function: =t.inv.2t(α/2,df) ]      
estimated std error of slope = Se/√Sxx =    17.99253   /√   100985.42   =   0.057
                  
margin of error ,E= t*std error =    1.982   *   0.057   =   0.112
estimated slope , ß^ =    0.6247              
                  
                  
lower confidence limit = estimated slope - margin of error =   0.6247   -   0.112   =   0.5124
upper confidence limit=estimated slope + margin of error =   0.6247   +   0.112   =   0.7369


B)

ANOVA
df SS MS F Significance F
Regression 1 39406.9 39406.9 121.7272 2.08E-19
Residual 108 34962.96 323.7311
Total 109 74369.86

c)

slope hypothesis test               tail=   2
Ho:   ß1=   0          
H1:   ß1╪   0          
n=   110              
alpha =   0.1              
estimated std error of slope =Se(ß1) = Se/√Sxx =    17.993   /√   100985.42   =   0.0566
                  
t stat = estimated slope/std error =ß1 /Se(ß1) =    0.6247   /   0.0566   =   11.0330
                  
t-critical value=    1.6591   [excel function: =T.INV.2T(α,df) ]          
Degree of freedom ,df = n-2=   108              
p-value =    0.0000              
decison :    p-value<α , reject Ho              
Conclusion:   Reject Ho and conclude that slope is significantly different from zero              

e)

Relationship is moderate positive relatioship. It means although score are increasing but not at the rate expected.

for each 0.6247 increase in attention, score is increaseed by 1.

Please revert back in case of any doubt.

Please upvote. Thanks in advance.



Related Solutions

I have conducted a linear regression model to predict student scores on an exam based on...
I have conducted a linear regression model to predict student scores on an exam based on the number of hours they studied. I get a coefficient (slope) of +2.5 for the variable of hours studied. The pvalue for this coefficient is 0.45 and the 95% confidence interval is [-2.5, +7]. Which of the following conclusions CANNOT be drawn from these results? At an alpha of 0.05, we can say that the effect of hours studied on exam score is significant...
A marketing research experiment was conducted to study the relationship between the length of time necessary...
A marketing research experiment was conducted to study the relationship between the length of time necessary for a buyer to reach a decision (y) and the number of alternative package designs (2, 3, or 4 designs). Remember that s = √MSE. Use the tables below to answer the following questions. Estimate Std. Error t value Pr(>|t|) (Intercept) 3.633 0.865 4.200 0.001 Design 1.700 0.449 3.786 0.001 Observations s R-squared Sxx mean(x) 15 1.468 0.508 10 3 a) Predict the mean...
A health psychologist conducted an experiment in which participants watched a film that either did or...
A health psychologist conducted an experiment in which participants watched a film that either did or did not include a person being injured because of not wearing a seatbelt. A week later, as part of a seemingly different study, these same participants reported how important they thought it was to wear seatbelts (higher scores = greater importance). The 16 participants who had seen the injury film gave a mean rating of 8.9, with a standard deviation of 2.1. The 16...
A student conducted an experiment to determine what factors are important in the rate of a...
A student conducted an experiment to determine what factors are important in the rate of a reaction between potassium carbonate and hydrochloric acid. The student diluted 2.000 mL of 4.000 M K2CO3 to 75.00 mL, then combined that solution with 75.00 mL of 2.000 M HCl. The student tabulated the amount of CO2 gas collected over time and recorded the results in the columns to the left. Time (min) Volume (mL) 1 0.2 2 0.3 3 0.5 4 0.7 5...
Student scores on Professor Combs' Stats final exam are normally distributed with a mean of 72...
Student scores on Professor Combs' Stats final exam are normally distributed with a mean of 72 and a standard deviation of 7.5 Find the probability of the following: **(use 4 decimal places)** a.) The probability that one student chosen at random scores above an 77.   b.) The probability that 20 students chosen at random have a mean score above an 77.   c.) The probability that one student chosen at random scores between a 67 and an 77.   d.) The probability...
Student scores on Professor Combs' Stats final exam are normally distributed with a mean of 77...
Student scores on Professor Combs' Stats final exam are normally distributed with a mean of 77 and a standard deviation of 7.5 Find the probability of the following: **(use 4 decimal places)** a.) The probability that one student chosen at random scores above an 82. b.) The probability that 20 students chosen at random have a mean score above an 82. c.) The probability that one student chosen at random scores between a 72 and an 82. d.) The probability...
A student is taking an exam. Suppose that the probability that the student finishes the exam...
A student is taking an exam. Suppose that the probability that the student finishes the exam in less than x hours is x/2 for x∈[0,2]. Show that the conditional probability that the student does not finish the exam in one hour given that they are still working after 45 minutes is 0.8.
A psychologist is interested in the relationship between color of food and appetite. To explore this...
A psychologist is interested in the relationship between color of food and appetite. To explore this relationship, the researcher bakes small coockies with icing of one of three different colors (green, red or blue). The researcher offers cookies to subjects while they are performing a boring task. Each subject is run individually under the same conditions, except for the color of the icing on the cookies that are available. Six subjects are randomly assigned to each color. The number of...
The score of a student on a certain exam is represented by a number between 0...
The score of a student on a certain exam is represented by a number between 0 and 1. Suppose that the student passes the exam if this number is at least 0.55. Suppose we model this experiment by a continuous random variable X, the score, whose probability density function is given by f(x) = { x if 0 <= x < 0.5 5x - 2 if 0.5 <= x <= 1 0 otherwise } a. What is the probability that...
The file SAT Excel data lists the average high school student scores on the SAT exam...
The file SAT Excel data lists the average high school student scores on the SAT exam by state. There are three components of the SAT: Critical reading, math and writing. These components are listed in Excel. Also, the sum of all 3 components (The Combined column) is listed. The percentage of all potential students who took the SAT is listed by state. Use Excel to help you answer the following questions. Find the mean, median, and mode for each of...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT