Question


A psychology instructor wants to find a suitable predictor of the Final examination marks of his students. He thinks that either the Assignment marks or the Mid-term test marks can be used for this purpose. However, he is not sure which of these is more suitable. The following table shows the Assignment marks (out of 20), Mid-term test marks (out of 20) and the Final examination marks (out of 40) of 5 randomly selected students of his psychology class last year. The data in a given row relate to the same student.

Student number   Assignment marks   Mid-term test marks   Final exam marks
1                14                 11                    23
2                17                 15                    40
3                20                 20                    40
4                10                 11                    29
5                16                 13                    35

Assuming that there is a linear relationship between the Assignment marks and the Final examination marks, calculate Pearson’s correlation coefficient. Round your answer to 3 decimal places.

Assuming that there is a linear relationship between the Mid-term test marks and the Final examination marks,

i. Derive the least squares prediction line to predict the Final examination marks based on the Mid-term test marks. (9 points)

ii. Calculate the coefficient of determination for the least squares prediction line. (3 points)

iii. Interpret the value of the coefficient of determination in relation to this situation. (2 points)

Out of the two variables ‘Assignment marks’ and ‘Mid-term test marks’, which variable is more suitable to use as the independent variable in a least squares prediction line to predict the Final examination marks? Explain the reason for your answer. (

Solutions

Expert Solution

Assignment marks (x) and Final examination marks (y):

        x       y        (x-x̅)²    (y-ȳ)²    (x-x̅)(y-ȳ)
        14      23       1.96      108.16    14.56
        17      40       2.56      43.56     10.56
        20      40       21.16     43.56     30.36
        10      29       29.16     19.36     23.76
        16      35       0.36      2.56      0.96
Sum     77.00   167.00   55.20     217.20    80.20
Mean    15.40   33.40

SSxx = Σ(x-x̅)² = 55.20, SSyy = Σ(y-ȳ)² = 217.20, SSxy = Σ(x-x̅)(y-ȳ) = 80.20

Correlation coefficient: r = SSxy/√(SSxx·SSyy) = 80.20/√(55.20 × 217.20) = 0.732
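The hand calculation above can be checked with a few lines of Python. This is only a minimal sketch using the sums-of-squares form of Pearson's r; the function name `pearson_r` and the variable names are illustrative and not part of the original solution.

```python
from math import sqrt

# Data from the question's table: Assignment marks (x) and Final exam marks (y).
assignment = [14, 17, 20, 10, 16]
final = [23, 40, 40, 29, 35]


def pearson_r(x, y):
    """Pearson's r computed as SSxy / sqrt(SSxx * SSyy)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    ss_xx = sum((xi - x_bar) ** 2 for xi in x)
    ss_yy = sum((yi - y_bar) ** 2 for yi in y)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    return ss_xy / sqrt(ss_xx * ss_yy)


r_assignment = pearson_r(assignment, final)
print(round(r_assignment, 3))  # 0.732
```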

====================

Mid-term test marks (x) and Final examination marks (y):

        x       y        (x-x̅)²    (y-ȳ)²    (x-x̅)(y-ȳ)
        11      23       9.00      108.16    31.20
        15      40       1.00      43.56     6.60
        20      40       36.00     43.56     39.60
        11      29       9.00      19.36     13.20
        13      35       1.00      2.56      -1.60
Sum     70.00   167.00   56.00     217.20    89.00
Mean    14.00   33.40

SSxx = Σ(x-x̅)² = 56.00, SSyy = Σ(y-ȳ)² = 217.20, SSxy = Σ(x-x̅)(y-ȳ) = 89.00

i)

Sample size: n = 5
Here, x̅ = Σx/n = 14.000 and ȳ = Σy/n = 33.400

SSxx = Σ(x-x̅)² = 56.0000
SSxy = Σ(x-x̅)(y-ȳ) = 89.0

Estimated slope: β1 = SSxy/SSxx = 89.0/56.000 = 1.58929

Intercept: β0 = ȳ - β1·x̅ = 33.400 - 1.58929 × 14.000 = 11.15000

So the regression line is Ŷ = 11.15 + 1.59x
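The slope and intercept derived in part i) can be reproduced with a short script. This is a sketch under the same least squares formulas used above (β1 = SSxy/SSxx, β0 = ȳ - β1·x̅); the helper name `least_squares_line` is hypothetical.

```python
# Data from the question's table: Mid-term test marks (x) and Final exam marks (y).
midterm = [11, 15, 20, 11, 13]
final = [23, 40, 40, 29, 35]


def least_squares_line(x, y):
    """Return (intercept, slope) of the least squares line y = b0 + b1 * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    ss_xx = sum((xi - x_bar) ** 2 for xi in x)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = ss_xy / ss_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope


b0, b1 = least_squares_line(midterm, final)
print(f"{b0:.2f} {b1:.5f}")  # 11.15 1.58929
```

A predicted Final examination mark for a given Mid-term test mark x is then `b0 + b1 * x`; predictions are only meaningful within the range of marks observed in the sample.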

ii) R² = (SSxy)²/(SSxx·SSyy) = (89.0)²/(56.0 × 217.2) = 0.6512

iii) About 65.12% of the variation in the Final examination marks is explained by their linear relationship with the Mid-term test marks.

iv) ‘Mid-term test marks’ is the more suitable independent variable for a least squares prediction line for the Final examination marks, because its R² is higher: 0.6512 for Mid-term test marks, against r² = (0.732)² ≈ 0.536 for Assignment marks.
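The comparison between the two candidate predictors can be sketched in Python by computing the coefficient of determination for each one. The function name `r_squared` is illustrative; for a simple linear regression with one predictor, R² equals the square of Pearson's r, which is the identity assumed here.

```python
# Data from the question's table.
assignment = [14, 17, 20, 10, 16]
midterm = [11, 15, 20, 11, 13]
final = [23, 40, 40, 29, 35]


def r_squared(x, y):
    """Coefficient of determination for a one-predictor least squares line:
    SSxy**2 / (SSxx * SSyy)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    ss_xx = sum((xi - x_bar) ** 2 for xi in x)
    ss_yy = sum((yi - y_bar) ** 2 for yi in y)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    return ss_xy ** 2 / (ss_xx * ss_yy)


r2_midterm = r_squared(midterm, final)
r2_assignment = r_squared(assignment, final)
print(round(r2_midterm, 3), round(r2_assignment, 3))  # 0.651 0.536

# The predictor with the larger R² explains more of the variation in final marks.
better = "Mid-term test marks" if r2_midterm > r2_assignment else "Assignment marks"
print(better)  # Mid-term test marks
```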

