In: Statistics and Probability
You are given the following data set: {(0,3), (0.5,4.2), (1,5.2), (1.1, 5), (1.5, 5.6)}, where the first coordinate is the independent (explanatory) variable, and the second coordinate is the dependent variable.
(a) What is the variance of the data?
(b) What is the mean squared error of the model that has
y-intercept 3 and slope 2?
(c) Use R2 to argue how much the feature in (c) explains the variation in the data.
(Note: In Part b the estimated parameter values obtained using least square method are rounded to nearest integer. )