Question

In: Statistics and Probability

Here is data with y as the response variable. x y 43.2 52.4 52.8 58.7 52.5...

Here is data with y as the response variable.

x y
43.2 52.4
52.8 58.7
52.5 48.6
189.8 112.4
64.6 49.8
47.6 57.2
31.4 36.4
66.6 60.1

Make a scatter plot of this data. Which point is an outlier?

Enter as an ordered pair. For example (a,b) - with parenthesis.


Find the regression equation for the data set without the outlier.


Enter as an equation of the form y=a+bxy=a+bx. Rounded to three decimal places. For this WAMAP question, do not include the hat in y-hat.


Find the regression equation for the data set with the outlier.


Enter as an equation of the form y=a+bxy=a+bx. Rounded to three decimal places. For this WAMAP question, do not include the hat in y-hat.

Is this outlier an influential point? An influential point, when removed from the data, will change the regression equation drastically.

  • No, the outlier does not appear to be an influential point.
  • Yes, the outlier appears to be an influential point.


Solutions

Expert Solution

....................

(43.2, 52.4)
(52.8 , 58.7)
(52.5 , 48.6)

(189.8 ,112.4
(64.6 , 49.8)
(47.6 , 57.2)
(31.4 , 36.4)
(66.6 , 60.1)
)

ΣX ΣY Σ(x-x̅)² Σ(y-ȳ)² Σ(x-x̅)(y-ȳ)
total sum 358.7 363.2 889.9571429 397.4 388.55
mean 51.24 51.89 SSxx SSyy SSxy

sample size ,   n =   7          
here, x̅ = Σx / n=   51.24   ,     ȳ = Σy/n =   51.89  
                  
SSxx =    Σ(x-x̅)² =    889.9571          
SSxy=   Σ(x-x̅)(y-ȳ) =   388.6          
                  
estimated slope , ß1 = SSxy/SSxx =   388.6   /   889.957   =   0.4366
                  
intercept,   ß0 = y̅-ß1* x̄ =   29.5131          
                  
so, regression line is   Ŷ =   29.513   +   0.437   *x
.............

ΣX ΣY Σ(x-x̅)² Σ(y-ȳ)² Σ(x-x̅)(y-ȳ)
total sum 548.2 475.6 17615.615 3601.6 7709.27
mean 68.53 59.45 SSxx SSyy SSxy

sample size ,   n =   8          
here, x̅ = Σx / n=   68.53   ,     ȳ = Σy/n =   59.45  
                  
SSxx =    Σ(x-x̅)² =    17615.6150          
SSxy=   Σ(x-x̅)(y-ȳ) =   7709.3          
                  
estimated slope , ß1 = SSxy/SSxx =   7709.3   /   17615.615   =   0.4376
                  
intercept,   ß0 = y̅-ß1* x̄ =   29.4608          
                  
so, regression line is   Y =   29.461   +   0.438   *x

...............


                  

  • No, the outlier does not appear to be an influential point.

...................

THANKS

revert back for doubt

please upvote


Related Solutions

Here is data with y as the response variable. x y 71.4 25.1 82.8 34.4 81...
Here is data with y as the response variable. x y 71.4 25.1 82.8 34.4 81 75.3 77.1 72.8 84.6 63.4 64.8 75.8 85.4 117.6 -14.1 173.3 76.7 51.2 63.3 140.8 Make a scatter plot of this data. Which point is an outlier? Enter as an ordered pair, e.g., (x,y). (x,y)= Find the regression equation for the data set without the outlier. Enter the equation of the form mx+b rounded to three decimal places. ˆywo= Find the regression equation for...
Here is data with y as the response variable. x y 57.8 47.7 65.3 42.7 61...
Here is data with y as the response variable. x y 57.8 47.7 65.3 42.7 61 30.8 54.5 26.4 -70.7 -338.8 63 45.7 38.9 -46.1 70.2 35.5 Make a scatter plot of this data. Which point is an outlier? Enter as an ordered pair. For example (a,b) - with parenthesis. Find the regression equation for the data set without the outlier. Enter as an equation of the form y = a + b x . Rounded to three decimal places....
Here is data with y as the response variable. (x,y): 59.2 48.5, 64.5 46.8, 47.4 13.8,...
Here is data with y as the response variable. (x,y): 59.2 48.5, 64.5 46.8, 47.4 13.8, -37.3 10, 71.4 84.9, 71.6 18.3, 64.4 28.3, 71.1 89.6, 56.2 6.2, 56.7 11.7, 69.1 23.8 Make a scatter plot of this data. Which point is an outlier? Enter as an ordered pair. For example (a,b) - with parenthesis. 1. Find the regression equation for the data set without the outlier. Enter as an equation of the form y = a + b x...
A. Here is a bivariate data set. x y 25.7 52.5 29.4 64.7 21.8 54.9 35...
A. Here is a bivariate data set. x y 25.7 52.5 29.4 64.7 21.8 54.9 35 63.4 31.4 74.7 21.6 46.5 40.8 69.7 37.9 77.7 16.7 41.1 29.6 78.1 13.1 45.5 36.1 78.4 32.1 68.2 45.2 76.8 36.1 57.9 Find the correlation coefficient and report it accurate to three decimal places. r = What proportion of the variation in y can be explained by the variation in the values of x? Report answer as a percentage accurate to one decimal...
Consider data regarding a response y and an explanatory variable x, both numeric x y 1...
Consider data regarding a response y and an explanatory variable x, both numeric x y 1 2.8 0.7 2 2.6 1.3 3 6.8 -1.1 4 3.0 0.2 5 4.7 1.1 6 5.0 -0.1 7 5.0 0.9 8 2.9 1.0 9 7.0 -0.2 10 3.7 0.8 The null hypothesis that the expected value of the response is constant for all values of the explanatory variable is: Select one: a. Not rejected with a significance level of 5%. b. Rejected with a...
Collect data on one response (dependent or y) variable and two different explanatory (independent or x)...
Collect data on one response (dependent or y) variable and two different explanatory (independent or x) variables. This will require a survey with three questions. For example: To predict a student’s GPA (y), you might collect data on two x variables: SAT score and age. So we would be trying to determine if there was a linear correlation between someone’s SAT score and their GPA, as well as their age and their GPA. (Note: students may not choose GPA as...
Collect data on one response (dependent or y) variable and two different explanatory (independent or x)...
Collect data on one response (dependent or y) variable and two different explanatory (independent or x) variables. This will require a survey with three questions. For example: To predict a student’s GPA (y), you might collect data on two x variables: SAT score and age. So we would be trying to determine if there was a linear correlation between someone’s SAT score and their GPA, as well as their age and their GPA. (Note: students may not choose GPA as...
a)By​ hand, draw a scatter diagram treating x as the explanatory variable and y as the response variable
x 4 5 6 8 9 y 6 8 11 14 16 a)By​ hand, draw a scatter diagram treating x as the explanatory variable and y as the response variable b)Find the equation of the line containing the points (44​,66​) and​ (9,16). c)Graph the line found in part​ (b) on the scatter diagram. d)By​ hand, determine the​ least-squares regression line. e)Graph the​ least-squares regression line on the scatter diagram. f)Compute the sum of the squared residuals for the line found...
Consider a binary response variable y and an explanatory variable x. The following table contains the...
Consider a binary response variable y and an explanatory variable x. The following table contains the parameter estimates of the linear probability model (LPM) and the logit model, with the associated p-values shown in parentheses. Variable LPM Logit Constant −0.78 −5.90 (0.03 ) (0.03 ) x 0.04 0.28 (0.07 ) (0.02 ) a. Test for the significance of the intercept and the slope coefficients at the 5% level in both models. Coefficients LPM Logit Intercept Slope b. What is the...
Consider a binary response variable y and an explanatory variable x. The following table contains the...
Consider a binary response variable y and an explanatory variable x. The following table contains the parameter estimates of the linear probability model (LPM) and the logit model, with the associated p-values shown in parentheses. Variable LPM Logit Constant −0.70 −6.60 (0.03 ) (0.03 ) x 0.04 0.18 (0.04 ) (0.03 ) a. Test for the significance of the intercept and the slope coefficients at the 5% level in both models. coefficient LPM Logit intercept slope b. What is the...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT