Question

In: Statistics and Probability

On this worksheet, make an XY scatter plot linked to the following data: X Y 92...

On this worksheet, make an XY scatter plot linked to the following data:

X Y
92 22
87 23
102 23
80 25
91 27
100 20
95 21
109 19
77 28
100 221
98 25
89 27
97 23
93 22
89 27
91 22
97 21
105 21
88 22
83 24
86 27
89 26
79 30
88 22
94 24
18) Add trendline, regression equation and r squared to the plot.
Add this title. ("Scatterplot of X and Y Data")
19) The scatterplot reveals a point outside the point pattern. Copy the data to a new location in the worksheet. You now have 2 sets of data.
Data that are more tha 1.5 IQR below Q1 or more than 1.5 IQR above Q3 are considered outliers and must be investigated.
It was determined that the outlying point resulted from data entry error. Remove the outlier in the copy of the data.

Make a new scatterplot linked to the cleaned data without the outlier, and add title ("Scatterplot without Outlier,") trendline, and regression equation label

20)

Compare the regression equations of the two plots. How did removal of the outlier affect the slope and R2?

Solutions

Expert Solution

Based on the given data, using excel,

We find that our data has one outlier with co-ordinates (100,221)

Regression Equation and R2:

18. The trend line is expressed as: y = 0.799x - 41.83 and R2 is obtained as 2.6%.

To add the title:

19. Obtaining the inter quartile range:

We find that the point Y = 221 lies outside the interquartile range.

Removing the outlier: Creating a scatter plot, regression line and computing R2 for the cleaned data:

20.Comparing the two scatter plots:

We find that the slope decreased from 0.799 to -0.25, from positive slope to negative, suggesting that the outlier was an influential observation that pulled the fitted regression line towards it.Also removal of the outlier increased R2 significantly from 0.026 to 0.495. The model is now a better fit to the data; as X, now, explains a larger proportion of variation (49.5%) in Y than before (2.6%) .


Related Solutions

17) On this worksheet, make an XY scatter plot linked to the following data: X Y...
17) On this worksheet, make an XY scatter plot linked to the following data: X Y 92 22 87 23 102 23 80 25 91 27 100 20 95 21 109 19 77 28 100 221 98 25 89 27 97 23 93 22 89 27 91 22 97 21 105 21 88 22 83 24 86 27 89 26 79 30 88 22 94 24 18) Add trendline, regression equation and r squared to the plot. Add this title....
On this worksheet, make an XY scatter plot linked to the following data: X 22 48...
On this worksheet, make an XY scatter plot linked to the following data: X 22 48 37 30 24 10 42 30 41 29 16 36 45 11 31 26 31 33 46 22 13 22 32 49 35 Y 3872 9312 5217 4230 4536 1820 8274 121 6314 3828 2448 6156 7515 1309 3534 4576 5797 4983 6670 2464 2197 3278 5408 7497 5705 Add trendline, regression equation and r squared to the plot. Add this title. ("Scatterplot of...
On this worksheet, make an XY scatter plot linked to the following data:1.01,2.8482, 1.48, 4.2772, 1.8,...
On this worksheet, make an XY scatter plot linked to the following data:1.01,2.8482, 1.48, 4.2772, 1.8, 4.788, 1.81, 5.3757, 1.07, 2.5252, 1.53, 3.0906, 1.46, 4.3362, 1.38, 3.2016, 1.77, 4.3542, 1.88, 4.8692, 1.32, 3.8676, 1.75, 3.9375, 1.94, 5.7424, 1.19, 2.4752, 1.31, 26.2, 1.56, 4.5708, 1.16, 2.842, 1.22, 2.44, 1.72, 5.1256, 1.45, 4.3355, 1.43, 4.2471, 1.19, 3.5343, 2, 5.46, 1.6, 3.84, 1.58, 3.8552 Add trendline, regression equation and r squared to the plot.Add this title. ("Scatterplot of X and Y Data"). The...
Plot the contours of u(x,y)=xy and its harmonic conjugate v(x,y).
Plot the contours of u(x,y)=xy and its harmonic conjugate v(x,y).
Complete the following for the data set: Scatter Plot Calculate the regression line in Y-intercept form...
Complete the following for the data set: Scatter Plot Calculate the regression line in Y-intercept form (do this piece by piece in Excel, or by hand) Interpret in Words your Beta coefficient If X=5; then your Y-hat equals what? Is this a good estimate or not (explain in words) Plot the regression line Use STATA to calculate and interpret the R2 Yi Xi 2 11 4 9 4 14 6 9 8 9 10 8 10 13 11 5 13...
Complete the following for the data set: Scatter Plot Calculate the regression line in Y-intercept form...
Complete the following for the data set: Scatter Plot Calculate the regression line in Y-intercept form (do this piece by piece in Excel, or by hand) Interpret in Words your Beta coefficient If X=5; then your Y-hat equals what? Is this a good estimate or not (explain in words) Plot the regression line Use STATA to calculate and interpret the R2 DATA SET 1: Yi Xi 14 3 11 5 11 3 8 8 5 7 7 10 4 9...
For the following data​ (a) display the data in a scatter​ plot, (b) calculate the correlation...
For the following data​ (a) display the data in a scatter​ plot, (b) calculate the correlation coefficient​ r, and​ (c) make a conclusion about the type of correlation. The ages​ (in years) of 6 children and the number of words in their vocabulary ​ Age, x 1 2 3 4 5 6 Vocabulary​ size, y 150 1100 1150 1800 2050 2700 A] The correlation coefficient r is
Contract the scatter plot of these data. Describe relationship between x and y. What type of relationship appears to exist between two variables?
Use the following data: x y 10 3 6 7 9 3 3 8 2 9 8 5 3 7 Contract the scatter plot of these data. Describe relationship between x and y. What type of relationship appears to exist between two variables? (you can copy and past from Excel,SAS,etc) Compute the correlation coefficient r. Test to determine whether the population correlation coefficient is positive. Use the α=0.01 level to conduct test. (calculate test statistics and make conclusion)
Below are four bivariate data sets and the scatter plot for each. (Note that each scatter...
Below are four bivariate data sets and the scatter plot for each. (Note that each scatter plot is displayed on the same scale.) Each data set is made up of sample values drawn from a population. x y 1.0 10.0 2.0 9.0 3.0 8.0 4.0 7.0 5.0 6.0 6.0 5.0 7.0 4.0 8.0 3.0 9.0 2.0 10.0 1.0 x 1 2 3 4 5 6 7 8 9 10 11 y 1 2 3 4 5 6 7 8 9...
1.) Sketch a scatter plot from the following data, and determine the equation of the regression...
1.) Sketch a scatter plot from the following data, and determine the equation of the regression line. x 125 119 103 91 50 29 24 y 2.)Investment analysts generally believe the interest rate on bonds is inversely related to the prime interest rate for loans; that is, bonds perform well when lending rates are down and perform poorly when interest rates are up. Can the bond rate be predicted by the prime interest rate? Use the following data to construct...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT