Question

In: Statistics and Probability

Create 2 data sets. One with 5 observations and the other with 15 observations. Illustrate how...

Create 2 data sets. One with 5 observations and the other with 15 observations. Illustrate how variance is sensitive to an extreme score. Also show how sample size mediates the effect of an extreme score.

Solutions

Expert Solution

take 2 data sets

DATA SET 1:- 1,2,3,4,30

DATA SET 2:-   1,2,3,4,5,6,7,8,9,10,11,12,13,14,30

FOR DATA SET 1:

Sample Standard Deviation, s 12.349089035228
Variance (Sample Standard), s2 152.5
Population Standard Deviation, ? 11.045361017187
Variance (Population Standard), ?2 122

FOR DATA SET 2

Sample Standard Deviation, s 7.0710678118655
Variance (Sample Standard), s2 50
Population Standard Deviation, ? 6.8313005106397
Variance (Population Standard), ?2 46.666666666667

(a) lets say data set is 1,2,3,4,5

its results are

Sample Standard Deviation, s 1.5811388300842
Variance (Sample Standard), s2 2.5
Population Standard Deviation, ? 1.4142135623731
Variance (Population Standard), ?2

2

now compare the results of above data set with data set 1. Due to the presence of value 30,there is a huge gap in the standard deviation from 2.5 to 152.5.

Thus, it measures spread around the mean. Because of its close links with the mean, standard deviation can be greatly affected if the mean gives a poor measure of central tendency. Standard deviation is also influenced by outliers onevalue could contribute largely to the results of the standard deviation.as standard deviation varies,So variance also varies.

(b) now compare data set 1 and data set 2

as the sample size increases from 5 to 15, the variance decreased from 152.5 to 50

If your effect size is small then you will need a large sample size in order to detect the difference otherwise the effect will be masked by the randomness in your samples. Essentially, any difference will be well within the associated confidence intervals and you won’t be able to detect it.larger sample sizes give more reliable results with greater precision and power, but they also cost more time and money.


Related Solutions

Define 2 different measures of correlation of 2 data sets to each other.
Define 2 different measures of correlation of 2 data sets to each other.
Create a histogram of this data with 15 bins. Create a box plot of this data.
7, 9, 8, 11, 14, 7, 11, 17, 18, 12, 10, 9, 16, 17, 15, 13, 7, 12, 7, 8, 14, 16, 20, 12, 11, 14, 22, 8, 10, 14, 15, 20, 17, 14, 12, 22, 12, 15, 17, 16, 9, 11, 16, 18, 11, 12, 11, 9, 11, 9, 13, 7, 12, 9, 19, 9, 8, 15, 12, 16, 16, 20, 21, 9, 11, 17, 17, 8, 11, 7, 10, 17, 13, 15, 14, 11, 19,10, 11, 11, 9,...
I have Standard Deviation and Mean of 2 sets of data. Based on the data, how...
I have Standard Deviation and Mean of 2 sets of data. Based on the data, how can we infer at the 5% significance level that the score of individuals in the 4th year is better than the individuals in 1st year? average 71.29 76.98 S.D. 8.58 8.119 Year 1 Year 4 The sample size is 430
D. The data le TreeAgeDiamSugarMaple.txt is available at the same site as the other data sets...
D. The data le TreeAgeDiamSugarMaple.txt is available at the same site as the other data sets you have used in the homework assignments. The data are from 27 maple trees. The rst column of the le is x=tree diameter and the second column is y=tree age (in years). Do the following for these data: (i) Determine a good polynomial regression model for this data using the AIC and/or BIC criteria. (Fit all polynomial regression models upto a maximum degree of...
DATA 3 8 2 15 2 2 0 0 4 5 2 7 0 1 5...
DATA 3 8 2 15 2 2 0 0 4 5 2 7 0 1 5 3 0 2 5 4 1 6 9 5 3 1 2 10 6 1 1 2 1 19 6 6 6 7 0 4 1 1 1 0 1 9 2 2 2 1 16 10 10 5 2 3 1 4 4 4 3 6 2 8 5 2 7 1 6 4 0 3 1 1 1 Background: A group of...
1. Consider the following data set: D= (5, 10, 15, 15, 5, 10, 15, 15, 5,...
1. Consider the following data set: D= (5, 10, 15, 15, 5, 10, 15, 15, 5, 10, 15, 15) SD= 4.33 How would you add a number to this set while keeping the SD the same? 2. tossing a coin 50 times A. from the 50 flips compute the proportion of heads from your 50 flips. B. For a 95% confidence level find the z critical value C. compute the 95% confidence interval for p, the margin of error from...
Given the following data, illustrate Selection Sort. index 1 2 3 4 5 6 data 11...
Given the following data, illustrate Selection Sort. index 1 2 3 4 5 6 data 11 10 21 3 7 5
Consider the following two sample data sets. Set​ 1: 5 3 2 8 6 Set​ 2:...
Consider the following two sample data sets. Set​ 1: 5 3 2 8 6 Set​ 2: 3 12 13 2 7 a. Calculate the coefficient of variation for each data set. b. Which data set has more​ variability? a. The coefficient of variation for set 1 is nothing ​%. ​(Round to one decimal place as​ needed.)
The following sample observations were randomly selected: 1 2 3 4 5 X: 15 8 9...
The following sample observations were randomly selected: 1 2 3 4 5 X: 15 8 9 12 8 Y: 25 22 16 16 14              a. Not available in Connect. b. Determine the regression equation. (Negative answer should be indicated by a minus sign. Do not round intermediate calculations. Round the final answers to 4 decimal places.)                                             b =----  a =---- Y' = -----+-----  X             c. Determine the value of Y' when X is 26. (Do not round intermediate calculations....
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 15 The...
Consider the data. xi 1 2 3 4 5 yi 4 7 5 11 15 The estimated regression equation for these data is  ŷ = 0.60 + 2.60x. (a)Compute SSE, SST, and SSR using equations SSE = Σ(yi − ŷi)2, SST = Σ(yi − y)2, and SSR = Σ(ŷi − y)2. SSE=SST=SSR= (b) Compute the coefficient of determination r2. r2 = Comment on the goodness of fit. (For purposes of this exercise, consider a proportion large if it is at least...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT