Question

In: Statistics and Probability

Data Set A: 1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22,...

Data Set A:

1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22, 1.23, 3.71, 0, 0.45, 2.18, 3.12, 1.91

Data Set B:

0, 0.63, 2.11, 1.37, 0, 1.11, 2.93, 0, 3.11, 2.61, 0, 0.38, 0.98, 1.55, 1.83, 0, 3.46, 2.31, 0, 1.49

- Comment on the Robust(trimming) estimates of central tendency differences.

Solutions

Expert Solution

Two Data sets are given

Sorted Data set A is given as

Remove the first 4 smallest and 4 largest value for finding the 20% trimmed mean which means

(0.2*20=4 observations remove)

20% Trimmed mean will be of only 12 observations

Trimmed mean =(0.45+0.73+......+3.12+3.71)/12 is given as 17.38

Data set A
0
0
0
0
0.45
0.73
0.91
1.01
1.23
1.32
1.69
1.73
1.91
2.01
2.18
2.21
2.22
3.03
3.12
3.71

Similarly 20% trimmed mean for sorted data set B will be

20% Trimmed mean will be 13.76 after removing the first four and last four observations which is also highlighted

Data set B
0
0
0
0
0
0
0.38
0.63
0.98
1.11
1.37
1.49
1.55
1.83
2.11
2.31
2.61
2.93
3.11
3.46

Trimmed means provide a better estimation of the location of the bulk of the observations than the mean when sampling from asymmetric distributions

the standard error of the trimmed mean is less affected by outliers and asymmetry than the mean, so that tests using trimmed means can have more power than tests using the mean.


Related Solutions

Data Set A: 1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22,...
Data Set A: 1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22, 1.23, 3.71, 0, 0.45, 2.18, 3.12, 1.91 Data Set B: 0, 0.63, 2.11, 1.37, 0, 1.11, 2.93, 0, 3.11, 2.61, 0, 0.38, 0.98, 1.55, 1.83, 0, 3.46, 2.31, 0, 1.49 - 2-sample parametric estimates - Using SAS or R, perform the appropriate 95% confidence interval for the difference in means, ratios of variances, and the modified-Levene test.
Data Set A: 1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22,...
Data Set A: 1.32, 1.01, 0, 2.21, 1.69, 1.73, 2.01, 0, 0.73, 0.91, 0, 3.03, 2.22, 1.23, 3.71, 0, 0.45, 2.18, 3.12, 1.91 Data Set B: 0, 0.63, 2.11, 1.37, 0, 1.11, 2.93, 0, 3.11, 2.61, 0, 0.38, 0.98, 1.55, 1.83, 0, 3.46, 2.31, 0, 1.49 - Interpret on comment on the nonparametric estimates of central tendency differences.
Solve SVM for a data set with 3 data instances in 2 dimensions: (1,1,+), (-1,1,-),(0,-1,-). Here...
Solve SVM for a data set with 3 data instances in 2 dimensions: (1,1,+), (-1,1,-),(0,-1,-). Here the first 2 number are the 2-dimension coordinates. ‘ +’ in 3rd place is positive class. And ‘-‘ in 3rd place is negative class . Your task is to compute alpha’s, w, b. Then, Solve SVM when data are non-separable, using k=2 when minimizing the violations of the mis-classification, i.e., on those slack variables.
Determine the reaction order and the rate constant for the following data set: Time (s) 0...
Determine the reaction order and the rate constant for the following data set: Time (s) 0 50 100 150 200 250 Concentration (M) 0.01000 0.00887 0.00797 0.00723 0.00662 0.00611
Question 3 0 / 3 pts Suppose you are working with the automobile data set and...
Question 3 0 / 3 pts Suppose you are working with the automobile data set and got the following printout. Assume that you already tested the quadratic terms and dropped them out of the model. The following printout was created. Least Squares Linear Regression of Price Predictor Variables    Coefficient    Std Error    T    P VIF Constant    20339.7 1039.20 19.57    0.0000 0.0 Mileage -0.06642 0.02264    -2.93    0.0052    1.8 Model 1255.69 1454.48    0.86...
A set of data has the following coordinates t 0 1 3 4 7 y 2...
A set of data has the following coordinates t 0 1 3 4 7 y 2 4 5 7 10 a) Find the least-squares fit to this data by a linear function of t (that is, find constants c1,c0 so that y(t) = c1t + c0 is the best linear fit to this set of data). b) Find the equation of the best quadratic fit to the same set of data. Then find the equation of the polynomial of smallest...
Use the data set below to answer the question. x −2 −1 0 1 2 y...
Use the data set below to answer the question. x −2 −1 0 1 2 y 2 2 4 5 5 Find a 90% prediction interval for some value of y to be observed in the future when x = −1. (Round your answers to three decimal places.)
Perform rank correlation analysis on the following data set: x y -3 -0 -2.65 -1.3 -2.15...
Perform rank correlation analysis on the following data set: x y -3 -0 -2.65 -1.3 -2.15 -1.5 -1.85 -1.5 -1.55 -1.7 -1.2 -1.3 -0.8 -0.5 -0.4 -0.9 -0.1 0 0.4 0.9 0.75 -0.3 1.1 1.4 1.6 1.4 1.95 0.3 2.4 1.3 2.7 -0.6 3 -0.9 3.55 0.4 4 0 4.5 -0.9 What is the rank correlation coefficient?  (Round to three decimal places.) What is the critical rho value at a 0.05 significance? Do we have correlation? Yes, the rank correlation coefficient...
For the data set {1, -3, 5.5, 8.1, 9, -2, 0, 0.4, 21, -6.8, -5}, what...
For the data set {1, -3, 5.5, 8.1, 9, -2, 0, 0.4, 21, -6.8, -5}, what are the mean, variance, Q1, Q2, Q3, IQR and mode, all accurate to one decimal point and in that order? Select one: a. 2.6, 63.1, -3, 0.4, 8.1, 7.7, 0. b. 2.6, 63.1, -3.4, 0.4, 8.9, 7.7, no mode. c. 2.6, 7.9, -3, 0.4, 8.1, 7.7, no mode. d. 2.6, 7.9, -3.4, 0.4, 8.9, 7.7, no mode. e. 2.6, 7.9, -3, 0.4, 8.1, 7.7,...
How do you determine if the following data set is linear or quadratic: (-4,10), (-3,7), (-2,4), (-1,1), (0,-2)?
How do you determine if the following data set is linear or quadratic: (-4,10), (-3,7), (-2,4), (-1,1), (0,-2)?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT