Question

In: Math

a comprehensive example (not just a data set - a concept) showing how the mean is...

a comprehensive example (not just a data set - a concept) showing how the mean is effected by outliers, and the effect that number close to the mean have on it

Solutions

Expert Solution

Outlier An extreme value in a set of data which is much higher or lower than the other numbers.Outliers affect the mean value of the data.

As we have seen in data collections that are used to draw graphs or find means, modes and medians the data arrives in relatively closed order. In other words, each element of the data is closely related to the majority of the other data. If not, the data set may have information that is too scattered to be useful in any analysis.

In some data sets there may be a point or two that can be out of context with the bulk of the data. These are referred to as outliers, which are out of line with the normal data set. The outlier can push the mean of the data out of its usual position.
For example, the data set 3,4,5,6,7 has a mean of 5, found by dividing the sum of the data by the number of data elements:

mean=(3+4+5+6+7)/5=5

If the 4 was mistakenly recorded as a 14, the 14 would be unusual for the data set and it would be an outlier.

Then: mean=(3+14+5+6+7)/5=7

And we can see the outlier has moved the mean of the data set.
To solve this problem the unusual data element can either be re-investigated and corrected, or removed from the data set with an explanation.

The former solution may bring back our original 4 after error checking is completed. The latter will return our mean closer to a representative evaluation of the data.

Mean=(3+5+6+7)/4=5=25

Another example

Let's say we have some data. 1,2,3. The mean of this is 2. But if we add an outlier of 94 to the data set, the mean will become 25. As you can see, the mean moved towards the outlier.


Related Solutions

What are the mean, median, and mode of a set of data, and how do they...
What are the mean, median, and mode of a set of data, and how do they differ from each other? What are the different type measures of dispersion? Provide examples of each from your experience.
the mean of the data set: 37634.3 the standard deviation of the data set: 10967.85287 the...
the mean of the data set: 37634.3 the standard deviation of the data set: 10967.85287 the sample size of the data set: 50 Using the numbers above calculate the following Show your step-by-step work for each question: Determine the 90% confidence interval, assuming that sigma is unknown, list each in proper (lower bound, upper bound) notation. Make a confidence statement. Determine the 95% confidence interval, assuming that sigma is unknown, list each in proper (lower bound, upper bound) notation. Make...
Concept Notebook ( This is an example of how to do a concept note). Concept: Acid...
Concept Notebook ( This is an example of how to do a concept note). Concept: Acid Balance 1. Related concepts (explain) – In this section, discuss how other concepts connect to and are affected by the main concept. a. Think about and explain how alterations in the main concept would affect other concepts b. Think about and explain how alterations in other concepts would affect the main concept c. Example – main concept of acid base balance; if acid base...
1. Give an example of a microorganism showing Pathogenicity and an example of a microorganism showing...
1. Give an example of a microorganism showing Pathogenicity and an example of a microorganism showing high Virulence. 2. Some people may already be immune to a certain pathogen such that when they are exposed to that particular pathogen they don’t get sick. How does this observation affect Koch’s postulates? How might the postulates be modified to account for the existence people who already have immunity?
Answer the following bootstrap question by showing the R code : A set of data X...
Answer the following bootstrap question by showing the R code : A set of data X contains the following numbers: 119.7 104.1 92.8 85.4 108.6 93.4 67.1 88.4 101.0 97.2 95.4 77.2 100.0 114.2 150.3 102.3 105.8 107.5 0.9 94.1 We generated n = 20 observations Xi = 10 Wi+100, where Wi has a contaminated normal distribution with proportion of contamination 20% and σc = 4. Suppose we are interested in testing: H0 : μ = 90 versus H1 :...
a) TRUE OR FALSE: For a set of data with a mean of 18 and a...
a) TRUE OR FALSE: For a set of data with a mean of 18 and a variance of 25, approximately 68% of the values will fall between 13 to 23. b) Which of the following statement is true about the relationship between a sample and a population 1)Every sample is a perfect representation of a population 2)The sample size is smaller than a population size 3)Every member of a population is also in the sample 4)A population size is smaller...
1) If you computed the 20% Trimmed mean on a data set with 50 values, how...
1) If you computed the 20% Trimmed mean on a data set with 50 values, how many values would you exclude, or throw away in total? 2) Why do people calculate a trimmed mean, anyway? 3) Calculate the Harmonic Mean for the values: 1, 2, 50, 200.
For the following, the data set has a mean of ? = 100 and a standard...
For the following, the data set has a mean of ? = 100 and a standard deviation of ? = 20.   Find the following areas under the curve for the region that is: 1) Less than 100 2) Greater than 160 For the following, scores on a psychology exam were normally distributed with a mean of 67 and a standard deviation of 8. Find the following: 1) What percentage of scores were below 83? 2) If 500 students took the...
Describe a fictional example and generate a small data set in which data are heteroscedastic and...
Describe a fictional example and generate a small data set in which data are heteroscedastic and you need to apply a GLS.
61)       Which of the following is an example of information as opposed to just data:            ...
61)       Which of the following is an example of information as opposed to just data:             A)    Number of dresses sold.             B)    Color of dresses sold.             C)    Dress sales in Michigan at Walmart Store #3458.             D)    Types of dresses sold per region and season. 62)       Which of the following statements is most true:             A)    Relevant information does not help predict the future.             B)    There are sometimes tradeoffs between information that is relevant and a faithful...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT