Question

In: Statistics and Probability

What is the difference between a suspicious data point and an extreme data point?

What is the difference between a suspicious data point and an extreme data point?

Solutions

Expert Solution

Suspicious data point: when a data points is observed as abnormal in any kind of aspect , then it is called as Suspicious data point.

Extreme data point: when a data point is observed far away from the tendency of the data it is called as extreme data point

Example: let us consider a data set which is having time taken for door deliveries of an online food delivery person.

let us assume in general in and around 5 deliveries per hour completed by the person.

but in the data if suddenly one observation is recorded as 35 deliveries per hour, then it is called as suspicious data point because 35 deliveries per hour is almost impossible. it may be wrongly entered.

where as if any observation is recorded as 15 deliveries per hour, it is considered as extreme data point because sometimes 15 deliveries per hour is possible. but it is very far away from the general observations which are 5 deliveries per hour.

For the data analysis purpose we can remove a suspicious data point and continue the analysis, because it may not be belongs to the data set.

But we until unless we have a strong evidence, we should not remove the extreme data point from the data set. because even it is very far away from the tendency of the data, still it is the part of the data set.

try to understand this for different situations as your concern.


Related Solutions

Is there a difference between Absolute and Extreme Poverty? If so, what is it?
Is there a difference between Absolute and Extreme Poverty? If so, what is it?
What is a data mart? (worth 1 point) What is the difference between a dependent and...
What is a data mart? (worth 1 point) What is the difference between a dependent and independent data mart? (worth 3 points)
5. What is the difference between the capillary melting point and true melting point?
5. What is the difference between the capillary melting point and true melting point?
Question 1 A residual is: choose one The difference between a data point and the regression...
Question 1 A residual is: choose one The difference between a data point and the regression line. A value that can be 1 or zero. A value that is always negative because it is a difference The difference between two different lines. Question 2 The correlation coefficient: choose one Is a number with a range from -1 to 1 If there is no correlation, the coefficient is negative If the correlation coefficient is negative, it indicates a strong positive relationship...
what is the difference between fob shipping point and fob destination
what is the difference between fob shipping point and fob destination
What is the difference between a model and actual data?
What is the difference between a model and actual data?
Describe the difference between discrete and continuous data with examples. (5) What is the difference between...
Describe the difference between discrete and continuous data with examples. (5) What is the difference between the process of using probability calculations for discrete verses continuous data? How do these calculations change? (5)
What is the difference between the firm’s shut down point in the short run and in...
What is the difference between the firm’s shut down point in the short run and in the long run? Why are firms willing to accept losses in the short run but not in the long run?
What is the difference between a silent mutation, point mutation and a missense mutation?
What is the difference between a silent mutation, point mutation and a missense mutation?
What is the difference between ordinal data and ratio data? What is the variance for problem...
What is the difference between ordinal data and ratio data? What is the variance for problem number 3? #3 Identify the mode and median of the following data. Compute the mean, range and standard deviation as well. 3 place decimals please!! 18, 20, 19, 22, 20 25 Points – Mode, median and range are 3 points each, mean is 6 points, and standard deviation is 10 points
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT