Question

In: Statistics and Probability

The table shows the weekly income of 2020 randomly selected​ full-time students. If the student did...

The table shows the weekly income of 2020 randomly selected​ full-time students. If the student did not​ work, a zero was

entered.

0 463 0 0

501 103 527 231

329 385 3383 197

517 165 248 0

572 412 258 93

​(a) Check the data set for outliers.

​(b) Draw a histogram of the data.

​(c) Provide an explanation for any outliers.

  1. List all the outliers in the given data set. Select the correct choice below and fill in any answer boxes in your choice.

A. The​ outlier(s) is/are _____

​(Use a comma to separate answers as​ needed.)

B.There are no outliers

  1. Choose the correct histogram below.

​(c) Choose the possible​ reason(s) for any​ outlier(s) below. Select all that apply.

A. Data entry error

B. A student providing false information

C. A student with unusually high income

D.None of the above

E. There are no outliers.

Solutions

Expert Solution

Answer A:

Arranging the data in Ascending Order-

0 , 0 , 0 , 0, 93 , 103 , 105 , 197 , 231 , 248 , 258 , 329 , 385 , 412 , 463 , 501 , 517 , 527 , 572 , 3383

Number of observations is 20.

The formula for calculation of the median is (( n/2 + (n/2 + 1) ) th observations / 2 )

The median of these observations is ( ( 10th. + 11th. ) observations / 2 ) =( (248 + 258) / 2) = 253

Now, taking the median of the 1st half of the observation set, that is, of the following set-

0 , 0 , 0 , 0, 93 , 103 , 105 , 197 , 231 , 248

The first quartile (Q1) is (( 5th + 6th ) observations /2) = (93 + 103) / 2 = 98

Now taking the median of the 2nd half of the observation set , that is , of the following set-

258 , 329 , 385 , 412 , 463 , 501 , 517 , 527 , 572 , 3383

The third quartile (Q3) is (( 5th + 6th ) observations /2) = (463 + 501)/2 = 482

The Interquartile Range (IQR) = Third quartile - first Quartile = 482 - 98 = 384

Now let t = 1.5 x IQR = 576

Subtracting this value t from first quartile (Q1) , and adding this value t to the   third quartile (Q3) , we get ,

t + Q3 = 576 + 482 = 1058

Q1 - t = 98 - 576 = -478

Therefore , if all the values of the observation set lie in the interval [-478 , 1058]. Then there are no outliers. If not, then there are outliers present in the data set.

The outlier is 3383.

Answer B:

The frequency distribution of the following data set is-

Class Intervals

Frequency

0-250

10

250-500

5

500-750

4

750-1000

0

1000-1250

0

1250-1500

0

1500-1750

0

1750-2000

0

2000-2250

0

2250-2500

0

2500-2750

0

2750-3000

0

3000-3250

0

3250-3500

1

Total

20

The histogram to the data set is-

( where X-axis represents the Class Intervals and Y-axis represents the Frequency )

Answer C:

The three Options A , B , C, can be the reason behind the outlier.

It can be any one of the three options.


Related Solutions

A student at a junior college conducted a survey of 20 randomly selected​ full-time students to...
A student at a junior college conducted a survey of 20 randomly selected​ full-time students to determine the relation between the number of hours of video game playing each​ week, x, and​ grade-point average, y. She found that a linear relation exists between the two variables. The​ least-squares regression line that describes this relation is ModifyingAbove y with caret equals negative 0.0559 x plus 2.9289. (a) Predict the​ grade-point average of a student who plays video games 8 hours per...
A student at a junior college conducted a survey of 20 randomly selected? full-time students to...
A student at a junior college conducted a survey of 20 randomly selected? full-time students to determine the relation between the number of hours of video game playing each? week, x, and? grade-point average, y. She found that a linear relation exists between the two variables. The? least-squares regression line that describes this relation is y = -0.0503x + 2.9381. (a) Predict the grade-point average of a student who plays video games 8 hours per week. The predicted grade-point average...
A student at a junior college conducted a survey of 20 randomly selected​ full-time students to...
A student at a junior college conducted a survey of 20 randomly selected​ full-time students to determine the relation between the number of hours of video game playing each​ week, x, and​ grade-point average, y. She found that a linear relation exists between the two variables. The​ least-squares regression line that describes this relation is ModifyingAbove y with caret equals negative 0.0592 x plus 2.9446. ​(a) Predict the​ grade-point average of a student who plays video games 8 hours per...
The weekly time spent​ (in hours) on homework for 18 randomly selected high school students is...
The weekly time spent​ (in hours) on homework for 18 randomly selected high school students is given below. Use technology to construct​ 90%, 95%, and​ 99% confidence intervals for the population mean. Interpret the results and compare the widths of the confidence intervals. Assume the weekly time spent on homework is normally distributed. 13.113.1 12.112.1 14.714.7 15.215.2 8.68.6 10.710.7 11.811.8 8.58.5 9.99.9 9.69.6 11.111.1 11.911.9 15.915.9 11.911.9 12.212.2 11.511.5 13.513.5 12.212.2 The lower limit of the​ 90% confidence interval is...
Students at a college randomly selected 217 student cars and found that they had ages with...
Students at a college randomly selected 217 student cars and found that they had ages with a mean of 7.89 years and a standard deviation of 3.67 years. They also randomly selected 152 faculty cars and found that they had ages with a mean of 5399 years and a standard deviation of 3.65 years. a) Using a 0.05 significance level, are student cars older than faculty cars? List: Define the parameters u1 and u2 Null and Alternative Hypothesis P-value Initial...
The table below shows the net weekly demand of Audi Car model. The lead time to...
The table below shows the net weekly demand of Audi Car model. The lead time to receive the door is 2 weeks. In week 0, there are 5 cars and 15 doors in inventory. It is expected to receive 30 reworked doors in week 10. Week 8 9 10 11 12 13 14 15 16 17 Net demand (car) 20 25 40 12 5 32 24 18 16 30 Suppose that driver door has four buttons, and each other door...
​​​​​​A sample survey of 36 randomly selected households shows that the median household income of the...
​​​​​​A sample survey of 36 randomly selected households shows that the median household income of the eastern NC residents is $51,000, with a standard deviation of $1075. Test the (null) hypothesis that the actual (real) median household income is $52,752 at 0.05% confidence level.
A study of 50 randomly selected students from a particular college who did not go to...
A study of 50 randomly selected students from a particular college who did not go to the campus Learning Center for statistics tutoring had a mean overall grade (at the end of the semester) of 68.41% with a standard deviation of 9.60%. The same study also found that 48 randomly selected students who did take advantage of the tutoring at the campus Learning Center had a mean overall grade of 79.25% with a standard deviation of 8.71%. The standard deviation...
The table shows the daily sales (in $1000) of shopping mall for some randomly selected days...
The table shows the daily sales (in $1000) of shopping mall for some randomly selected days Sales 1.1-1.5 1.6-2.0 2.1-2.5 2.6-3.0 3.1-3.5 3.6-4.0 4.1-4.5 Days 18 27 31 40 56 55 23 Use it to answer questions 13 and 14. 13. What is the approximate value for the modal daily sales? A. $3,129.41 B. $2,629.41 C. $3,079.41 D. $3,123.53 14. The approximate median daily sales is … A. $3,130.36 B. $2,680.36 C. $3,180.36 D. $2,664
Let X denote the proportion of allotted time that a randomly selected student spends working on...
Let X denote the proportion of allotted time that a randomly selected student spends working on a certain aptitude test. Suppose the pdf of X is f(x) = { (θ+1)x^θ, for 0≤x≤1 0, otherwise } where −1<θ. A random sample of ten students yields data x1=.92, x2=.79, x3=.90, x4=.65, x5=.86, x6=.47, x7=.73, x8=.97, x9=.94, x10=.77. (a) Use the method of moments to obtain an estimator of θ, and then compute the estimate for this data. (b) Obtain the maximum likelihood...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT