Question

In: Statistics and Probability

Six data sets are presented, some of them are samples from a normal distribution, and some...

Six data sets are presented, some of them are samples from a normal distribution, and some of them are samples from populations that are not normally distributed. Identify the samples that are not from normally distributed populations.

L1: Drug concentration six hours after administration

L2: Reading scores on standardized test for elementary children

L3: The number of minutes clerical workers took to complete a certain worksheet

L4: The level of impurities in aluminum cans (in percent)

L5: The number of defective items produced during an hour

L6: The weight of trout in ounces

L1

L2

L3

L4

L5

L6

4.7

72

4.5

2.1

21

9.9

3.2

77

5.8

1.3

16

11.3

5.1

65

3.7

2.8

10

11.4

4.7

85

4.9

1.5

10

9

3.0

68

4.3

1.0

11

10.1

3.4

83

4.7

8.2

9

8.2

4.4

73

5.8

1.9

13

8.9

3.5

79

3.2

9.5

12

9.9

4.5

72

3.0

3.2

11

10.5

5.8

81

5.1

1.3

29

8.6

3.7

79

3.6

4.4

10

7.8

4.9

91

4.3

3.8

14

10.8

5.4

69

3.6

2.7

27

8.4

3.6

67

5.4

8.0

10

9.6

4.3

82

4.7

1.9

11

9.9

3.0

66

3.0

4.9

11

8.4

5.1

77

3.4

4.5

12

9.0

4.3

66

4.3

1.5

19

9.1

Look at your modified boxplots. Which boxplot(s) show(s) outliers in the data?

Drug concentration six hours after administration

Reading scores on standardized test for elementary children

The number of minutes clerical workers took to complete a certain worksheet

The level of impurities in aluminum cans (in percent)

The number of defective items produced in an hour

The weight of trout in ounces

Solutions

Expert Solution

To understand whether the sample is from normal population or not, we can use any noramlity tests like K-S test, Anderson Darling test, Shapiro-Wilk test, etc. Here we are using Shapiro-Wilk test. If the p-value is greater than 0.05, we consider the data to be normal.

Here, we can see that L1,L2,L3,L6 are normal, but L4 and L5 not normal. The boxplots of L4 and L5 show outliers.

Note: Attaching below test results and boxplots for each data.

> shapiro.test(L1)

        Shapiro-Wilk normality test

data:  L1
W = 0.9532, p-value = 0.4773

> shapiro.test(L2)

        Shapiro-Wilk normality test

data:  L2
W = 0.94656, p-value = 0.3736

> shapiro.test(L3)

        Shapiro-Wilk normality test

data:  L3
W = 0.94718, p-value = 0.3825

> shapiro.test(L4)

        Shapiro-Wilk normality test

data:  L4
W = 0.83787, p-value = 0.005484

> shapiro.test(L5)

        Shapiro-Wilk normality test

data:  L5
W = 0.76241, p-value = 0.0004688

> shapiro.test(L6)

        Shapiro-Wilk normality test

data:  L6
W = 0.95834, p-value = 0.5699

Related Solutions

Six data sets are presented, some of them are samples from a normal distribution, and some...
Six data sets are presented, some of them are samples from a normal distribution, and some of them are samples from populations that are not normally distributed. Identify the samples that are not from normally distributed populations. L1: Drug concentration six hours after administration L2: Reading scores on standardized test for elementary children L3: The number of minutes clerical workers took to complete a certain worksheet L4: The level of impurities in aluminum cans (in percent) L5: The number of...
Six data sets are presented, some of them are samples from a normal distribution, and some...
Six data sets are presented, some of them are samples from a normal distribution, and some of them are samples from populations that are not normally distributed. Identify the samples that are not from normally distributed populations. L1: Drug concentration six hours after administration L2: Reading scores on standardized test for elementary children L3: The number of minutes clerical workers took to complete a certain worksheet L4: The level of impurities in aluminum cans (in percent) L5: The number of...
2. The hardness of some cement samples can be modeled by a normal distribution with an...
2. The hardness of some cement samples can be modeled by a normal distribution with an average of 6,000 kg / cm2 and a standard deviation of 1000 kg / cm2. a) What is the probability that the hardness of the sample is less than 6.250 kg / cm2? b) What is the probability that the hardness of the sample is between 5,800 and 5,900 kg / cm2? c) Which value is exceeded by 90% of the hardness? d) Among...
What is Normal Distribution? Why is it so useful in processing data? Is Normal Distribution a...
What is Normal Distribution? Why is it so useful in processing data? Is Normal Distribution a naturally occurring phenomena or there are factors that allows for Normal Distribution? If so, list those factors. Can all data be processed as Normal? Explain why or why not. From the link below or otherwise list two examples each of a Normal distribution and "Not Normal Distribution"
The flow in a river can be modeled as a log-normal distribution. From the data, it...
The flow in a river can be modeled as a log-normal distribution. From the data, it was estimated that, the probability that the flow exceeds 1100 cfs is 50% and the probability that it exceeds 100 cfs is 90%. Let X denote the flow in cfs in the river. Flood conditions occur when flow is 5000 cfs or above. To compute the percentage of time flood conditions occur for this river, we have to find, P(X≥5000)=1-P(Z<a). What is the value...
The following data sets represent simple random samples from a population whose mean is 100. Complete...
The following data sets represent simple random samples from a population whose mean is 100. Complete parts ​(a) through ​(e) below. Full data set Data Set I 106 124 88 126 89 71 74 110 Data Set II 106 124 88 126 89 71 74 110 88 91 109 83 113 118 94 124 97 85 80 104 Data Set III 106 124 88 126 89 71 74 110 88 91 109 83 113 118 94 124 97 85 80...
1. Assume that your data comes from a Normal distribution. Further assume that the distribution has...
1. Assume that your data comes from a Normal distribution. Further assume that the distribution has pop-ulation mean μ equal to your sample mean ̄x and population standard deviationσequal to your samplestandard deviations. Draw the graph of the Normal distribution described, labeling the mean and the tickmarks at standard deviation units. 2. We were sampling from the distribution of rainfall in a particular month. Assuming this distribution is Normal, find the proportion of years in which rainfall in the selected...
a. Generate samples of size 25, 50, 100 from a normal distribution. Construct probability plots. Do...
a. Generate samples of size 25, 50, 100 from a normal distribution. Construct probability plots. Do this several times to get an idea of how probability plots behave when the underlying distribution is really normal. b. Repeat part (a) for a chi-square distribution with 10 df I am required to use the R statistical program and I do not understand how to use it to solve this problem. If you could please show me the R Stats needed to solve...
The two data sets in the table below are dependent random samples. The population of (...
The two data sets in the table below are dependent random samples. The population of ( x − y ) (x-y) differences is approximately normally distributed. A claim is made that the mean difference ( x − y ) (x-y) is less than -31.4. X 25 32 48 37 39 34 37 Y 73 64 66 80 78 67 84 For each part below, enter only a numeric value in the answer box. For example, do not type "z ="...
A random sample of n​ = 7 observations from a normal distribution resulted in the data...
A random sample of n​ = 7 observations from a normal distribution resulted in the data shown in the table. Compute a 90​% confidence interval for sigma squared. 91 99 106 95 84 69 84
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT