Question

In: Statistics and Probability

The ozone dataset gives a sample of ozone measurements (in partial pressures) taken over the South...

The ozone dataset gives a sample of ozone measurements (in partial pressures) taken over the South Pole on September 18, 1997 at altitudes above 10km.

mPa:

1.828, 2.652, 3.307, 3.855, 4.021, 4.173, 4.316, 4.951, 4.787, 4.554, 4.333, 4.039, 4.48, 4.739, 4.791, 5.213, 5.75, 5.79, 5.656, 5.464, 5.092, 3.135, 2.754, 3.14, 5.213, 5.831, 5.736, 5.333, 4.797, 6.704, 6.493, 5.746, 6.31, 6.53, 6.63, 6.071, 5.706, 4.951, 4.392, 4.619, 5.029, 5.302, 5.151, 3.474, 3.285, 3.232, 3.4, 3.503, 3.649, 3.828, 4.235, 4.781, 5.096, 5.262, 5.411, 5.439, 5.08, 4.719, 4.519

1. Compute the five-number summary and sketch the boxplot. Identify any outliers.

2. Compute the mean and standard deviation of the sample.

3. Construct the probability plot of the data. A hypothesis test could be used to compare the mean ozone level on September 18, 1997 to a specified baseline level.

4. Are the assumptions for an hypothesis test of the mean reasonably satisfied?

5. Test to see if the mean ozone amount on September 18 was below 5 mPa, at the 5% level of significance.

Year

106 × km2

1979

2.23

1980

1.88

1981

1.70

1982

3.77

1983

6.24

1984

8.66

1985

12.57

1986

9.58

1987

18.18

1988

8.75

1989

17.75

1990

17.86

1991

18.13

1992

21.28

1993

22.81

1994

22.82

6. Construct a line graph of the data in the table. Note any trends that you see.

7. Does it make sense to apply inferential methods (of the type we have studied) to the mean size of the ozone hole over time? Explain.

In addition to plotting the ozone concentration versus time, create a linear regression model for it and interpret the slope and comment on how well the model fits the data using the coefficient of variation.

(IF POSSIBLE INCLUDE R CODES WITH THE ANSWERS FOR EACH QUESTION)

Solutions

Expert Solution

Solution-1:

R code is

ozone <- c(1.828, 2.652, 3.307, 3.855, 4.021, 4.173, 4.316, 4.951, 4.787, 4.554, 4.333, 4.039, 4.48, 4.739, 4.791, 5.213, 5.75, 5.79, 5.656, 5.464, 5.092, 3.135, 2.754, 3.14, 5.213, 5.831, 5.736, 5.333, 4.797, 6.704, 6.493, 5.746, 6.31, 6.53, 6.63, 6.071, 5.706, 4.951, 4.392, 4.619, 5.029, 5.302, 5.151, 3.474, 3.285, 3.232, 3.4, 3.503, 3.649, 3.828, 4.235, 4.781, 5.096, 5.262, 5.411, 5.439, 5.08, 4.719, 4.519)

length(ozone)
print(ozone)
par(mfrow = c(1, 2))
boxplot(ozone,main="boxplot for ozone sample")
abline(h = min(ozone), col = "Blue")
abline(h = max(ozone), col = "Yellow")
abline(h = median(ozone), col = "Green")
abline(h = quantile(ozone, c(0.25, 0.75)), col = "Red")
fivenum(ozone)

Boxplot:

output:

minimum=1.828

Q1=4.030

Q2=4.791

Q3=5.425

maximum=6.704

outlier seen from boxplot

Solution-2:

Rcode:

mean(ozone)
sd(ozone)

Output:

mean=4.716559

standard deviation=1.073852

Solution-3:

qqnorm(ozone)
qqline(ozone)

From qqplot ozone sample follows normal distribution

4. Are the assumptions for an hypothesis test of the mean reasonably satisfied?

Yes met since sample follows normal distribution

sample is random sample

and independent


Related Solutions

A sample of blood pressure measurements is taken for a group of​ adults, and those values​...
A sample of blood pressure measurements is taken for a group of​ adults, and those values​ (mm Hg) are listed below. The values are matched so that 10 subjects each have a systolic and diastolic measurement. Find the coefficient of variation for each of the two​ samples; then compare the variation. Systolic 116 128 156 95 155 124 118 135 124 121 Diastolic 82 76 73 51 91 87 57 62 71 82 1) The coefficient of variation for the...
A sample of blood pressure measurements is taken for a group of​ adults, and those values​...
A sample of blood pressure measurements is taken for a group of​ adults, and those values​ (mm Hg) are listed below. The values are matched so that 10 subjects each have a systolic and diastolic measurement. Find the coefficient of variation for each of the two​ samples; then compare the variation. Systolic 120120 130130 157157 9494 154154 121121 114114 134134 126126 121121 Diastolic 8282 7676 7474 5353 9292 8989 5959 6262 7171 8383 The coefficient of variation for the systolic...
A sample of blood pressure measurements is taken for a group of adults, and those values...
A sample of blood pressure measurements is taken for a group of adults, and those values (mm Hg) are listed below. The values are matched so that 10 subjects each have a systolic and diastolic measurement. Find the coefficient of variation for each of the two samples; then compare the variation. Systolic 116 127 157 97 158 124 115 138 125 122 Diastolic 81 78 76 50 89 90 59 66 71 81 The coefficient of variation for the systolic...
A sample of 10 employee’s height is taken to design the height of doorway. The measurements...
A sample of 10 employee’s height is taken to design the height of doorway. The measurements (in cm) are in the following table 167.87 153.24 150.39 173.38 177.91 155.88 171.06 155.57 171.43 178.62 What should be the height of the doorway if you want that just a 0.1% of the employees have troubles passing through it? You think that the initial 10-employee sample is too small and you want to ensure a confidence level of 97% with an error in...
A sample of blood pressure measurements is taken for a group of​ adults, and those values​...
A sample of blood pressure measurements is taken for a group of​ adults, and those values​ (mm Hg) are listed below. The values are matched so that 1010 subjects each have a systolic and diastolic measurement. Find the coefficient of variation for each of the two​ samples; then compare the variation. Systolic 118118 128128 158158 9494 158158 122122 118118 138138 126126 122122 Open in StatCrunch + Copy to Clipboard + Open in Excel + Diastolic 8181 7676 7676 5454 8989...
In a calibration test, 50 measurements were taken of a laboratory gas sample that is known...
In a calibration test, 50 measurements were taken of a laboratory gas sample that is known to have a CO concentration of 70 parts per million (ppm). A measurement is considered to be satisfactory if it is within 5 ppm of the true concentration. Of the 50 measurements, 37 were satisfactory. [5 + 10 points] (a) Suppose it is desirable to estimate the population proportion of satisfactory measurement within ±0.03, how big a sample of measurements should be? (b) Without...
Measurements of length (cm) were taken for a sample of fish. The gender (male or female)...
Measurements of length (cm) were taken for a sample of fish. The gender (male or female) of each fish was also recorded. The most appropriate technique to explore whether there is a difference in the mean weights of fish between males and females would be: Select one: a. 1 sample z-test b. testing for two proportions c. 2 sample t-test. d. 1 sample t-test
A sample of blood pressure measurements is taken from a data set and those values​ (mm...
A sample of blood pressure measurements is taken from a data set and those values​ (mm Hg) are listed below. The values are matched so that subjects each have systolic and diastolic measurements. Find the mean and median for each of the two samples and then compare the two sets of results. Are the measures of center the best statistics to use with these​ data? What else might be​ better? Systolic   Diastolic 154 53 118 51 149 77 120 87...
A sample of blood pressure measurements is taken from a data set and those values​ (mm...
A sample of blood pressure measurements is taken from a data set and those values​ (mm Hg) are listed below. The values are matched so that subjects each have systolic and diastolic measurements. Find the mean and median for each of the two samples and then compare the two sets of results. Are the measures of center the best statistics to use with these​ data? What else might be​ better? ​Systolic: 150 126 95 140 154 159 145 101 152...
Over a 18 day period, daily measurements of silver yield from a mine us taken. The...
Over a 18 day period, daily measurements of silver yield from a mine us taken. The company has an original goal of the 60th percentile being more than 50 tons. From the 18 days, 8 days yielded less than 50 tons. No day yielded exactly 50 tons. Test to see if the company’s goal is upheld. (a) State the hypothesis for testing this claim. (b) Explain and show what test statistic you're using and how to get the p-value (c)...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT