In: Chemistry
Two sets of measurements of ethanol concentration in a sample of vodka were made using the same instrument, but on two different days.
On the first day, a standard deviation of s1 = 9 ppm was found,
and on the next day s2 = 2 ppm.
Both datasets comprised 6 measurements.
Can the two datasets be combined, or is there is a significant difference at 95% confidence between the datasets, and should we discard one of them?
Hint: Begin by defining the null hypothesis, H0: s12 = s22, and the alternate hypothesis, HA: s12 ≠ s22.
Using the F-test to compare two variances or standard deviations.
When using the F-test, you again require a hypothesis, and have to compare standard deviations. That is, you will test the null hypothesis H0: σ12 = σ22 against an appropriate alternate hypothesis.
Initially starting by defining the null hypothesis, H0: σ12 = σ22,
whereas alternate hypothesis, HA: σ12 ≠ σ22.
"≠" indicating that this is a 2-tailed test, since interestingly in both cases: σ12 >σ22 and σ12 < σ22.
Day |
Standard Deviation (ppm) |
|
---|---|---|
1 |
9 |
|
2 |
2 |
A dataset is more reliable if its standard deviation is lower than that of the other dataset. So if we test the null hypothesis H0: σ22 = σ12, against the alternate hypothesis HA: σ22 > σ12.
Since s2 > s1, Fcalc = s22/s12 = 22/92 = 4.93*10-2. The confidence level being 95% we can discard the Day 1 dataset and take Day 2 dataset.