In: Statistics and Probability
2. Suppose that we want to test for the difference between two population means. Explain under what conditions we would use the t-test and under what conditions we would use the z-test. Explain the differences between the two tests.
Type all your answers. Try to write at least one page
Parametric t and z tests are used to compare the means of two samples. The calculation method differs according to the nature of the samples. A distinction is made between independent samples or paired samples. The t and z tests are known as parametric because the assumption is made that the samples are normally distributed.
Take a sample S1 comprising n1 observations, of mean µ1 and variance s1². Take a second sample S2, independent of S1 comprising n2 observations, of mean µ2 and variance s2². Let D be the assumed difference between the means (D is 0 when equality is assumed).
As for the z and t tests on a sample, we use:
Student's t Test :-
If we consider that the two samples have the same variance, the common variance is estimated by:
s² = [(n1-1)s1² + (n2-1)s2²] / (n1 + n2 - 2)
The test statistic is therefore given by:
t = (µ1 - µ2 -D) / (s √1/n1 + 1/n2)
The t statistic follows a Student distribution with n1 + n2 - 2 degrees of freedom.
If we consider that the variances are different, the statistic is given by:
t = (µ1 - µ2 -D) / (√s1²/n1 + s2²/n2)
z-Test
For the z-test, the variance s² of the population is presumed to be known. The user can enter this value or estimate it from the data (this is offered for teaching purposes only). The test statistic is given by:
z = (µ1 - µ2 -D) / (σ √1/n1 + 1/n2)
The z statistic follows a normal distribution.
Comparison of the means of two paired samples
If two samples are paired, they have to be of the same size. Where values are missing from certain observations, either the observation is removed from both samples or the missing values are estimated.
We study the mean of the calculated differences for the n observations. If d is the mean of the differences, s² the variance of the differences and D the supposed difference, the statistic of the t test is given by:
T= (d-D) ⁄ (s/√n)
The t statistic follows a Student distribution with n-1 degrees of freedom.
For the z test, the statistic is as follows where σ² is the variance
z= (d-D) ⁄ (σ/√n)
The z statistic follows a normal distribution.
Alternative hypotheses
Three types of test are possible depending on the alternative hypothesis chosen: