In: Statistics and Probability
how does sampling variation relate to concluding Ho or H1 when conducting a hypothesis testing?
Null hypothesis testing is a formal approach to deciding between two interpretations of a statistical relationship in a sample. One interpretation is called the null hypothesis (often symbolized H0 and read as “H-naught”). This is the idea that there is no relationship in the population and that the relationship in the sample reflects only sampling error. Informally, the null hypothesis is that the sample relationship “occurred by chance.” The other interpretation is called the alternative hypothesis (often symbolized as H1). This is the idea that there is a relationship in the population and that the relationship in the sample reflects this relationship in the population.
Again, every statistical relationship in a sample can be interpreted in either of these two ways: It might have occurred by chance, or it might reflect a relationship in the population. So researchers need a way to decide between them. Although there are many specific null hypothesis testing techniques, they are all based on the same general logic. The steps are as follows:
Assume for the moment that the null hypothesis is true. There is
no relationship between the variables in the population.
Determine how likely the sample relationship would be if the null
hypothesis were true.
If the sample relationship would be extremely unlikely, then reject
the null hypothesis in favour of the alternative hypothesis. If it
would not be extremely unlikely, then retain the null
hypothesis.
A crucial step in null hypothesis testing is finding the likelihood
of the sample result if the null hypothesis were true. This
probability is called the p value. A low p value means that the
sample result would be unlikely if the null hypothesis were true
and leads to the rejection of the null hypothesis. A high p value
means that the sample result would be likely if the null hypothesis
were true and leads to the retention of the null hypothesis. But
how low must the p value be before the sample result is considered
unlikely enough to reject the null hypothesis? In null hypothesis
testing, this criterion is called α (alpha) and is almost always
set to .05. If there is less than a 5% chance of a result as
extreme as the sample result if the null hypothesis were true, then
the null hypothesis is rejected. When this happens, the result is
said to be statistically significant. If there is greater than a 5%
chance of a result as extreme as the sample result when the null
hypothesis is true, then the null hypothesis is retained. This does
not necessarily mean that the researcher accepts the null
hypothesis as true—only that there is not currently enough evidence
to conclude that it is true. Researchers often use the expression
“fail to reject the null hypothesis” rather than “retain the null
hypothesis,” but they never use the expression “accept the null
hypothesis.”