In: Math
Data Collection Techniques are the sampling method(s) used for a survey depends on several factors including:
Sampling method refers to the way that observations are selected from a population to be in the sample for a sample survey.
The reason for conducting a sample survey is to estimate the value of some attribute of a population.
Consider this example. A public opinion pollster wants to know the percentage of voters that favor a flat-rate income tax. The actual percentage of all the voters is a population parameter. The estimate of that percentage, based on sample data, is a sample statistic.
The quality of a sample statistic (i.e., accuracy, precision, representativeness) is strongly affected by the way that sample observations are chosen; that is., by the sampling method.
As a group, sampling methods fall into one of two categories.
1. Probability samples: With probability sampling methods, each population element has a known (non-zero) chance of being chosen for the sample.
The main types of probability sampling methods are simple random sampling, stratified sampling, cluster sampling, multistage sampling, and systematic random sampling. The key benefit of probability sampling methods is that they guarantee that the sample chosen is representative of the population. This ensures that the statistical conclusions will be valid.
There are many ways to obtain a simple random sample. One way would be the lottery method. Each of the N population members is assigned a unique number. The numbers are placed in a bowl and thoroughly mixed. Then, a blind-folded researcher selects n numbers. Population members having the selected numbers are included in the sample.
Pros of Simple Random Sampling
Cons of Simple Random Sampling
One of the most obvious limitations of simple random sampling method is its need of a complete list of all the members of the population. Please keep in mind that the list of the population must be complete and up-to-date. This list is usually not available for large populations. In cases as such, it is wiser to use other sampling technique.
As a example, suppose we conduct a national survey. We might divide the population into groups or strata, based on geography - north, east, south, and west. Then, within each stratum, we might randomly select survey respondents.
Pros of Stratified Sampling
The aim of the stratified random sample is to reduce the potential for human bias in the selection of cases to be included in the sample. As a result, the stratified random sample provides us with a sample that is highly representative of the population being studied, assuming that there is limited missing data. Since the units selected for inclusion in the sample are chosen using probabilistic methods, stratified random sampling allows us to make generalizations (i.e. statistical inferences) from the sample to the population. This is a major advantage because such generalizations are more likely to be considered to have external validity.
Cons of Stratified Sampling
Stratified sampling is not useful when the population cannot be exhaustively partitioned into disjoint subgroups. It would be misapplication of the technique to make subgroups sample sizes proportional to the amount of data available from the subgroups, rather than scaling sample sizes to subgroup sizes (or to their variances, if known to vary significantly e.g. by means of an F test). Date representing each subgroup is taken to be of equal importance if suspected variation among them warrants stratified sampling. If, on the other hand, the very variances vary so much, among subgroups that the data need to be stratified by variance, there is no way to make the subgroup sample sizes proportional (at the same time) to the subgroups sizes with in the total population.
Note the difference between cluster sampling and stratified sampling. With stratified sampling, the sample includes elements from each stratum. With cluster sampling, in contrast, the sample includes elements only from sampled clusters.
For example, in Stage 1, we might use cluster sampling to choose clusters from a population. Then, in Stage 2, we might use simple random sampling to select a subset of elements from each chosen cluster for the final sample.
Pros of Cluster Sampling or Multi-Stage Sampling
Cons of Cluster Sampling
This method is different from simple random sampling since every possible sample of n elements is not equally likely.
Pros of Systematic Sampling
Cons of Systematic Sampling
The process of selection can interact with a hidden periodic trait within the population. If the sampling technique coincides with the periodicity of the trait, the sampling technique will no longer be random and representativeness of the sample is compromised.
2. Non-probability samples : With non-probability sampling methods, we do not know the probability that each population element will be chosen, and/or we cannot be sure that each population element has a non-zero chance of being chosen.
Two of the main types of non-probability sampling methods are voluntary samples and convenience samples.
With proportional quota sampling, the aim is to end up with a sample where the strata (groups) being studied (e.g. males vs. females students) are proportional to the population being studied. If we were to examine the differences in male and female students.
Pros of Quota Sampling
Quota sampling is particularly useful when you are unable to obtain a probability sample, but you are still trying to create a sample that is as representative as possible of the population being studied. In this respect, it is the non-probability based equivalent of the stratified random sample. Unlike probability sampling techniques, especially stratified random sampling, quota sampling is much quicker and easier to carry out because it does not require a sampling frame and the strict use of random sampling techniques (i.e. probability sampling techniques). This makes quota sampling popular in undergraduate and master’s level dissertations where there is a need to divide the population being studied into strata (groups). The quota sample improves the representations of particular strata (groups) within the population, as well as ensuring that these strata are not over-represented. For example, it would ensure that we have sufficient male students taking part in the research (60% of our sample size of 100; hence, 60 male students). It would also make sure we did not have more than 60 male students, which would result in an over-representation of male students in our research. The use of quota sample, which leads to stratification of a sample (e.g. male and female students), allows us to more easily compare these groups (strata)
Cons of Cluster Sampling
In quota sampling, the sample has not been chosen using random selection, which makes it impossible to determine the possible sampling error. Indeed, it is possible that the selection of units to be included in the sample will be based on ease of access and cost considerations, resulting in sampling bias. It also means that it is not possible to make generalizations (i.e. statistical inferences) from the sample to the population. This can lead to problems of external validity. Also, with quota sampling is must be possible to clearly divide the population into strata; that is, each unit from the population must only belong to one stratum. In our example, this would be fairly simple, since our strata are male and female students. Clearly, a student could only be classified as either male or female. No student could fit into both categories (ignoring transgender issues). Furthermore, imagine extending the sampling requirements such that we were also interested in how career goals changed depending on whether a student was an undergraduate or postgraduate. Since the strata must be mutually exclusive, this means that we would need to sample four strata from the population: undergraduate males, undergraduate females, postgraduate males and postgraduate females. This will increase overall sample size required for the research, which can increase costs and time to carry out the research.
Purposive sampling, also known as judgmental, selective or subjective sampling, reflects a group of sampling techniques that rely on the judgement of the researcher when it comes to selecting the units (e.g. people, case/organisations, events, pieces of data) that are to be studied. These purposive sampling techniques include maximum variation sampling, homogeneous sampling and typical case sampling; extreme (deviant) case sampling, total population sampling ad expert sampling.
Pros of Purposive Sampling
Cons of Purposive Sampling
Self-selection sampling is appropriate when we want to allow units or cases, whether individuals or organisations to choose to take part in research on their own accord. The key component is that research subjects volunteer to take part in the research rather than being approached by the researcher directly.
Pros of Self-selection Sampling
Cons of Self-selection Sampling Since the potential research subjects (or organisations) volunteer to take part in the survey:
In sociology and statistics research, snowball sampling or chain sampling, chain-referral sampling is a non-probability sampling technique where existing study subjects recruit future subjects from among their acquaintances. Thus the sample group appears to grow like a rolling snowball. As the sample builds up, enough data is gathered to be useful for research. This sampling technique is often used in hidden populations which are difficult for researchers to access.
Pros of Snowball Sampling
Cons of Snowball Sampling
Since snowball sampling does not select units for inclusion in the sample based on random selection, unlike probability sampling technique, it is impossible to determine the possible sampling error and make generalizations (i.e. statistical inferences) from the sample to the population. As such, snowball samples should not be considered to be representative of the population being studied.
Non-probability sampling methods offer two potential advantages - convenience and cost. The main disadvantage is that non-probability sampling methods do not allow you to estimate the extent to which sample statistics are likely to differ from population parameters. Only probability sampling methods permit that kind of analysis.