Sign In
Sampling is the process of selecting a sample from a population of objects or individuals. The sample can be then examined to make conclusions about the entire population.
There are many different sampling methods, such as convenience sampling, self-selected sampling, systematic sampling, and random sampling. For example, suppose that a sample of the students in a classroom needs to be chosen to study blood sugar levels. The following applet visualizes some of these methods.
A sample that accurately reflects the characteristics of the population is called a representative sample.
In a representative sample, if x parts of the population have a certain characteristic, approximately x parts of the sample will share the same characteristic. As an example, consider a sample of cats and dogs selected from all the cats and dogs at a pet shop.
The figure shows cats represent 60 % of the population and 60 % of the sample. Similarly, dogs represent 40 % of the population and 40 % of the sample. In this case, the sample is then representative. Note that in a representative sample, every subgroup of the population is represented.
A bias in a sample is an error in sampling that results in misrepresentation of members of a population. Bias occurs when members of a population that are representing certain characteristics are more likely to be selected in a sample than others.
| Definition | |
|---|---|
| Unbiased Sample | A sample that is representative of the population. Conclusions drawn from this sample can be generalized to the whole population. |
| Biased Sample | A sample that overrepresents or underrepresents a certain part of the population. The inferences drawn based on this sample may be invalid. |
The chosen sampling method may either introduce or minimize a bias in a sample. The following real-life scenarios present examples of biased samples.
| Biased Sample | Explanation |
|---|---|
| A city council asks residents whether there should be an off-leash area for dogs in a park. A hundred dog owners are surveyed at the park. | The only people asked are dog owners. This means that respondents are more likely to have a strong opinion about an off-leash area for their dogs. |
| To assess the experiences of customers who shop online, a company e-mails purchasers with a link to a survey. | Because this sample is self-selected, only those who are very satisfied or dissatisfied with the shopping experience are likely to respond. |
| Every sixth boxer at a boxing camp is asked to name their favorite brand of boxing gloves. | Not all boxers go to a boxing camp, as camps are usually sponsored by a brand and take place in a single city. Also, professional boxers often organize their own private camps with hired sparring partners. |
Sample-to-sample variability refers to the fact that different samples taken from the same population can yield different statistics, even with the same sampling method. For example, suppose the heights of students in a classroom are studied. Pick different samples in the following applet to see their different means.
In a simple random sample, each member of a population is equally likely to be selected as part of the sample. Consider an example where a researcher performs the following procedure. |c| There are $20$ people, each assigned a unique number, who are surveyed. Of those $20$ people, $4$ need to be randomly selected as a sample. The researcher writes each unique number on a paper and places them in a bag.The researcher then blindly selects $4$ papers out of the bag.
The researcher's way of choosing a sample means that each participant is equally likely to be selected as part of the sample. The following applet shows a generated example of simple random samples.
In a systematic sample, the members of a population are ordered randomly or in a random-like way. The sample size is predetermined and selected among the members in a specified interval. The intervals are determined by dividing the population size by the sample size. Consider an example. |c| Five customers are surveyed about the quality of a fragrance shampoo.
A sample of 5 people is selected from a population of 20 people using systematic sampling.
In a self-selected sample, the members of the sample are the people who are willing to participate. Since people participate voluntarily in these samples, the samples are also called voluntary response samples. Consider an example. |c| A survey about internet shopping is posted online. People volunteer to participate or simply ignore it. The following applet shows a certain number of people who volunteer to participate in the survey.
As a result, such a sample is not representative of the population because it underrepresents people with neutral opinions about the topic or who are not interested in it.
In a convenience sample, members of a population are selected to be in a sample based on convenience or their availability to the researchers. Consider an example. |c| A researcher wants to study all wolf subspecies in a forest. However, they only have the funds to study those that frequently visit a certain area. This is an example of convenience sampling because the researcher is selecting wolves who are conveniently available and in close proximity.
This type of sampling is often used when researchers have limited resources or are under certain time constraints.
In a stratified sampling, members from a population are divided into subgroups. These subgroups can be formed based on factors like age, education, or health status, among others. Members of the sample are then randomly selected from each subgroup. Consider an example. |c| A researcher wants to study the academic achievements of elementary students in a local school district. The researcher plans to randomly select students in each grade to be in the sample.
Stratified sampling can be conducted as follows.
For a cluster sample, a population is first divided into smaller groups with similar characteristics to the whole population called clusters. One or more clusters are randomly selected. All members in the selected clusters form the sample. A member of the population cannot be included in more than one cluster. Consider an example. A group of researchers conduct a study to examine the basic mathematical skills of all the eight-graders in a city. They consider each school in the city as a separate cluster. The researchers then select one cluster to collect data from. Cluster sampling can be conducted as follows.