Question

In: Statistics and Probability

Where and how do you select a sample? I am stuck on the concept of what...

Where and how do you select a sample? I am stuck on the concept of what it means.

Solutions

Expert Solution

Definition

A sample refers to a smaller, manageable version of a larger group. It is a subset containing the characteristics of a larger population. Samples are used in statistical testing when population sizes are too large for the test to include all possible members or observations. A sample should represent the population as a whole and not reflect any bias toward a specific attribute.

Suppose you want to know what will be the proportion of votes Donald Trump in election.

Now you can't ask each voter as population size is very large

so

you select a smaller number of voters (a sample ) by some method which will be discussed below

and based on that we approximate our parameter(population parameter) by statistics (sample proportion)

parameter could be anything like population mean, population variance etc

statistics can be sample mean, sample variance etc .

Types of Sampling :-

Random Sampling

Systematic Sampling

Stratified Sampling

Convenience Sampling

Simple Random Sampling

Simple random sampling is ideal if every entity in the population is identical. If the researchers don’t care whether their sample subjects are all male or all female or a combination of both sexes in some form, the simple random sampling may be a good selection technique.

Let's say there were 200,000 test-takers who sat for the CFA exam in 2016, out of which 40% were women and 60% were men. The random sample drawn from the population should, therefore, have 400 women and 600 men for a total of 1,000 test-takers.

But what about cases where knowing the ratio of men to women that passed a test after studying for less than 40 hours is important? Here, a stratified random sample would be preferable to a simple random sample.

Stratified Random Sampling

This type of sampling, also referred to as proportional random sampling or quota random sampling, divides the overall population into smaller groups. These are known as strata. People within the strata share similar characteristics.

What if age was an important factor that researchers would like to include in their data? Using the stratified random sampling technique, they could create layers or strata for each age group. The selection from each strata would have to be random so that everyone in the bracket has a likely chance of being included in the sample. For example, two participants, Alex and David, are 22 and 24 years old, respectively. The sample selection cannot pick one over the other based on some preferential mechanism. They both should have an equal chance of being selected from their age group. The strata could look something like this:

Strata (Age) Number of People in Population Number To Be Included in Sample
20-24 30,000 150
25-29 70,000 350
30-34 40,000 200
35-39 30,000 150
40-44 20,000 100
>44 10,000 50
Total 200,000 1,000

From the table, the population has been divided into age groups. For example, 30,000 people within the age range of 20 to 24 years old took the CFA exam in 2016. Using this same proportion, the sample group will have (30,000 ÷ 200,000) x 1,000 = 150 test-takers that fall within this group. Alex or David—or both or neither—may be included among the 150 random exam participants of the sample.

There are many more strata that could be compiled when deciding on a sample size. Some researchers might populate the job functions, countries, marital status, etc. of the test=takers when deciding how to create the sample.

Examples of Samples

As of 2017, the population of the world was 7.5 billion, out of which 49.6% were female and 50.4% were male. The total number of people in any given country can also be a population size. The total number of students in a city can be taken as a population, and the total number of dogs in a city is also a population size. Samples can be taken from these populations for research purposes.

Following our CFA exam example, the researchers could take a sample of 1,000 CFA participants from the total 200,000 test-takers—the population—and run the required data on this number. The mean of this sample would be taken to estimate the average of CFA exam takers that passed even though they only studied for less than 40 hours.

The sample group taken should not be biased. This means that if the sample mean of the 1,000 CFA exam participants is 50, the population mean of the 200,000 test-takers should also be approximately 50.


Related Solutions

I am stuck on this problem and I am not sure what the solution is. In...
I am stuck on this problem and I am not sure what the solution is. In C Write item.h and item.c. In item.h, typedef a struct (of type t_item) which contains the following information: t_item: char name[MAX_ITEM_NAME_STRING]; char description[MAX_ITEM_DESCRIPTION_STRING]; Make sure that MAX_ITEM_NAME_STRING and MAX_ITEM_DESCRIPTION_STRING are defined with suitable sizes in your item.h. Typical values are, 25 and 80, respectively. Add the following interface definition to item.h: int item_load_items(t_item items[], int max_items, char *filename); Returns the number of objects loaded...
Java ArrayList Parking Ticket Simulator, Hello I am stuck on this problem where I am asked...
Java ArrayList Parking Ticket Simulator, Hello I am stuck on this problem where I am asked to calculate the sum of all fines in the policeOfficer class from the arraylist i created. I modified the issueParking ticket method which i bolded at the very end to add each issued Parking ticket to the arrayList, i think thats the right way? if not please let me know. What I dont understand how to do is access the fineAmountInCAD from the arrayList...
Hi! I am in an intro level Finance course and I am stuck on this problem....
Hi! I am in an intro level Finance course and I am stuck on this problem. Any help would be greatly appreciated. I am deciding on opening a restaurant. I was able to scrape together some capital from friends and family, but I must pay them back in 4 years at 12% per annum. I figure that it will cost me $165,000 to start up with rent, deposits, equipment, salaries, chicken, basil, rice, etc. for the first year, but I...
I almost finished my pre-lab... but I am stuck! What are the energies of (-)-menthone and...
I almost finished my pre-lab... but I am stuck! What are the energies of (-)-menthone and (+)-isomenthone? How can I find this (how to do it)? Thank you for the help!
I am stuck on approaching the answer to this question, and most of the answer that...
I am stuck on approaching the answer to this question, and most of the answer that I have found online and on Chegg are do not make any sense regarding this question. What would be a good interpretation of what this question is asking regarding the GDP? "Think about the increases since 2001 in spending for the Department of Homeland Security and the wars in Afghanistan and Iraq. These increases represent government expenditures that have increased GDP. Explain whether you...
Kind of stuck on this one with the JE. Need to know if I am on...
Kind of stuck on this one with the JE. Need to know if I am on the right track with this problem. Please help Thomas sells products that carry a 6-month manufacturer’s warranty. Customers have the opportunity at the time of purchase to also buy a 3 year extended warranty for an additional charge of $100. Thomas sold 1,000 units this year for $2,000 each. For this year’s sales, Thomas estimates the 6-month warranty costs to be 10% of sales....
I am stuck in Austin with a flat tire, and I need to get to my...
I am stuck in Austin with a flat tire, and I need to get to my class in Houston within 2 hours. The drive is 180 miles long, but I want to be careful of the Austin cops. I do not usually get caught speeding unless I am seen accelerating too fast, so I do not want to accelerate at a rate of more than 120 mi?/?h2. Just to be careful, I?m going to take exactly 2 hours to make...
I am stuck on these: Find or Calculate the derivative of the function. y = (x0.3...
I am stuck on these: Find or Calculate the derivative of the function. y = (x0.3 − 3x − 6)(x−1 + x−2) f(x) = (6.4x − 6)−2 + (4.3x − 1)−2 f(x) = 5x + x2 h(x) = 4x2 − x r(x) = ln(x + 1) + 3x9ex
I am stuck on the follow up question. This is the original questions. Discuss how the...
I am stuck on the follow up question. This is the original questions. Discuss how the bond market reacts when the Federal Reserve increases short term interest rates. How do short-term versus long-term bond prices react? How do Treasury bonds versus corporate bonds behave? Follow up: Rates are low within the Market. When you look to invest $10,000 (can't use it to pay bills), where do you put your money and why?
What do I do for this? I am confused in what to do. Choose a developmental...
What do I do for this? I am confused in what to do. Choose a developmental domain covered in Module 2 (motor, self-care, cognitive). Provide a rationale for why development in this area is important for overall child development. Provide three strategies you would recommend caregivers implement at home to advance their childrens' skills in the chosen developmental domain.I chose the Self-Care domain.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT