Question

A description of workers. Each year, the Census Bureau selects a different random sample of more than 3 million households to be interviewed in the American Community Survey (ACS). The dataset that has been assigned to you contains information on a small random sample of workers of a particular state interviewed in the ACS 2016.

Notes: In your answers, use up to one decimal place when the number is not an integer. If the number is close to zero (e.g., 0.0006), use up to four decimal places. Show your work to get full credit. Where applicable, indicate which statistical function of Excel you used to compute the estimate. Use the dataset that was assigned. If you use a different dataset, your homework will not be graded.

(a) Describe the structure of the data set. In your answer include the population of interest, sample size, number of variables, type of variables (i.e., categorical (nominal/ordinal) or quantitative (discrete, continuous)), and type of data (i.e., cross-sectional, time series, or longitudinal data). (10 points)

(b) If each year the Census Bureau interviewed the same sample of households, what would be the type of dataset generated by the ACS in this case? Explain. (3 pts)

(c) Use the earnings (WAGP) of the first ten workers to calculate $\sum x_i$ and $\sum (x_i - \bar{x})^2$ (i.e., the sum of the squared deviations) of the workers' earnings. Use these sums to compute the sample mean and the sample standard deviation of these workers' earnings. (10 pts)

Solutions

Expert Solution

(NOTE: The question refers to a dataset assigned to the student, but that dataset is not attached here. This is therefore a general solution, written so that the functions and methods it describes can be applied to whatever data is available. Where helpful, the corresponding R functions are also given.)

(a) Data about data is called metadata. For this dataset it includes the type of each variable (qualitative or quantitative, discrete or continuous), the total number of variables, the number of observations, and the type of data (cross-sectional, time series, or longitudinal). If one prefers R for the analysis, the function str(your_dataset) reports the number of observations, the number of variables, and the storage type of each variable.

The dplyr package in R has a function called glimpse(your_dataset), which gives similar information about the dataset's columns.
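For readers without R, the same kind of structural summary can be sketched in plain Python. This is only an illustration under stated assumptions: the rows below are made up, WAGP is the only column name taken from the question, and AGE and SEX are hypothetical column names.

```python
# Hypothetical mini-sample: only WAGP is named in the assignment; the other
# column names and all values are made up for illustration.
rows = [
    {"WAGP": 32000, "AGE": 41, "SEX": "F"},
    {"WAGP": 18500, "AGE": 23, "SEX": "M"},
    {"WAGP": 54000, "AGE": 37, "SEX": "F"},
]


def describe_structure(rows):
    """Return (sample size, number of variables, {variable: rough type})."""
    variables = list(rows[0].keys())
    types = {}
    for name in variables:
        values = [row[name] for row in rows]
        # Treat a column as quantitative only if every value is a number.
        numeric = all(
            isinstance(v, (int, float)) and not isinstance(v, bool)
            for v in values
        )
        types[name] = "quantitative" if numeric else "categorical"
    return len(rows), len(variables), types


n_obs, n_vars, var_types = describe_structure(rows)
print(n_obs, "observations,", n_vars, "variables")
for name, kind in var_types.items():
    print(name, "->", kind)
```

A check like this only looks at how values are stored. A categorical variable that is coded with numbers would be reported as quantitative, so the codebook still has to be consulted to decide between nominal, ordinal, discrete, and continuous.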

(b) If the ACS interviewed the same set of households each year, the dataset would be longitudinal (panel) data, because the same units are observed repeatedly over time. The yearly samples would then be dependent samples, and the same households, and hence the same sample size, would appear in every year. If one uses a t-test to compare the mean over two years, the paired-sample t-test must be used instead of the independent-samples test, and the assumptions of the test change accordingly.
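As a rough illustration of the paired-sample idea, the sketch below computes the paired t statistic from the year-to-year differences of the same households. The earnings values are hypothetical and the function name paired_t is only an illustrative helper, not part of the original solution.

```python
import math
import statistics

# Hypothetical earnings for the same five households in two survey years.
year1 = [30000, 42000, 25000, 51000, 38000]
year2 = [31000, 45000, 25500, 50000, 40000]


def paired_t(before, after):
    """Return the paired t statistic and its degrees of freedom."""
    # With a panel, each household is compared with itself.
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD of the differences
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1


t_stat, df = paired_t(year1, year2)
print(f"t = {t_stat:.3f} on {df} degrees of freedom")
```

The statistic is built from one column of differences, which is why the degrees of freedom are n - 1 rather than the larger value an independent-samples test would use.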

(c) To get the sum of squared deviations of the WAGP column for the first 10 workers, Excel has a direct function:

=DEVSQ(<select the first 10 values in the WAGP column, close the parenthesis, and press Enter>)

Or, to calculate it manually:

i) Calculate the mean of the first 10 WAGP values with =AVERAGE(<first 10 values in the WAGP column>); call it AVG.

ii) Create a new column by subtracting AVG from each of the 10 WAGP values.

iii) Square each of these deviations, then add all 10 squared values with =SUM(<data range>).

The sum of the earnings themselves is =SUM(<first 10 values in the WAGP column>). From the two sums, the sample mean is Σx / 10 and the sample standard deviation is the square root of Σ(x − x̄)² / 9, since n − 1 = 9. As a check, the sample standard deviation can be calculated directly with =STDEV(<data range>) (or =STDEV.S in Excel 2010 and later), and the sample mean with =AVERAGE(<data range, i.e. the 10 values>).
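The calculation in part (c) can also be sketched in Python as a cross-check on the spreadsheet. The ten WAGP values below are hypothetical, since the assigned dataset is not available, and should be replaced with the first ten values from your own data.

```python
import math
import statistics

# Hypothetical WAGP values for ten workers (made up; use your assigned data).
wagp = [25000, 40000, 32000, 18000, 60000, 45000, 30000, 52000, 28000, 36000]

n = len(wagp)
total = sum(wagp)                                # sum of x
mean = total / n                                 # sample mean = sum of x / n
sum_sq_dev = sum((x - mean) ** 2 for x in wagp)  # what Excel's DEVSQ returns
sample_sd = math.sqrt(sum_sq_dev / (n - 1))      # divide by n - 1, not n

print(f"sum = {total}, mean = {mean:.1f}")
print(f"sum of squared deviations = {sum_sq_dev:.1f}, s = {sample_sd:.1f}")

# Cross-check against the library routine, which also divides by n - 1.
print(math.isclose(sample_sd, statistics.stdev(wagp)))
```

Dividing by n - 1 rather than n matches Excel's STDEV, which computes the sample (not population) standard deviation.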

