Question

In: Statistics and Probability

We are interested in estimating the proportion of graduates from Lancaster University who found a job...

We are interested in estimating the proportion of graduates from Lancaster University who found a job within one year of completing their undergraduate degree. Suppose we conduct a survey and find out that 354 of the 400 randomly sampled graduates found jobs. The number of students graduating that year was over 4000.

  1. (a) State the central limit theorem.

  2. (b) Why is the central limit theorem useful?

  3. (c) What is the population parameter of interest? What is the point estimate of this parameter?

  4. (d) What are the assumptions for constructing a confidence interval based on these data? Are they met?

  5. (e) Calculate a 95% confidence interval for the proportion of graduates who found a job within one year of completing their undergraduate degree. Interpret this within the context of the data.

  6. (f) Without doing any calculations, describe what would happen to the confidence interval if we decided to use a higher confidence level, e.g., 99%.

  7. (g) Without doing any calculations, describe what would happen to the confidence interval if we used a larger sample.

Solutions

Expert Solution

Solution

Part (a)

Central Limit Theorem

Let {X1, X2, …, Xn} be a sequence of n independent and identically distributed (i.i.d) random variables drawn from a distribution [i.e., {x1, x2, …, xn} is a random sample of size n] of expected value given by µ and finite variance given by σ2. Then, as n gets larger, the distribution of Z = {√n(Xbar − µ)/σ}, approximates the normal distribution with mean 0 and variance 1 (i.e., Standard Normal Distribution)

Or symbolically, Z = {√n(Xbar − µ)/σ} ~ N(0, 1) …………………………………………….....................................…………… (1a)

i.e., sample average from any distribution with mean µ and variance σ2, which is fairly symmetric, follows Normal Distribution with mean µ and variance σ2/n, if the sample size, n is large enough, say 30 or more......................... (1b)

In current scenario, distribution of sample proportion, phat can be approximated by Normal Distribution with mean = E(phat) and standard deviation = SE(phat)  Answer 1................................................................................. (1c)

Part (b)

Central Limit Theorem is useful because in many practical situations, the population distribution may not be known or the distribution may be complicated and difficult to handle analytically. In all such situations, CLT provides a very easy tool to handle the situation since Normal distribution is well researched and documented in terms of easy to handle probability tables. Answer 2

Part (c)

1. Population parameter of interest is the population proportion, i.e., contextually, the true proportion of graduates who found a job within one year of completing their undergraduate degree. Answer 3

2. Point estimate of this parameter is the sample proportion. Answer 4

Part (d)

Assumptions for constructing a confidence interval based on these data

CI in this case is based on Normality approximation, which requires that the sample size is large enough for both nphat and nphat(1 – phat) to be 10 or more. Answer 5

In the given situation, n = 400, phat = 354/400 = 0.885. So, conditions are met. Answer 6

Part (e)

100(1 - α) % Confidence Interval for the population proportion, p is: phat ± MoE, .....................................………………. (2)

where

MoE = Zα/2[√{phat (1 – phat)/n}] ……………………………………........................................................................………..(2a)

with

Zα/2 is the upper (α/2)% point of N(0, 1),

phat = sample proportion, and

n = sample size.

So, 95% confidence interval for the proportion of graduates who found a job within one year of completing their undergraduate degree is: [0.85, 0.92] Answer 7

Details of calculations

n

400

X

354

p' = phat

0.885

F = p'(1-p')/n

0.000254

sqrtF

0.015951

α

0.05

1 - (α/2)

0.975

Zα/2

1.959964

MoE

0.031264

LB

0.853736

UB

0.916264

Contextual Interpretation: There is only 2.5% chance that actual proportion of graduates who found a job within one year of completing their undergraduate degree could be less than 85.4% or more than 91.6%. Answer 8

Part (f)

When confidence level increases, vide (2) only MoE will change and vide (2a), that change also is effected only through the percentage point, Zα/2 which increases as confidence level increases.

Thus, the width of the CI will increase. Answer 9

Part (g)

As sample size increases, the SE would decrease since n is in the denominator.

Thus, the CI will narrow down. Answer 10

DONE


Related Solutions

We are interested in estimating the proportion of graduates at a mid-sized university who found a...
We are interested in estimating the proportion of graduates at a mid-sized university who found a job within one year of completing their undergraduate degree. We can do so by creating a 95% confidence interval for the true proportion p. Suppose we conduct a survey and find out that 340 of the 430 randomly sampled graduates found jobs within one year. Assume that the size of the population of graduates at this university is large enough so that all our...
1. We are interested in estimating the proportion of students at a university who smoke. Out...
1. We are interested in estimating the proportion of students at a university who smoke. Out of a random sample of 200 students from this university, 40 students smoke. (1) Calculate a 95% confidence interval for the proportion of students at this university who smoke and interpret this interval in context. (2) If we wanted the margin of error to be no larger than 2% at a 95% confidence level for the proportion of students who smoke, how big of...
A university interested in tracking its honors program believes that the proportion of graduates with a...
A university interested in tracking its honors program believes that the proportion of graduates with a GPA of 3.00 or below is less than 0.20. In a sample of 200 graduates, 30 students have a GPA of 3.00 or below. In testing the university’s belief, how does one define the population parameter of interest? Multiple Choice It’s the proportion of honors graduates with a GPA of 3.00 or below. It’s the standard deviation of the number of honors graduates with...
According to a report, the proportion of Lancaster University students who reported insufficient rest or sleep...
According to a report, the proportion of Lancaster University students who reported insufficient rest or sleep during each of the preceding 30 days is 8.0%, while this proportion is 8.8% for University of Cumbria students. These data are based on simple random samples of 11,545 Lancaster and 4,691 Cumbria students. (a) Calculate a 95% confidence interval for the difference between the proportions of Lancaster and Cumbria students who are sleep deprived and interpret it in the context of the data....
4. The university is interested in determining if the proportion of graduates obtaining a first-class degree...
4. The university is interested in determining if the proportion of graduates obtaining a first-class degree has changed from 2016 to 2017. Out of 2800 graduates in 2016, 560 obtained a first class degree. In 2017, 805 graduates out of 3500 obtained a first class degree. (a) Write down the method of moments estimates for the proportion of first-class degrees in 2016 and 2017, pˆ2016 and pˆ2017. (b) Write down appropriate null and alternative hypotheses for this test. (c) What...
A regional hardware chain is interested in estimating the proportion of their customers who own their...
A regional hardware chain is interested in estimating the proportion of their customers who own their own homes. There is some evidence to suggest that the proportion might be around 0.70. Given this, what sample size is required if they wish a 90 percent confidence level with a margin of error of ± .025? About 355 Almost 1,300 Approximately 910 100
Suppose we are interested in the proportion of nursing majors at a university, and we take...
Suppose we are interested in the proportion of nursing majors at a university, and we take a random sample of 150 students to estimate the percent of students in our class who are nursing majors. What is the population? What is the sample? What is the variable? Is the variable qualitative or quantitative?
A university dean is interested in determining the proportion of students who receive some sort of...
A university dean is interested in determining the proportion of students who receive some sort of financial aid. Rather than examine the records for all students, the dean randomly selects 225 students and finds that 45 of them are receiving financial aid. Using a 95% confidence interval, what is the upper limit of the confidence interval to estimate the true proportion of students who receive financial aid.   Using a 95% confidence interval, what is the upper limit of the confidence...
We are interested to estimate the proportion of the population who favor a candidate. Suppose that...
We are interested to estimate the proportion of the population who favor a candidate. Suppose that 210 of the people in a sample of 500 favored the candidate. (a) What is the proportion estimate, p-hat, and the standard error? (b) Find the 90% confidence interval for the proportion of the population who favor the candidate. Interpret result.
Caraline is interested in estimating the proportion of students at a certain college who have at least two written final exams.
Caraline is interested in estimating the proportion of students at a certain college who have at least two written final exams. She takes a random sample and finds that 60 of the 75 students she surveyed did indeed have at least 2 written finals. Compute a 99% confindence interval for her and interpret it.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT