Question

In: Statistics and Probability

An agency wants to examine the distribution of wages F(x) of graduates with bachelor degrees. For...

An agency wants to examine the distribution of wages F(x) of graduates with bachelor degrees. For that purpose they receive from US Census Bureau database a sample X1,…, Xn of   10,000 numeric records of wages for randomly chosen graduates.

Find the algorithm to provide an interval of wages which cover the 50% central part of distribution F(x) with approximate confidence 95%. How can you evaluate the accuracy of that estimation?

Solutions

Expert Solution

Given the data, we calculate the following quantities:

a) The sample size

This is the size of the sample out of 10,000 records. Note that n should be at least 30 to use the normal approximation.

b) The sample mean

This corresponds to 50% central part of the distribution.

c) The sample standard deviation

As the sample size is large, we use the normal approximation for the distribution. We use a z-table for converting probability to the z-score.

d) The 95% confidence interval corresponds to a probability range of 0.025 to 0.975. Using a z-table, we get the corresponding z-score for 0.975 as 1.96.

e) In the final step, we get the confidence interval as:

All the quantities have been calculated in the previous steps.

To evaluate the estimation, we take more samples of size from the same data and confirm that at least 95% of those samples have their mean in the above confidence interval.


Related Solutions

Finding F critical for Variances Use the F-distribution to find the degrees of freedon for the...
Finding F critical for Variances Use the F-distribution to find the degrees of freedon for the numerator (d.f.N.), the degrees of freedom for the Denominator (d.f.D.) and the critical F-value Use the closest value when looking up the d.f.N. and d.f.D. in the tables. Test alpha α Sample 1 Sample 2 d.f.N. d.f.D. F critical Right 0.01 s12=37 n1=14 s22=89 n2=25 Two-tailed 0.10 s12=164 n1=21 s22=53 n2=17 Right 0.05 s12=92.8 n1=11 s22=43.6 n2=11
A random sample of size n from a distribution by f(x) = 2x and F(x) =...
A random sample of size n from a distribution by f(x) = 2x and F(x) = x^2 ; 0 < x < 1. Let R = X(n) − X(1) be the range of the sample. Give a general form of the density function of R
Dr. Lee wants to calculate the mean of a t distribution with ν = 4 degrees...
Dr. Lee wants to calculate the mean of a t distribution with ν = 4 degrees of freedom. However, he doesn’t remember the full probability density function (pdf) of the t distribution. Luckily, he remembers the pdf of the t distribution up to some constant. f(x) ∝ (1 + x2/ν)−(ν+1)/2, −∞ < x < ∞. 1. Use the rejection sampling to generate a random sample of size 10,000 from the t dis- tribution with ν = 4 degrees of freedom....
The temperature X in degrees Fahrenheit (F) of a particular chemical reaction is known to be...
The temperature X in degrees Fahrenheit (F) of a particular chemical reaction is known to be distributed between 220 and 280 degrees with a probability density function of fX(x) = (x − 190)/3600. A value of X degrees Fahrenheit can be converted to Y degrees Celsius (C) by taking Y = (5/9)(X − 32). Determine the following in both degrees F and degrees C: (a) the mean of the distribution (b) the variance of the distribution (c) the cumulative distribution...
Consider an exponential distribution f(x|θ) = θe^(−θx) for x > 0. Let the prior distribution for...
Consider an exponential distribution f(x|θ) = θe^(−θx) for x > 0. Let the prior distribution for θ be f(θ) = e^ −θ for θ > 0. (a) Show that the posterior distribution is a Gamma distribution. With what parameters? (b) Find the Bayes’ estimator for θ.
If the joint probability distribution of X and Y f(x, y) = (x + y)/2
If the joint probability distribution of X and Y f(x, y) = (x + y)/2, x=0,1,2,3; y=0,1,2, Compute the following a. P(X≤2,Y =1) b. P(X>2,Y ≤1) c. P(X>Y) d. P(X+Y=4)
Let the continuous random variable X have probability density function f(x) and cumulative distribution function F(x)....
Let the continuous random variable X have probability density function f(x) and cumulative distribution function F(x). Explain the following issues using diagram (Graphs) a) Relationship between f(x) and F(x) for a continuous variable, b) explaining how a uniform random variable can be used to simulate X via the cumulative distribution function of X, or c) explaining the effect of transformation on a discrete and/or continuous random variable
The random variable X has a continuous distribution with density f, where f(x) ={x/2−5i f10≤x≤12 ,0...
The random variable X has a continuous distribution with density f, where f(x) ={x/2−5i f10≤x≤12 ,0 otherwise. (a) Determine the cumulative distribution function of X.(1p) (b) Calculate the mean of X.(1p) (c) Calculate the mode of X(point where density attains its maximum) (d) Calculate the median of X, i.e. a number m such that P(X≤m) = 1/2 (e) Calculate the mean of the random variable Y= 12−X (f) Calculate P(X^2<121)
Given below is a bivariate distribution for the random variables x and y. f(x, y) x...
Given below is a bivariate distribution for the random variables x and y. f(x, y) x y 0.3 50 80 0.2 30 50 0.5 40 60 (a) Compute the expected value and the variance for x and y. E(x) = E(y) = Var(x) = Var(y) = (b) Develop a probability distribution for x + y. x + y f(x + y) 130 80 100 (c) Using the result of part (b), compute E(x + y) and Var(x + y). E(x...
Consider a distribution with the density function f(x) = x^2/3 for −1 ≤ x ≤ 2....
Consider a distribution with the density function f(x) = x^2/3 for −1 ≤ x ≤ 2. (a) Randomly pick a sample of size 20 from this distribution, find the probability that there are 2 to 4 (inclusive) of these taking negative values. (b) Randomly pick an observation X from this distribution, find the probability that it is between 1.2 and 1.4, i.e., P (1.2 < X < 1.4). (c) Randomly pick a sample of size 40 from this distribution, and...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT