Question

Use the Happy 1 variable for this exercise. Suppose someone claims the population mean is 55, and the standard deviation is 10.

PART 1 - For now, assume both of the claims about the population are correct.

1a. Given the assumed population mean and standard deviation, calculate the probability of observing a value above the first data point in your data set (which is 36).

1b. Suppose you collected 8 new data points in a new sample. Calculate the probability that the mean of these 8 new data points is above the number for your first data point in your file.

1c. If this is a normally distributed variable, above what value should you find 70% of data points? How many of the values from your data set are above this value?

1d. If this is a normally distributed variable, between what two numbers (centered around the assumed mean) should you find 68% of data points? What percentage of your data points are between these numbers?

1e. Think about your answers to 1c and 1d. Does this variable appear to be normally distributed with this mean and standard deviation?

Happy1
36
18
66
43
28
39
47
40
24
46
48
57
36
58
39
62
43
65
74
36
39
44
61
50
47
63
60
38
45
51
55
46
68
32
42
38
61
45
31
32
44
30
29
62
49
54
64
38
49
55
28
53
55
52
50
54
76
28
49
70
29
34
77
40
50
40
56
54
36
51
42
71
45
53
55
37
51
36
39
36
51
40
51
52
53
33
66
37
76
67
55
46

Expert Solution

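1a) In this question we want to calculate $P(X > 36)$, where $X$ is assumed to be normal with mean $\mu = 55$ and standard deviation $\sigma = 10$.

so

$$P(X > 36) = P\left(Z > \frac{36 - 55}{10}\right) = P(Z > -1.9)$$

or

$$P(X > 36) = 1 - \Phi(-1.9) = \Phi(1.9) \approx 0.9713$$

Hence the probability of observing a value greater than 36 is approximately 0.97.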
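The same calculation can be checked numerically. The following is a minimal Python sketch, assuming the population really is normal with mean 55 and standard deviation 10; the variable names (`population`, `first_point`, `p_above`) are illustrative and not part of the exercise.

```python
from statistics import NormalDist

# Assumed population (from the claim): normal, mean 55, standard deviation 10
population = NormalDist(mu=55, sigma=10)

first_point = 36  # first value in the Happy1 column

# Standardize the data point, then take the upper-tail probability
z = (first_point - population.mean) / population.stdev
p_above = 1 - population.cdf(first_point)

print(f"z = {z:.2f}")
print(f"P(X > {first_point}) = {p_above:.4f}")
```

Running it should report z = -1.90 and a probability of about 0.9713, in line with the z-table result.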

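1b) Now we take 8 new data points in a new sample. We use the sampling distribution of the sample mean $\bar{X}$: assuming a normal population as in 1a, the sample mean is normally distributed with mean $\mu$ and standard deviation $\sigma / \sqrt{n}$, where

$\mu$ = Mean of the population = 55

$\sigma$ = Standard Deviation of the population = 10

$n$ = Sample Size = 8

For this question as well, we want to calculate the probability of exceeding 36, this time for the sample mean:

$$P(\bar{X} > 36) = P\left(Z > \frac{36 - 55}{10 / \sqrt{8}}\right)$$

or,

$$P(\bar{X} > 36) = P(Z > -5.37)$$

or

$$P(\bar{X} > 36) \approx 1$$

Hence, if we take 8 new data points in a new sample, the probability that the sample mean is greater than 36 is approximately equal to 1.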
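As a quick numerical check of the sample-mean calculation, here is a short Python sketch. It assumes a normal population with mean 55 and standard deviation 10, so that the mean of 8 observations is normal with standard error 10/√8; the names `standard_error` and `sample_mean_dist` are illustrative only.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 55, 10, 8  # assumed population mean, SD, and new sample size
threshold = 36            # first value in the Happy1 column

# Standard deviation of the sample mean (standard error)
standard_error = sigma / sqrt(n)

# Sampling distribution of the mean of n observations
sample_mean_dist = NormalDist(mu=mu, sigma=standard_error)

z = (threshold - mu) / standard_error
p_above = 1 - sample_mean_dist.cdf(threshold)

print(f"standard error = {standard_error:.4f}")
print(f"z = {z:.2f}")
print(f"P(sample mean > {threshold}) = {p_above:.6f}")
```

The z-score is far out in the left tail (about -5.37), so the upper-tail probability is 1 to several decimal places.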

1c) We want to know the value such that 70% of the data points lie above it.

In other words, we want the 30th percentile of Happy1 under the assumed normal distribution: 30% of values fall below it and 70% fall above it.

Hence, the required z-score is the one with 70% of the area above it, which is approximately -0.524 (from the z-table). It is negative because a cutoff with more than half of the distribution above it must lie below the mean.

Let the required value of the data point be $x$.

So,

$$\frac{x - 55}{10} = -0.524$$

or

$$x = 55 - 0.524 \times 10 = 49.76$$

Hence, 70% of the data points should have a value greater than 49.76.

In the given data set, only 42 of the 92 data points (about 46%) have a value greater than 49.76, which contradicts the expectation that 70% of the data lie above this value.
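The cutoff and the count can be checked with a short Python sketch. It assumes the normal model with mean 55 and standard deviation 10 and uses the Happy1 values from the question. Note that a value with 70% of the distribution above it is the 30th percentile, so the cutoff sits below the mean. The names `happy1`, `cutoff`, and `count_above` are illustrative.

```python
from statistics import NormalDist

# Happy1 values copied from the question (92 observations)
happy1 = [
    36, 18, 66, 43, 28, 39, 47, 40, 24, 46, 48, 57,
    36, 58, 39, 62, 43, 65, 74, 36, 39, 44, 61, 50,
    47, 63, 60, 38, 45, 51, 55, 46, 68, 32, 42, 38,
    61, 45, 31, 32, 44, 30, 29, 62, 49, 54, 64, 38,
    49, 55, 28, 53, 55, 52, 50, 54, 76, 28, 49, 70,
    29, 34, 77, 40, 50, 40, 56, 54, 36, 51, 42, 71,
    45, 53, 55, 37, 51, 36, 39, 36, 51, 40, 51, 52,
    53, 33, 66, 37, 76, 67, 55, 46,
]

population = NormalDist(mu=55, sigma=10)  # assumed population

# 70% of the distribution lies above the 30th percentile
cutoff = population.inv_cdf(0.30)

count_above = sum(1 for x in happy1 if x > cutoff)
share_above = count_above / len(happy1)

print(f"cutoff = {cutoff:.2f}")
print(f"{count_above} of {len(happy1)} values above cutoff ({share_above:.1%})")
```

Under the assumed model about 70% of the values should exceed the cutoff, so the observed share can be compared directly against that figure.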

1d) We again assume this variable to be normally distributed. This time we want to know the range of values, centered on the assumed mean, within which 68% of the data points should lie.

In other words, we want the central 68% interval of the assumed distribution for Happy1, which leaves equal areas in the two tails.

$\alpha$ = 1 - 0.68 = 0.32

$\alpha$/2 = 0.16

Two-tailed value of z for $\alpha$ = 0.32: $z_{\alpha/2}$ = 0.995 (observed from the z-table)

Now, the expression for the central 68% interval for the variable Happy1 can be given by

( $\mu$ - $z_{\alpha/2}$ * $\sigma$ , $\mu$ + $z_{\alpha/2}$ * $\sigma$ )

= ( 55 - 0.995 * 10 , 55 + 0.995 * 10 )

= (45.05 , 64.95)

Hence, we can say that 68% of the values should lie between 45.05 and 64.95.

In the data set, we find that 40 of the 92 values (about 43%) lie in this interval. This is less than 50% of the values, which contradicts the expectation that 68% of the values should lie in this interval.
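The interval and the count inside it can be reproduced with the Python sketch below. It assumes the normal model with mean 55 and standard deviation 10 and uses the Happy1 values from the question. The bounds come from the inverse normal CDF rather than a rounded table value, so they may differ from the hand calculation in the second decimal place. The names `happy1`, `lower`, `upper`, and `count_inside` are illustrative.

```python
from statistics import NormalDist

# Happy1 values copied from the question (92 observations)
happy1 = [
    36, 18, 66, 43, 28, 39, 47, 40, 24, 46, 48, 57,
    36, 58, 39, 62, 43, 65, 74, 36, 39, 44, 61, 50,
    47, 63, 60, 38, 45, 51, 55, 46, 68, 32, 42, 38,
    61, 45, 31, 32, 44, 30, 29, 62, 49, 54, 64, 38,
    49, 55, 28, 53, 55, 52, 50, 54, 76, 28, 49, 70,
    29, 34, 77, 40, 50, 40, 56, 54, 36, 51, 42, 71,
    45, 53, 55, 37, 51, 36, 39, 36, 51, 40, 51, 52,
    53, 33, 66, 37, 76, 67, 55, 46,
]

population = NormalDist(mu=55, sigma=10)  # assumed population

# Central 68%: leave (1 - 0.68) / 2 = 0.16 in each tail
tail = (1 - 0.68) / 2
lower = population.inv_cdf(tail)
upper = population.inv_cdf(1 - tail)

count_inside = sum(1 for x in happy1 if lower < x < upper)
share_inside = count_inside / len(happy1)

print(f"interval = ({lower:.2f}, {upper:.2f})")
print(f"{count_inside} of {len(happy1)} values inside ({share_inside:.1%})")
```

If the assumed mean and standard deviation were right, the printed share should be close to 68%.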

1e) The answers to 1c and 1d clearly contradict the assumption that this data set is normally distributed with the stated mean and standard deviation. In both cases, far fewer data points fall in the expected range than the assumed distribution predicts. Because of these inconsistent results, we have to say that the assumption is not true. That is, the variable does not appear to be normally distributed with mean 55 and standard deviation 10.

We can also show this by plotting the data in Excel.

Such a plot shows that the data points do not follow the assumed normal distribution.

Hence, the assumption is not true, and that is why we get contradictory results in 1c and 1d.
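One further check, not part of the original answer, is to compare the claimed parameters with the sample mean and sample standard deviation of the data itself. The Python sketch below does this using only the standard library; the names `happy1`, `sample_mean`, and `sample_sd` are illustrative.

```python
from statistics import mean, stdev

# Happy1 values copied from the question (92 observations)
happy1 = [
    36, 18, 66, 43, 28, 39, 47, 40, 24, 46, 48, 57,
    36, 58, 39, 62, 43, 65, 74, 36, 39, 44, 61, 50,
    47, 63, 60, 38, 45, 51, 55, 46, 68, 32, 42, 38,
    61, 45, 31, 32, 44, 30, 29, 62, 49, 54, 64, 38,
    49, 55, 28, 53, 55, 52, 50, 54, 76, 28, 49, 70,
    29, 34, 77, 40, 50, 40, 56, 54, 36, 51, 42, 71,
    45, 53, 55, 37, 51, 36, 39, 36, 51, 40, 51, 52,
    53, 33, 66, 37, 76, 67, 55, 46,
]

claimed_mean, claimed_sd = 55, 10  # the claims being examined

sample_mean = mean(happy1)
sample_sd = stdev(happy1)  # sample standard deviation (n - 1 denominator)

print(f"claimed mean = {claimed_mean}, sample mean = {sample_mean:.2f}")
print(f"claimed SD   = {claimed_sd}, sample SD   = {sample_sd:.2f}")
```

The sample mean comes out near 47.8, noticeably below the claimed 55, which suggests the mismatch in 1c and 1d may stem largely from the claimed mean rather than from the shape of the distribution alone.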

orchestra answered 2 years ago
