Question

In: Statistics and Probability

Purpose: • To create and interpret confidence intervals for the population proportion or population mean. •...

Purpose:

• To create and interpret confidence intervals for the population proportion or population mean.

• To do hypothesis testing on a population proportion or population mean. Due Date: Nov 27, 2018 at the beginning of class.

What you must deliver:

1. Formulate a statistical hypothesis. 2. Develop a data production strategy. 3. Collect sample data. 4. Solutions to the questions (See below). 5. Reflection.

Suggested ideas to consider:

• Proportion of students at Cañada College who can raise one eyebrow without raising the other eyebrow.

• Mean age of cars driven by (statistics) students and/or mean age of cars driven by faculty.

• Proportion of students at Cañada College who can correctly identify the President, the Vice President, and the Secretary of State.

• Proportion of students at Cañada College who are over the age of 18 and are registered to vote.

• Mean age of evening class student at Cañada College

. • Proportion of student cars that are (white).

• Mean number of hours that students work at Cañada College each week.

• Mean age of books (based on copyright dates) from the library.

• Proportion of books that are over years from the library.

• Proportion of pages of a sample of different issues that contain advertising

GRADING RUBRIC: Total Score (50) 1. Collect sample data. (5 points) 2. Solutions to the questions (40 points Total) - Summary of data (5) - Compute margin of error correctly (5) - Compute confidence interval correctly (10) - Perform the hypothesis test correctly (15) - Interpret the result of the test correctly (5) 3. Reflection. (5 points)

Explore your own Data Set.

1.Select a research question from the given list, or make up your own question

. Write down the question selected

. 2. Decide whether you would use the point estimate for population mean or population proportion.

Describe the population you are targeting.

3. Collect the data. Collect a minimum of 31 sample data. Proper data collection methods (i.e. randomization) should be used if possible. If proper methods cannot be used, then this must be acknowledged and the reasoning for using the less than proper methods explained. Describe how you obtain your data in 3-5 sentences.

4. Summarize the data. Use additional pages if necessary. a. You must provide ALL of your sample data based on the topic you choose. b. Identify �, �̂, �, and/or �̅where appropriate. c. List the sample size and determine the necessary data values to do the calculation. Use the correct variables. d. Find the 75%, 95%, and 99% confidence intervals. (Do all three) e. Determine the Margin of Error for the 75%, 95%, and 99% confidence intervals.

5. Interpret the results of the confidence interval.

6. Hypothesis Testing. a. Formulate your statistical claim against a population proportion or a population mean. (i.e. Less than 30% of the students at Cañada College…..) b. Show the seven steps to your hypothesis testing and its result. c. Identify which test (left-tail, right-tail, two-tail), which distribution (z-Test statistics or t-Test statistics), and which method (Critical Value Method or P-Value Method) you used. d. Supply all necessary work with diagrams.

7. Interpret the results of the hypothesis testing. STEPS 1-7 can be hand-written, in a legible manner.

8. Reflection: Each student must write up a half-page to one-page reflection, typed, choosing three of the following questions

. a. What were your overall thoughts about this project? Explain any surprises.

b. How did this project help you understand statistics better?

c. Do you feel you worked as efficiently as possible? What can you do to improve your efficiency?

d. Explain how this project is relevant to something you have experienced or seen in the real world?

Solutions

Expert Solution

Let’s move on to see how confidence intervals account for that margin of error. To do this, we’ll use the same tools that we’ve been using to understand hypothesis tests. I’ll create a sampling distribution using probability distribution plots, the t-distribution, and the variability in our data. We'll base our confidence interval on the energy cost data set that we've been using.

When we looked at significance levels, the graphs displayed a sampling distribution centered on the null hypothesis value, and the outer 5% of the distribution was shaded. For confidence intervals, we need to shift the sampling distribution so that it is centered on the sample mean and shade the middle 95%.

The shaded area shows the range of sample means that you’d obtain 95% of the time using our sample mean as the point estimate of the population mean. This range [267 394] is our 95% confidence interval.

Using the graph, it’s easier to understand how a specific confidence interval represents the margin of error, or the amount of uncertainty, around the point estimate. The sample mean is the most likely value for the population mean given the information that we have. However, the graph shows it would not be unusual at all for other random samples drawn from the same population to obtain different sample means within the shaded area. These other likely sample means all suggest different values for the population mean. Hence, the interval represents the inherent uncertainty that comes with using sample data.

You can use these graphs to calculate probabilities for specific values. However, notice that you can’t place the population mean on the graph because that value is unknown. Consequently, you can’t calculate probabilities for the population mean, just as Neyman said!

Why P Values and Confidence Intervals Always Agree About Statistical Significance

You can use either P values or confidence intervals to determine whether your results are statistically significant. If a hypothesis test produces both, these results will agree.

The confidence level is equivalent to 1 – the alpha level. So, if your significance level is 0.05, the corresponding confidence level is 95%.

  • If the P value is less than your significance (alpha) level, the hypothesis test is statistically significant.
  • If the confidence interval does not contain the null hypothesis value, the results are statistically significant.
  • If the P value is less than alpha, the confidence interval will not contain the null hypothesis value.

For our example, the P value (0.031) is less than the significance level (0.05), which indicates that our results are statistically significant. Similarly, our 95% confidence interval [267 394] does not include the null hypothesis mean of 260 and we draw the same conclusion.

To understand why the results always agree, let’s recall how both the significance level and confidence level work.

  • The significance level defines the distance the sample mean must be from the null hypothesis to be considered statistically significant.
  • The confidence level defines the distance for how close the confidence limits are to sample mean.

Both the significance level and the confidence level define a distance from a limit to a mean. Guess what? The distances in both cases are exactly the same!

The distance equals the critical t-value * standard error of the mean. For our energy cost example data, the distance works out to be $63.57.

Imagine this discussion between the null hypothesis mean and the sample mean:

Null hypothesis mean, hypothesis test representative: Hey buddy! I’ve found that you’re statistically significant because you’re more than $63.57 away from me!

Sample mean, confidence interval representative: Actually, I’m significant because you’re more than $63.57 away from me!

Very agreeable aren’t they? And, they always will agree as long as you compare the correct pairs of P values and confidence intervals. If you compare the incorrect pair, you can get conflicting results, as shown by common mistake #1 in this post.

Closing Thoughts

In statistical analyses, there tends to be a greater focus on P values and simply detecting a significant effect or difference. However, a statistically significant effect is not necessarily meaningful in the real world. For instance, the effect might be too small to be of any practical value.

It’s important to pay attention to the both the magnitude and the precision of the estimated effect. That’s why I'm rather fond of confidence intervals. They allow you to assess these important characteristics along with the statistical significance. You'd like to see a narrow confidence interval where the entire range represents an effect that is meaningful in the real world.

If you like this post, you might want to read the previous posts in this series that use the same graphical framework:

  • Part One: Why We Need to Use Hypothesis Tests
  • Part Two: Significance Levels (alpha) and P values

Related Solutions

Confidence Intervals for a proportion and mean Do all steps in the confidence interval: a) Check...
Confidence Intervals for a proportion and mean Do all steps in the confidence interval: a) Check when easy the requirements for the interval (t-interval) b) Create a summary of the information that goes into the interval c) Write out the formula for the interval d) Replace the symbols in the formula with the numbers from (b) e) Produce the interval f) Interpret the interval In 1998, as an advertising campaign, the Nabisco Company announced a "1000 Chips Challenge," claiming that...
What are the confidence intervals for a population mean? Provide examples.
Question 2 What are the confidence intervals for a population mean? Provide examples. Question 3 A confidence interval for the population mean when the population follows the normal distribution and the population standard deviation is known is computed by? Provide examples.
Using the data down and interpret 95% confidence intervals for the mean age of an American...
Using the data down and interpret 95% confidence intervals for the mean age of an American truck driver.   This data represents a random sample of drivers in America. There are about 3.5 million truck drivers in the USA. Find:1- Sample Standard Deviation. 2- Sample Mean. 3- Sample size. 4- Standard error of the mean. 5-T-value. 6- Interval half-width. 7-Interval lower limit. 8- Interval upper limit  . Please use this data. Truck Drivers Employee Gender Age Total education years 1 M 30...
Use technology to construct the confidence intervals for the proportion variance sigma2 and the population standard...
Use technology to construct the confidence intervals for the proportion variance sigma2 and the population standard deviation sigma. Assume the sample is taken from a normally distributed population. c=0.90, s2=10.89, n=25 The confidence interval for the population variance is (?,?) The confidence interval for the population standard deviation is (?,?)
how to interpret confidence intervals and how NOT to interpret them. What are the assumptions to...
how to interpret confidence intervals and how NOT to interpret them. What are the assumptions to justify the use of hypothesis testing? If the null hypothesis is rejected, what can we conclude? If we know that 60% of ASU students like the parking and 50% of the community as a whole likes the parking, and the difference between the sample and population are tested, with the null rejected, what do we conclude? Is the difference significant? Not significant? Are ASU...
Construct and interpret 95% confidence intervals for the difference in mean pain intensity at 14 days...
Construct and interpret 95% confidence intervals for the difference in mean pain intensity at 14 days after treatment. (Use μVertebroplasty − μPlacebo. Round your answers to two decimal places.) -1.188 Correct: Your answer is correct. to .788 Correct: Your answer is correct. Interpret the interval. There is a 95% chance that the true mean pain intensity 14 days after treatment for the vertebroplasty treatment is directly in the middle of these two values. There is a 95% chance that the...
construct and interpret a 78% confidence interval for the population proportion with n=1200 and x=400
construct and interpret a 78% confidence interval for the population proportion with n=1200 and x=400
This is a question regarding Statistics. Calculate and interpret a confidence interval for a population mean....
This is a question regarding Statistics. Calculate and interpret a confidence interval for a population mean. given a normal distribution with 1) a known variance2) an unknown population variance or 3) an unknown variance and a large sample size when sampling from a normal distribution, why test statistic no matter small(n<30) or large(n>=30) we choose z-statistic?(please give an example) Thanks  
Give and interpret the 95% confidence intervals for males and a second 95% confidence interval for...
Give and interpret the 95% confidence intervals for males and a second 95% confidence interval for females on the SLEEP variable. Which is wider and why? Known values for Male and Female: Males: Sample Size = 17; Sample Mean = 7.765; Standard Deviation = 1.855 Females: Sample Size = 18; Sample Mean = 7.667; Standard Deviation = 1.879 Using t-distribution considering sample sizes (Male/Female count) are less than 30
Should statistical analyses, i.e., confidence intervals for the population mean and hypotheses testing about the population...
Should statistical analyses, i.e., confidence intervals for the population mean and hypotheses testing about the population mean, be conducted using the data on the original or on the log-transformed scale? The original data is not normal and the log-transformed scale is normal.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT