Question

In: Statistics and Probability

data set West SouthEast MidWest NewEngland 4 3 2 3 7 2 2 3 8 4...

data set

West SouthEast MidWest NewEngland
4 3 2 3
7 2 2 3
8 4 8 4
3 2 9 8
4 8 10 7
4 3 12 3
4 5 1 5
5 6 9 6
6 2 9 2
10 4 3 2
9 3 6 1
7 3 5 4
7 9 6 3
4 2 4 3
3 3 4 5
2 1 4 10
1 2 4 10
2 1 2 9
3 2 2 11
9 2 1
1 1
1
3
2

Part 1
Using the same data set, analyze the data for Midwest region. Do the following:
(1) Obtain a histogram and boxplot for the data (revisit one of the earlier lab reports).
(2) You need to test whether the mean number of years of education in this region is significantly different from 5.5. State the hypotheses.
(3) Perform an appropriate test of hypotheses; copy and paste the results.
(4) Identify the test statistic and the p-value, and state the decision.
(5) Interpret the p-value (see Week 5 module), and answer the research question.

Part 2
Using the same data set, consider the data for Southeast and West regions. Do the following:
(1) You need to test whether there is a significant difference in the mean number of years of education beyond high school between the two regions. State the hypotheses.
(2) Perform an appropriate test of hypothesis; copy and paste the results.
(3) Identify the test statistic and the p-value, and state the decision.
(4) Interpret the p-value (see Week 5 module), and answer the research question.
(5) Identify the 95% confidence interval for the mean difference, and interpret the result. How does this result relate to the results in parts (3) and (4) Be specific.

Support your conclusion with specific calculations.

Solutions

Expert Solution

Solution:

Part 1.

The histogram for Mid West is the following:

The boxplot for the MidWest is the following:

The null hypothesis is: Mean number of years of education in the Mid West region is equal to 5.5.

The alternative hypothesis is: Mean number of years of education in the Mid West region is different from 5.5.

We perform the test and have the following results tabulated below:

One-Sample Statistics

N

Mean

Std. Deviation

Std. Error Mean

Mid_West

21

4.9524

3.35375

.73185

One-Sample Test

Test Value = 5.5

t

df

Sig. (2-tailed)

Mean Difference

95% Confidence Interval of the Difference

Lower

Upper

Mid_West

-.748

20

.463

-.54762

-2.0742

.9790

Here the P-value is = 0.463>0.05, hence we fail to reject the null hypothesis.

Part2.

The null hypothesis is: Mean number of years of education beyond high school between the South East and West regions are equal.

The alternative hypothesis is: Mean number of years of education beyond high school between the South East and West regions are different.

We perform two independent sample t-test and obtain the following result:

Group 1 = West

Group 2 = SouthEast


Related Solutions

For the data set 1 2 3 4 7 7 7 8 11 12 12 15...
For the data set 1 2 3 4 7 7 7 8 11 12 12 15 15 16 17 17 17 18 20 20 22 24 24 25 26 26 26 26 27 30 32 32 33 34 34 36 38 39 43 44 45 46 47 47 48 51 52 52 53 54 54 54 55 56 58 58 59 61 63 65 65 67 69 70 73 75 75 76 77 77 79 80 81 82 82 (a)...
For the following data set, X: 9, 6, 8, 3, 8, 9, 3, 4, 3, 7:...
For the following data set, X: 9, 6, 8, 3, 8, 9, 3, 4, 3, 7: Calculate: 1. Variance 2. Mode 3. Mean 4. Mean Average Deviation (MAD) about the mean 5. Median
For the following data set [ 1, 4, 3, 6, 2, 7, 18, 3, 7, 2,...
For the following data set [ 1, 4, 3, 6, 2, 7, 18, 3, 7, 2, 4, 3, 5, 3, 7] please compute the following 1. measures of central tendency (3 points) 2. standard deviation ( 5 points) 3. is 18 an outlier? (5 points) 4. describe the shape of the distribution (2 points)
Step 2 Data Set A x 1 2 3 4 5 6 7 y 7 7...
Step 2 Data Set A x 1 2 3 4 5 6 7 y 7 7 7 9 9 9 10 Data Set B x 1 2 3 4 5 6 7 8 9 10 11 y 4 6 6 6 8 9 9 9 10 10 10 Step 2 Find the equation for the least-squares line, and graph the line on the scatter plot. Find the sample correlation coefficient r and the coefficient of determination r2. Is r significant?...
Using the same data… 2 3 4 4 4 6 6 6 7 8 8 9...
Using the same data… 2 3 4 4 4 6 6 6 7 8 8 9 10 10 11 12 16 16 28 46 (d) [5 pts] Determine the 5# summary. (e) Determine the lower and upper fence to determine if there are any outliers. (f) Draw and carefully label a modified boxplot for this data. (g) What is the shape of the distribution (symmetric, skewed left, or skewed right). Explain.
For the data set shown​ below. x   y 3   4 4   5 5   8 7   12...
For the data set shown​ below. x   y 3   4 4   5 5   8 7   12 8   15 Find the estimates of β0 and β1. β0≈b0=-3.256 ​ (Round to three decimal places as​ needed.) β1≈b1=2.233 (Round to three decimal places as​ needed.) (b) Compute the standard​ error, the point estimate for σ. Se=0.5972 ​(Round to four decimal places as​ needed.) ​(c) Assuming the residuals are normally​ distributed, determine sb1. sb1=0.144 ​(Round to three decimal places as​ needed.) ​(d) Assuming the...
5.) For the data set 2 4 4 5 7 8 9 10 12 12 13...
5.) For the data set 2 4 4 5 7 8 9 10 12 12 13 13 16 16 16 16 17 19 19 20 23 24 24 24 25 26 26 27 28 28 29 31 32 34 34 36 37 38 42 44 45 46 47 47 48 50 52 53 53 54 55 56 56 57 58 (a) Find the 80th percentile. The 80t percentile is =    (a) Find the 42nd percentile. The 42nd percentile is...
Which statistic has the greatest value for the set of data 3, 6, 4, 8, 2,...
Which statistic has the greatest value for the set of data 3, 6, 4, 8, 2, 9, 1, 9, 3, 2, 6, 5, 3? mean median mode range interquartile range
Consider the data set. 2, 3, 4, 6, 8 (a) Find the range. (Enter an exact...
Consider the data set. 2, 3, 4, 6, 8 (a) Find the range. (Enter an exact number.) (b) Use the defining formula to compute the sample standard deviation s. (Enter a number. Round your answer to two decimal places.) (c) Use the defining formula to compute the population standard deviation σ. (Enter a number. Round your answer to two decimal places.)
Consider the following set of observations: Obs. 1 2 3 4 5 6 7 8 9...
Consider the following set of observations: Obs. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 input 1 2 3 4 5 6 7 8 9 10 11 12 13 14 result 1 2 3 5 8 13 21 34 55 89 144 233 377 610 Enter the data in L1 and L2 in your TI calculator, find the regression line, and construct a scatterplot with the regression line included. Does a line appear to...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT