Question

The National Center of Education Statistics conducted a survey of high school seniors, collecting test data on reading, writing, and several other subjects. Here we examine a simple random sample of 200 students from this survey. Side-by-side box plots of reading and writing scores as well as a histogram of the differences in scores are shown below.

[Figure: side-by-side box plots of reading and writing scores, and a histogram of the differences in scores (read - write).]

(a) Is there a clear difference in the average reading and writing scores?
(b) Are the reading and writing scores of each student independent of each other?
(c) Create hypotheses appropriate for the following research question: is there an evident difference in the average scores of students in the reading and writing exam?
(d) Check the conditions required to complete this test.
(e) The average observed difference in scores is ${\bar x_{read - write}} = 0.545$, and the standard deviation of the differences is 8.887 points. Do these data provide convincing evidence of a difference between the average scores on the two exams?
(f) What type of error might we have made? Explain what the error means in the context of the application.
(g) Based on the results of this hypothesis test, would you expect a confidence interval for the average difference between the reading and writing scores to include 0? Explain your reasoning.

Expert Solution

Concepts and reason

Paired t test: The means of two measurements taken on the same group are compared (for example, at two different times or on two different exams). That is, the samples are dependent.

If two treatments are measured or compared on the same observational units rather than on two separate groups, the design is called a paired design. In other words, a matched pairs design is one in which the treatments are assigned randomly to the units and each observational unit in the study receives both treatments.

Assumptions:

• Dependent variable is measured on a continuous scale.

• Each score in one sample is paired with a particular score in the other sample.

• The differences between the paired scores follow an approximately normal distribution.

Rejection rule:

If $p{\rm{ - value}} \le \alpha$, then reject the null hypothesis ${H_0}$.

Confidence interval: A range of values that is expected to contain the population parameter at the given confidence level is termed a confidence interval. In other words, it is an interval estimate of the population parameter, calculated from the data using a point estimate and the chosen confidence level.

Moreover, the confidence level indicates how often intervals constructed in this way would contain the population parameter. Usually, the confidence level is denoted by $\left( {1 - \alpha } \right)100\%$. The value is chosen by the researcher. Some of the most common confidence levels are 90%, 95%, and 99%.

The margin of error is a statistic that expresses the amount of sampling error in a study. It gives the largest amount by which the estimate is expected to differ from the population value at the given confidence level.

P-value: The probability of obtaining a value of the test statistic at least as extreme as the observed one, when the null hypothesis is true, is called the P-value.

Type I Error: Rejecting the null hypothesis when it is true is called a type I error. The probability of a type I error is the level of significance, denoted $\alpha$.

Type II Error: Failing to reject the null hypothesis when the alternative is true is called a type II error. The probability of a type II error is denoted $\beta$.

Fundamentals

The formula for paired t test is given below:

$t = \frac{{{{\bar x}_d} - {\mu _d}}}{{\frac{{{S_d}}}{{\sqrt n }}}}$

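Where ${\bar x_d}$ denotes the sample mean of the differences, ${\mu _d}$ denotes the hypothesized population mean difference, ${S_d}$ denotes the sample standard deviation of the differences, and n is the number of pairs.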

Degrees of freedom: $df = n - 1$
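
As a quick check of the formula above, here is a minimal Python sketch that evaluates the test statistic from summary statistics. The function name `paired_t_statistic` and its parameter names are illustrative assumptions, not part of the original solution; the numbers are the ones given in part (e) of the question.

```python
import math

def paired_t_statistic(mean_diff, sd_diff, n, mu0=0.0):
    """Return (t, df) for a paired t test computed from summary statistics.

    mean_diff: sample mean of the paired differences
    sd_diff:   sample standard deviation of the paired differences
    n:         number of pairs
    mu0:       hypothesized mean difference under the null hypothesis
    """
    standard_error = sd_diff / math.sqrt(n)
    t = (mean_diff - mu0) / standard_error
    return t, n - 1

# Values taken from part (e) of the question.
t_stat, df = paired_t_statistic(mean_diff=0.545, sd_diff=8.887, n=200)
print(f"t = {t_stat:.2f}, df = {df}")  # t = 0.87, df = 199
```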

The formula for the confidence interval for the difference in means is,

$\begin{array}{c}{\rm{Confidence}}\,{\rm{interval}} = {{\bar x}_d} \pm {t_{\frac{\alpha }{2},n - 1}}\left( {\frac{{{S_d}}}{{\sqrt n }}} \right)\\ = {{\bar x}_d} \pm {\rm{Margin}}\,{\rm{of}}\,{\rm{error}}\left( E \right)\end{array}$

Where ${\bar x_d}$ is the sample mean of the differences, ${S_d}$ is the sample standard deviation of the differences, and n is the sample size.

The general conditions to perform the paired t-test are as follows:

• Dependent variable is measured on a continuous scale.

• Each score in one sample is paired with a particular score in the other sample.

• The differences between the paired scores follow an approximately normal distribution.

Rejection rule using p-value:

If $p{\rm{ - value}} \le \alpha$, then reject the null hypothesis.

If $p{\rm{ - value}} > \alpha$, then do not reject the null hypothesis.

Rejection rule based on confidence interval:

• If the confidence interval contains the value zero, then the null hypothesis is not rejected.

• If the confidence interval does not contain the value zero, then the null hypothesis is rejected.

(1.a)

From the box plots of reading and writing scores, there is no clear difference in the average scores: the two distributions have similar centers and spreads and overlap substantially. This suggests that the average difference is close to zero.

(2.b)

The reading scores and the writing scores are taken from each student. That is, each student is measured on both reading and writing. This indicates that the reading and writing scores of a student are not independent of each other.

(3.c)

The hypotheses are stated below:

Let ${\mu _d}$ be the population mean of the differences between students' reading and writing scores.

Null hypothesis:

${H_0}:{\mu _d} = 0$

Alternative hypothesis:

${H_a}:{\mu _d} \ne 0$

(4.d)

In the given study, the average reading and writing scores of the same students are compared.

• The reading and writing scores are compared on the same students. This implies that the samples are dependent.

• Each student's reading score is paired with that student's writing score.

• The students form a simple random sample, so the differences for different students are independent of each other.

• The differences in the reading and writing scores are approximately normal, because the histogram of the differences (read - write) is roughly symmetric.

(5.e)

Instructions to find the test statistic and p-value by using MINITAB:

1. Choose Stat > Basic Statistics > Paired t.

2. Choose Summarized data.

3. Enter Sample size as 200, Mean as 0.545, Standard deviation as 8.887.

4. Choose Options.

5. In Confidence level, enter 95.

6. In Alternative, select not equal.

7. Click OK.

Follow the above instructions to get the test statistic:

From MINITAB output, the value of test statistic is 0.87 and the p-value is 0.387.
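
The MINITAB figures can be cross-checked without statistical software. The following is a minimal sketch, not the method used in the original solution: it uses only the Python standard library and approximates the area under the Student's t density with Simpson's rule. The helper names `t_pdf` and `two_sided_p_value` and the step count are assumptions made for illustration.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))

def two_sided_p_value(t, df, steps=2000):
    """Two-sided p-value: 1 minus the area between -|t| and |t|.

    The area on [0, |t|] is approximated by Simpson's rule (steps must be
    even) and doubled, using the symmetry of the t density.
    """
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * total * h / 3
    return 1 - central_area

# Summary statistics from part (e): mean 0.545, sd 8.887, n = 200.
t_stat = 0.545 / (8.887 / math.sqrt(200))
p_value = two_sided_p_value(t_stat, df=199)
print(f"t = {t_stat:.2f}, p-value = {p_value:.3f}")
```

If the sketch is correct, the printed values should land close to the MINITAB output quoted above.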

The conclusion is stated below:

Use the significance level 0.05.

The p-value is 0.387 and the level of significance is 0.05.

That is, $p{\rm{ - value}}\left( { = 0.387} \right) > \alpha \left( {0.05} \right)$ .

By the rejection rule, do not reject the null hypothesis.

Therefore, it can be concluded that the data do not provide convincing evidence of a difference in the average scores of students in the reading and writing exam.

(6.f.1)

From the information in part 5.e, the null hypothesis is not rejected.

Since the null hypothesis is not rejected, there is a chance that a false null hypothesis was not rejected. In this situation, the error that may have occurred is a type II error.

(6.f.2)

The result of the study indicates that the null hypothesis is not rejected. That is, no significant difference was found in the average reading and writing scores.

If a type II error was made, the result states that there is no significant difference in the average reading and writing scores when there actually is a difference in the average reading and writing scores.

(7.g)

The result states that the null hypothesis is not rejected at the 0.05 significance level. When a two-sided test does not reject the null hypothesis, the corresponding 95% confidence interval contains the value zero, based on the rejection rule for the confidence interval.
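
To illustrate this reasoning, here is a minimal sketch that builds the 95% confidence interval from the summary statistics in part (e) and checks whether it contains zero. It is an illustration under stated assumptions rather than the original solution's method: the critical value is found by bisection on a numerically integrated t distribution, and the helper names `t_pdf`, `t_cdf`, and `t_quantile` are hypothetical.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """P(T <= x), using Simpson's rule on [0, |x|] plus symmetry."""
    upper = abs(x)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    half_area = total * h / 3
    return 0.5 + half_area if x >= 0 else 0.5 - half_area

def t_quantile(prob, df):
    """Critical value with P(T <= value) = prob, for prob in (0.5, 1)."""
    lo, hi = 0.0, 50.0
    for _ in range(60):  # bisection on the monotone CDF
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < prob:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Summary statistics from part (e).
n, mean_diff, sd_diff = 200, 0.545, 8.887
t_star = t_quantile(0.975, n - 1)          # critical value for 95% confidence
margin = t_star * sd_diff / math.sqrt(n)   # margin of error E
lower, upper = mean_diff - margin, mean_diff + margin
contains_zero = lower <= 0 <= upper
print(f"95% CI: ({lower:.3f}, {upper:.3f}); contains 0: {contains_zero}")
```

The interval uses the same standard error and degrees of freedom as the test, which is why the two-sided test at the 0.05 level and the 95% interval lead to the same decision.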

Ans: Part 1.a

Thus, no, there is no clear difference in the average reading and writing scores.

Part 2.b

Thus, no, the reading and writing scores of each student are not independent of each other.

Part 3.c

Thus, the Null hypothesis is ${H_0}:{\mu _d} = 0$ and alternative hypothesis is ${H_a}:{\mu _d} \ne 0$

Part 4.d

Yes, the conditions are satisfied to complete the test.

Part 5.e

Thus, do not reject the null hypothesis: the data do not provide convincing evidence of a difference in the average scores of students in the reading and writing exam.

Part 6.f.1

Thus, it is possible that a type II error was made.

Part 6.f.2

The error in the context of the study is concluding that there is no difference in the average reading and writing scores when there actually is a difference.

Part 7.g

Based on the results of the hypothesis test, a confidence interval for the average difference between the reading and writing scores would be expected to include 0, because the null hypothesis was not rejected.

milcah answered 3 years ago
