Question

In: Statistics and Probability

Learn by Doing Matched Pairs: In this lab you will learn how to conduct a matched...

Learn by Doing

Matched Pairs: In this lab you will learn how to conduct a matched pairs T-test for a population mean using StatCrunch. We will work with a data set that has historical importance in the development of the T-test.

Paired T hypothesis test:

μ_D = μ₁ - μ₂ : Mean of the difference between Regular seed and Kiln-dried seed
H₀ : μ_D = 0
H_A : μ_D > 0
Hypothesis test results:

Difference	Mean	Std. Err.	DF	T-Stat	P-value
Regular seed - Kiln-dried seed	-33.727273	19.951346	10	-1.6904761	0.9391

Some features of this activity may not work well on a cell phone or tablet. We highly recommend that you complete this activity on a computer.

Here are the directions, grading rubric, and definition of high-quality feedback for the Learn by Doing discussion board exercises.

A list of StatCrunch directions is provided at the bottom of this page.

Context

Gosset's Seed Plot Data

William S. Gosset was employed by the Guinness brewing company of Dublin. Sample sizes available for experimentation in brewing were necessarily small. At that time, Gosset contacted a famous statistician Karl Pearson (1857-1936) and was told that there were no techniques for developing probability models for small data sets. Gosset studied under Pearson, and the outcome of his study was perhaps the most famous paper in statistical literature, "The Probable Error of a Mean" (1908), which introduced the T-distribution.

Since Gosset was employed by Guinness, any work he produced would be owned by Guinness, so he published under a pseudonym, "Student"; hence, the T-distribution is often referred to as Student's T-distribution.

To illustrate his analysis, Gosset used the results of seeding 11 different plots of land with two different types of seed: regular and kiln-dried. He wanted to determine if drying seeds before planting increased plant yield. Since different plots of soil may be naturally more fertile, this confounding variable was eliminated by using the matched pairs design and planting both types of seed in all 11 plots.

The resulting data (corn yield in pounds per acre) are as follows.

Plot	Regular seed	Kiln-dried Seed
1	1903	2009
2	1935	1915
3	1910	2011
4	2496	2463
5	2108	2180
6	1961	1925
7	2060	2122
8	1444	1482
9	1612	1542
10	1316	1443
11	1511	1535

We use these data to test the hypothesis that kiln-dried seed yields more corn than regular seed.

Because of the nature of the experimental design (matched pairs), we are testing the difference in yield.

Plot	Regular seed	Kiln-dried Seed	Difference
1	1903	2009	–106
2	1935	1915	20
3	1910	2011	–101
4	2496	2463	33
5	2108	2180	–72
6	1961	1925	36
7	2060	2122	–62
8	1444	1482	–38
9	1612	1542	70
10	1316	1443	–127
11	1511	1535	–24

Note that the differences were calculated: regular − kiln-dried.

Variables

Regular seed: regular seeds that were traditionally used for planting
kiln-dried: seed that were kiln-dried before planting

Data

Download the seed (Links to an external site.) data file, and then upload the file into StatCrunch.

Prompt

State the hypotheses and define the parameter.
Checking conditions: Since Gosset invented the T-distribution, we will assume that his sample meets the conditions and proceed with the T-test. Regardless, answer these questions to demonstrate your understanding of the conditions for use of the T-model.

But first you will need to review the dotplots for the data (opens in a new tab).
1. Which graph is used to check conditions? Why?
2. What do we look for in the graph to verify that conditions are met?
3. What else do we need to know about the sample of seeds before using the T-test?
Use StatCrunch to find the T-score and the P-value. Hint: as you work through the StatCrunch directions, keep in mind that we want to calculate the differences as regular − kiln-dried . So you will choose Regular seed for Sample 1 and kiln-dried seed for Sample 2. (directions)
Copy and paste the information in the StatCrunch output window into your initial post.
State a conclusion based on the context of this scenario.

EXAMPLE TO RIGHT ANSWER

1. Ho: μ=0

Ha: μ>0

The average difference is -33.73

2. a) We use the graph of the differences because that is what we are analyzing.

b) We look to see if the graph is normally distributed, not skewed, and doesn't have outliers.

c) We don't know if the data is randomly selected.

3.

Paired T hypothesis test:

μ_D = μ₁ - μ₂ : Mean of the difference between Regular seed and Kiln-dried seed
H₀ : μ_D = 0
H_A : μ_D > 0
Hypothesis test results:

Difference	Mean	Std. Err.	DF	T-Stat	P-value
Regular seed - Kiln-dried seed	-33.727273	19.951346	10	-1.6904761	0.9391

Differences stored in column, Differences.

4. Based on the P-value of 0.9391, we do not have enough evidence to reject the null hypothesis. There is no statistically significant evidence to show that kiln-dried seeds yield more than regular seeds.

Expert Solution

A normal probability plot of the difference will be used to check the normality of data.

The normality condition is met.

We look to see if the graph is normally distributed, not skewed, and doesn't have outliers.

μD = μ1 - μ2 : Mean of the difference between Regular seed and Kiln-dried seed
H0 : μD = 0
HA : μD > 0

Based on the P-value of 0.9391, we do not have enough evidence to reject the null hypothesis. There is no statistically significant evidence to show that kiln-dried seeds yield more than regular seeds.

Plot	Regular seed	Kiln-dried Seed	Difference
1	1903	2009	–106
2	1935	1915	20
3	1910	2011	–101
4	2496	2463	33
5	2108	2180	–72
6	1961	1925	36
7	2060	2122	–62
8	1444	1482	–38
9	1612	1542	70
10	1316	1443	–127
11	1511	1535	–24

	1,841.45	mean Regular seed
	1,875.18	mean Kiln-dried Seed
	-33.727	mean difference (Regular seed - Kiln-dried Seed)
	66.171	std. dev.
	19.951	std. error
	11	n
	10	df

	-1.690	t
	.9391	p-value (one-tailed, upper)

orchestra answered 3 years ago

Using the concept of of "Matched Pairs t Procedures" , describe how you would construct a...

Using the concept of of "Matched Pairs t Procedures" , describe how you would construct a comparison between two Automotive Repair facilities to see if there was a better alternative, in general. This means you need to determine sample size, how the the design would be implemented, and what the hypotheses are. This is similar to Round Robin testing in a laboratory or a "pretest/post test" study at the university. This discussion should be more than a single paragraph and...

You are performing a left-tailed matched-pairs test with 29 pairs of data. If α = .005...

You are performing a left-tailed matched-pairs test with 29 pairs of data. If α = .005 , find the critical value, to two decimal places.

Analyze the advantages and disadvantages of using a matched-subjects (matched pairs) design instead of a typical...

Analyze the advantages and disadvantages of using a matched-subjects (matched pairs) design instead of a typical between subjects design.

The Problem Statement In this lab section, you are going to learn how to sort an...

The Problem Statement In this lab section, you are going to learn how to sort an array of integers, and to sort an array of objects. We are going to divide the work into two parts. Part 1. Sorting an array of integers using selection sort In this part, you are given the code (see List 1) for sorting an array of integers into ascending order. The sorting method used is the selection sort. You can cut-and-paste the code into...

For which of the following research designs would a matched pairs test be ideal? a) You...

For which of the following research designs would a matched pairs test be ideal? a) You have data on the mean income of residents in 45 different cities from the 2010 census and income information on the same cities in 2018 and you want to see if average incomes have increased from 2010 to 2018 b) You have data on how a random sample of voters plan to vote before the most recent debate, and how another random sample in...

In this lab you will learn how to use methods from the Math class in order...

In this lab you will learn how to use methods from the Math class in order to calculate the area or the volume of several different shapes. If you are confused about the Methods you can access from the Math class and would like to see some examples click here. Hint: Most of these methods can be done in one line. Step 1 - circleArea In this method you will calculate the area of a circle. Before you can calculate...

difference between a repeated measures design and a matched pairs design?

Explain the difference between a matched pairs experimental design and a randomized ex- perimental design. How...

Explain the difference between a matched pairs experimental design and a randomized ex- perimental design. How does this difference affect the statistical tests that we perform on the data gathered?

In this lab, you will learn how to create shell scripts using the Bourne Shell Scripting...

In this lab, you will learn how to create shell scripts using the Bourne Shell Scripting language. The student will have to do research on the Bourne Shell Scripting language and then writ a script that asks the First Name , Last Name, Age, and Country of origin of the student and then print out these items in a statement Then the students will write a 3 to 4-page paper (not including the title and references pages) describing how the...

the concept of "Matched Pairs t Procedures" was introduced. Since all you drive and possibly have...

the concept of "Matched Pairs t Procedures" was introduced. Since all you drive and possibly have had to have your car repaired at one time or another due to an accident, I would like you to describe how you would construct a comparison between two Automotive Repair facilities to see if there was a better alternative, in general. (Hint: The side of your car has been damaged and you take it to two different shops; I doubt you would get...