Question

Week 5 Extra Credit Assignment

Collect two sets of data from two different related/similar populations (for example: Male & female).

1) Test a claim about the population mean for one of the sets.
2) Test a claim about the population standard deviation for one of the sets.
3) Test a claim about both population standard deviations.
4) Test a claim about both population means (for example, that the mean of one population equals the mean of the other).


You are responsible for coming up with the claims and picking your alpha.

Solutions

Expert Solution

Let's suppose we have two datasets of human body temperature. Here are the summary statistics for each.

Dataset 1: sample size $n_1 = 101$, sample mean $\bar{x}_1 = 97.89$ degrees, sample standard deviation $s_1 = 0.73$ degrees

Dataset 2: sample size $n_2 = 101$, sample mean $\bar{x}_2 = 97.53$ degrees, sample standard deviation $s_2 = 0.69$ degrees

1)

Let's test the claim that the population mean of the first dataset ($\mu_1$) is less than 98.6 degrees.

For this we will do a one-tailed (left-tailed) test.

First we form the hypotheses. Null hypothesis: $H_0: \mu_1 = 98.6$ degrees

Alternative hypothesis: $H_a: \mu_1 < 98.6$ degrees

Now that the hypotheses are formed, we calculate the test statistic $t$:

$t = \dfrac{\bar{x}_1 - \mu_0}{s_1/\sqrt{n_1}}$

standard error $= s_1/\sqrt{n_1} = 0.73/\sqrt{101} = 0.0726$

$t = (97.89 - 98.6)/0.0726 \approx -9.77$

Now we calculate the p-value, assuming the null hypothesis is true.

For this, we look up the area below $t = -9.77$ in the t-distribution table with $df = n_1 - 1 = 100$ degrees of freedom.

The p-value is far below 0.001, which is less than the significance level $\alpha = 0.05$.

So we reject the null hypothesis in favor of the alternative: the data support the claim that the population mean is less than 98.6 degrees.
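The arithmetic for part 1 can be checked with a short script. This is a minimal sketch working from the summary statistics only; the function name `one_sample_t` is made up for illustration, and the critical value 1.660 is taken from a standard t-table (one-tailed, alpha = 0.05, df = 100) rather than computed.

```python
import math

def one_sample_t(sample_mean, mu0, s, n):
    """Return (standard error, t statistic) for a one-sample t test
    computed from summary statistics."""
    se = s / math.sqrt(n)
    t = (sample_mean - mu0) / se
    return se, t

# Summary statistics for Dataset 1 and the hypothesized mean 98.6
se, t = one_sample_t(sample_mean=97.89, mu0=98.6, s=0.73, n=101)

# Assumed table value: one-tailed critical t for alpha = 0.05, df = 100
T_CRIT = 1.660
reject_h0 = t < -T_CRIT  # left-tailed rejection rule

print(f"standard error = {se:.4f}")
print(f"t = {t:.2f}")
print(f"reject H0: {reject_h0}")
```

Comparing $t$ with the critical value gives the same decision as the p-value approach, since $t$ falls far into the left rejection region.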

2)

For testing a claim about a standard deviation, the test statistic follows a chi-square distribution with $n - 1$ degrees of freedom: $\chi^2 = \dfrac{(n-1)s^2}{\sigma_0^2}$

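Let the claim be that the population standard deviation of the first dataset ($\sigma_1$) is at least 0.7 degrees.

So, null hypothesis: $H_0: \sigma_1 = 0.7$ (the boundary of the claim $\sigma_1 \ge 0.7$)

Alternative hypothesis: $H_a: \sigma_1 < 0.7$

Now, for significance level $\alpha = 0.05$,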

$\chi^2 = (101-1) \times 0.73^2/0.7^2 = 108.76$

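Because the alternative hypothesis is "less than 0.7", this is a left-tailed test, so the p-value is the area below 108.76 under the chi-square distribution with 100 degrees of freedom. From the chi-square table this is about 0.742 (the right-tail area is about 0.258).

Since the p-value > 0.05, we fail to reject the null hypothesis. There is not enough evidence to reject the claim that the standard deviation is at least 0.7 degrees.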
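The chi-square statistic and its tail areas can be checked as follows. This is a sketch using only the standard library: instead of a chi-square table it uses the Wilson-Hilferty normal approximation to the chi-square distribution, which is an assumption of this example and is only approximate. The helper names are made up for illustration.

```python
import math
from statistics import NormalDist

def chi_square_stat(s, sigma0, n):
    """Chi-square test statistic (n - 1) * s^2 / sigma0^2."""
    return (n - 1) * s**2 / sigma0**2

def chi_square_cdf_approx(x, df):
    """Approximate P(chi-square_df <= x) with the Wilson-Hilferty
    cube-root normal approximation (not an exact CDF)."""
    mean = 1 - 2 / (9 * df)
    sd = math.sqrt(2 / (9 * df))
    z = ((x / df) ** (1 / 3) - mean) / sd
    return NormalDist().cdf(z)

chi2 = chi_square_stat(s=0.73, sigma0=0.7, n=101)
p_left = chi_square_cdf_approx(chi2, df=100)  # p-value for Ha: sigma < 0.7
p_right = 1 - p_left                          # complementary right-tail area

print(f"chi-square = {chi2:.2f}")
print(f"left-tail area  ~ {p_left:.3f}")
print(f"right-tail area ~ {p_right:.3f}")
```

The left-tail area comes out near 0.74 and the right-tail area near 0.26. Either way the result is well above 0.05, so the decision not to reject the null hypothesis does not depend on which tail is read.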

3)

Let the claim be that both populations have the same standard deviation, so $\sigma_1^2/\sigma_2^2 = 1$.

Sample variances are related to the chi-square distribution, and the ratio of two sample variances follows an F-distribution.

Test statistic: $F = s_1^2/s_2^2$

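If the first sample has the smaller variance, $F < 1$ and we look at the left tail of the F-distribution; otherwise $F > 1$ and we look at the right tail.

So, with significance level $\alpha = 0.05$,

$F = 0.73^2/0.69^2 = 1.1193$, with $(n_1 - 1, n_2 - 1) = (100, 100)$ degrees of freedom.

Since the first sample has the larger variance, we look at the right tail.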

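Looking at the F-distribution table, the right-tail area is 0.287. For the two-sided alternative $\sigma_1 \ne \sigma_2$, the p-value is twice that, about 0.57.

Either way the p-value > 0.05, so we fail to reject the null hypothesis. There is not enough evidence to reject the claim that both populations have the same standard deviation.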
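A quick check of the F statistic and its tail area is sketched below. It uses only the standard library and, in place of an F-table, Fisher's z approximation (half the log of F is roughly normal for large degrees of freedom). That approximation is an assumption of this example, and the helper name is made up for illustration.

```python
import math
from statistics import NormalDist

def f_right_tail_approx(f, d1, d2):
    """Approximate P(F_{d1,d2} > f) using Fisher's z transformation:
    0.5 * ln(F) is roughly normal when d1 and d2 are large."""
    mean = 0.5 * (1 / d2 - 1 / d1)
    sd = math.sqrt(0.5 * (1 / d1 + 1 / d2))
    z = (0.5 * math.log(f) - mean) / sd
    return 1 - NormalDist().cdf(z)

s1, s2 = 0.73, 0.69
n1, n2 = 101, 101

F = s1**2 / s2**2                      # larger sample variance in the numerator
p_right = f_right_tail_approx(F, n1 - 1, n2 - 1)
p_two_sided = min(1.0, 2 * p_right)    # for Ha: sigma1 != sigma2

print(f"F = {F:.4f}")
print(f"right-tail area ~ {p_right:.3f}")
print(f"two-sided p     ~ {p_two_sided:.3f}")
```

Both the one-tail area and the doubled two-sided value are far above 0.05, so the conclusion is the same under either convention.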

4)

Let the claim be that both populations have the same mean.

So, null hypothesis: $H_0: \mu_1 = \mu_2$

Alternative hypothesis: $H_a: \mu_1 \ne \mu_2$

Test statistic: $t = \dfrac{(\bar{x}_1 - \bar{x}_2) - d_0}{s_p\sqrt{1/n_1 + 1/n_2}}$ with $n_1 + n_2 - 2$ degrees of freedom, where $d_0$ is the difference between the population means under the null hypothesis, so $d_0 = 0$.

$s_p^2 = \dfrac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1+n_2-2}$ is called the pooled variance.

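So $s_p^2 = (100 \times 0.73^2 + 100 \times 0.69^2)/(101+101-2) = 0.5045$, and $s_p = \sqrt{0.5045} = 0.7103$.

With $\sqrt{1/101 + 1/101} = 0.1407$, $t = ((97.89-97.53)-0)/(0.7103 \times 0.1407) = 0.36/0.0999 \approx 3.60$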

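With 200 degrees of freedom the t-distribution is very close to the standard normal, so looking at the normal distribution table for a two-tailed test, p-value < 0.001.

So the null hypothesis is rejected at $\alpha = 0.05$: the data indicate that the population means of the two datasets are not equal.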
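The pooled two-sample calculation can be verified with the sketch below, again from summary statistics only. Note that the denominator uses the pooled standard deviation, which is the square root of the pooled variance 0.5045. The p-value uses the normal approximation to the t-distribution (reasonable here with 200 degrees of freedom, but still an assumption of this example), and the function name `pooled_t` is made up for illustration.

```python
import math
from statistics import NormalDist

def pooled_t(x1, s1, n1, x2, s2, n2, d0=0.0):
    """Pooled-variance two-sample t statistic from summary statistics.
    Returns (pooled variance, pooled sd, standard error, t)."""
    sp2 = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2)
    sp = math.sqrt(sp2)                    # pooled standard deviation
    se = sp * math.sqrt(1 / n1 + 1 / n2)   # standard error of the difference
    t = ((x1 - x2) - d0) / se
    return sp2, sp, se, t

sp2, sp, se, t = pooled_t(97.89, 0.73, 101, 97.53, 0.69, 101)

# Two-tailed p-value via the normal approximation (df = 200 is large)
p_two_sided = 2 * (1 - NormalDist().cdf(abs(t)))

print(f"pooled variance = {sp2:.4f}, pooled sd = {sp:.4f}")
print(f"standard error  = {se:.4f}")
print(f"t = {t:.2f}, two-tailed p ~ {p_two_sided:.5f}")
```

With these inputs the statistic comes out at about 3.60 and the approximate two-tailed p-value is below 0.001, so the null hypothesis of equal means is rejected at the 0.05 level.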

