Question

In: Statistics and Probability

explain various unknown methods of non parametric tests (NPT) with their distinct role and applications. Also...

explain various unknown methods of non parametric tests (NPT) with their distinct role and applications. Also state the formula for each of the Three NPT

Solutions

Expert Solution

non-parametric statistics:

To a statistician, a parameter is a measurable characteristic of a population. The population characteristics that usually interest statisticians are the location and the shape. Non-parametric statistics are used when the parameters of the population are not measurable or do not meet certain standards. In cases when the data only order the observations, so that the interval between the observations is unknown, neither a mean nor a variance can be meaningfully computed. In such cases, you need to use non-parametric tests. Because your sample does not have cardinal, or interval, data, you cannot use it to estimate the mean or variance of the population, though you can make other inferences. Even if your data are cardinal, the population must be normal before the shape of the many sampling distributions are known. Fortunately, even if the population is not normal, such sampling distributions are usually close to the known shape if large samples are used. In that case, using the usual techniques is acceptable. However, if the samples are small and the population is not normal, you have to use non-parametric statistics. As you know, “there is no such thing as a free lunch”. If you want to make an inference about a population without having cardinal data, or without knowing that the population is normal, or with very small samples, you will have to give up something. In general, non-parametric statistics are less precise than parametric statistics. Because you know less about the population you are trying to learn about, the inferences you make are less exact.

When either (1) the population is not normal and the samples are small, or (2) when the data are not cardinal, the same non-parametric statistics are used. Most of these tests involve ranking the members of the sample, and most involve comparing the ranking of two or more samples. Because we cannot compute meaningful sample statistics to compare to a hypothesized standard, we end up comparing two samples.

1)

Mann-Whitney U-test:

“The t-Test”, you learned how to test to see if two samples came from populations with the same mean by using the t-test. If your samples are small and you are not sure if the original populations are normal, or if your data do not measure intervals, you cannot use that t-test because the sample t-scores will not follow the sampling distribution in the t-table. Though there are two different data problems that keep you from using the t-test, the solution to both problems is the same, the non-parametric Mann-Whitney U-test. The basic idea behind the test is to put the samples together, rank the members of the combined sample, and then see if the two samples are mixed together in the common ranking.

The modules on hypothesis testing presented techniques for testing the equality of means in two independent samples. An underlying assumption for appropriate use of the tests described was that the continuous outcome was approximately normally distributed or that the samples were sufficiently large (usually n1> 30 and n2> 30) to justify their use based on the Central Limit Theorem. When comparing two independent samples when the outcome is not normally distributed and the samples are small, a nonparametric test is appropriate.

A popular nonparametric test to compare outcomes between two independent groups is the Mann Whitney U test. The Mann Whitney U test, sometimes called the Mann Whitney Wilcoxon Test or the Wilcoxon Rank Sum Test, is used to test whether two samples are likely to derive from the same population (i.e., that the two populations have the same shape). Some investigators interpret this test as comparing the medians between the two populations. Recall that the parametric test compares the means (H0: μ1=μ2) between independent groups.

In contrast, the null and two-sided research hypotheses for the nonparametric test are stated as follows:

H0: The two populations are equal versus

H1: The two populations are not equal.

This test is often performed as a two-sided test and, thus, the research hypothesis indicates that the populations are not equal as opposed to specifying directionality. A one-sided research hypothesis is used if interest lies in detecting a positive or negative shift in one population as compared to the other. The procedure for the test involves pooling the observations from the two samples into one combined sample, keeping track of which sample each observation comes from, and then ranking lowest to highest from 1 to n1+n2, respectively.

U1=n1n2+[n1(n1+1)]/2−T1

where

T1 = the sum of the ranks of group 1

n1 = the number of members of the sample from group 1

n2 = the number of members of the sample from group 2

2)

Hypothesis of 1 sample Wilcoxon Signed test

For the left-tailed test:

  • Null Hypothesis H0: The hypothesized sample median is equal to theoretical value
  • Alternative Hypothesis : H1: The hypothesized sample median is less than the theoretical value

For right-tailed test:

  • Null Hypothesis H0: The hypothesized sample median is equal to theoretical value
  • Alternative Hypothesis : H1: The hypothesized sample median is greater than the theoretical value

Assumptions of the one sample Wilcoxon test

  • Differences between the data value and the hypothesized median are continuous
  • Data follows symmetric distribution
  • Observations are mutually independent to each other
  • Measurement scale is at least interval

Procedure to execute One Sample Wicoxon Non Parametric Hypothesis Test

  • Identify the difference between each individual value and the median
  • If the difference of individual value and median is zero, ignore it.
  • Ignore the signs of the difference values and assign lowest rank to the smallest difference value. If the values have tied, then consider the mean value.
  • Compute the sum of ranks of positive difference values, and negative difference values (W+ and W-)
  • If the values are (>20), the normal approximation would be

Where t is the ranks of tied values

  • Calculate the z-value using

  • Compare the test statistic, W, with the critical value in the tables; the null hypothesis can be rejected if W is less than or equal to the critical value.
  • Now, compare the test statics with critical value in the tables, make a decision, the null hypothesis will be rejected if the test statistic ,W, is less than or equal to the critical value
  • Interpret the decision in the context of the original claim.

3)

Kruskal-Wallis test:

The Kruskal-Wallis test is a nonparametric (distribution free) test, and is used when the assumptions of one-way ANOVA are not met. Both the Kruskal-Wallis test and one-way ANOVA assess for significant differences on a continuous dependent variable by a categorical independent variable (with two or more groups). In the ANOVA, we assume that the dependent variable is normally distributed and there is approximately equal variance on the scores across groups. However, when using the Kruskal-Wallis Test, we do not have to make any of these assumptions. Therefore, the Kruskal-Wallis test can be used for both continuous and ordinal-level dependent variables. However, like most non-parametric tests, the Kruskal-Wallis Test is not as powerful as the ANOVA.

Null hypothesis: Null hypothesis assumes that the samples (groups) are from identical populations.

Alternative hypothesis: Alternative hypothesis assumes that at least one of the samples (groups) comes from a different population than the others.

Example questions answered:

How do test scores differ between the different grade levels in elementary school?

Do job satisfaction scores differ by race?

The distribution of the Kruskal-Wallis test statistic approximates a chi-square distribution, with k-1 degrees of freedom, if the number of observations in each group is 5 or more. If the calculated value of the Kruskal-Wallis test is less than the critical chi-square value, then the null hypothesis cannot be rejected. If the calculated value of Kruskal-Wallis test is greater than the critical chi-square value, then we can reject the null hypothesis and say that at least one of the samples comes from a different population.

Assumptions

  1. We assume that the samples drawn from the population are random.
    2. We also assume that the observations are independent of each other.
    3. The measurement scale for the dependent variable should be at least ordinal.

Related Pages:

  • Sign Test
  • ANOVA
  • Wilcoxon Sign Test
  1. Rank all data from all groups together; i.e., rank the data from 1 to N ignoring group membership. Assign any tied values the average of the ranks they would have received had they not been tied.
  2. The test statistic is given by:

      where:

    • is the number of observations in group
    • is the rank (among all observations) of observation from group
    • is the total number of observations across all groups
    • is the average rank of all observations in group
    • is the average of all the

Related Solutions

explain some distinguishing feature of non parametric tests?
explain some distinguishing feature of non parametric tests?
Explain the difference between parametric Vs non parametric methods. Give examples to explain to your answer....
Explain the difference between parametric Vs non parametric methods. Give examples to explain to your answer. Please provide a detailed answer.
How to understand and compute non parametric tests
How to understand and compute non parametric tests
What are the key differences between parametric and non-parametric tests? Provide one example of a parametric...
What are the key differences between parametric and non-parametric tests? Provide one example of a parametric test and one example of a non-parametric test.
An advantage of non-parametric tests includes: a. Good power compared to parametric tests b. Set up...
An advantage of non-parametric tests includes: a. Good power compared to parametric tests b. Set up to test hypotheses and estimate effect size c. Very few assumptions for the distribution of the data d. Allows for analysis of large continuous scale data sets
Question: a)Distinguish between the following: i) Non- Parametric Methods ii) Semi-Parametric Methods iii) Parametric Methods b)...
Question: a)Distinguish between the following: i) Non- Parametric Methods ii) Semi-Parametric Methods iii) Parametric Methods b) Discuss the following statistical properties of asset returns: i) Heavy tails ii) Ergodicity iii)Autocorrelation -absence of linear autocorrelation C)Explain the following Diagnostic tests of the error term i) White Test of heteroscedasticity ii)Normality Test
In general, when should you use non-parametric vs. parametric tests?
In general, when should you use non-parametric vs. parametric tests?
When should you use non-parametric tests of statistical significance? When is it inappropriate to use non-parametric...
When should you use non-parametric tests of statistical significance? When is it inappropriate to use non-parametric statistical tests? Describe what is meant by the phrase: "Power of a a statistical test". Are non-parametric statistical procedures as powerful as parametric statistical procedures?
How many types of tests are considered non-parametric data and briefly explain each
How many types of tests are considered non-parametric data and briefly explain each
6. Parametric tests usually have more statistical power than non-parametric tests. True or False 5. A...
6. Parametric tests usually have more statistical power than non-parametric tests. True or False 5. A post hoc test does not need to be performed when an ANOVA produces a statistically significant F value. True or False 4. In the case of a hypothesis t test, population mean is known. True or False
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT