Jump to content

Kruskal–Wallis test: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Kgwet (talk | contribs)
Kgwet (talk | contribs)
Line 27: Line 27:


==References==
==References==
* Gwet, Kilem Li (2011). "The Practical Guide to Statistis: Applications with Excel, R, and Calc" (section 10.3, page 364). Advanced Analytics, LLC ISBN: 978-0970806291.
* Gwet, Kilem Li (2011). "[http://pstat.advancedanalyticsllc.com/index.html The Practical Guide to Statistis: Applications with Excel, R, and Calc]" (section 10.3, page 364). Advanced Analytics, LLC ISBN: 978-0970806291.
* Corder and Foreman. (2009). "Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach". New York: Wiley.
* Corder and Foreman. (2009). "Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach". New York: Wiley.
* Kruskal and Wallis. Use of ranks in one-criterion variance analysis. ''Journal of the American Statistical Association'' '''47''' (260): 583&ndash;621, December 1952. <ref>http://homepages.ucalgary.ca/~jefox/Kruskal%20and%20Wallis%201952.pdf</ref>
* Kruskal and Wallis. Use of ranks in one-criterion variance analysis. ''Journal of the American Statistical Association'' '''47''' (260): 583&ndash;621, December 1952. <ref>http://homepages.ucalgary.ca/~jefox/Kruskal%20and%20Wallis%201952.pdf</ref>

Revision as of 20:54, 29 April 2011

In statistics, the Kruskal–Wallis one-way analysis of variance by ranks (named after William Kruskal and W. Allen Wallis) is a non-parametric method for testing equality of population medians among groups. It is identical to a one-way analysis of variance with the data replaced by their ranks. It is an extension of the Mann–Whitney U test to 3 or more groups.

Since it is a non-parametric method, the Kruskal–Wallis test does not assume a normal population, unlike the analogous one-way analysis of variance. However, the test does assume an identically-shaped and scaled distribution for each group, except for any difference in medians.

Method

  1. Rank all data from all groups together; i.e., rank the data from 1 to N ignoring group membership. Assign any tied values the average of the ranks they would have received had they not been tied.
  2. The test statistic is given by:
    where:
    • is the number of observations in group
    • is the rank (among all observations) of observation from group
    • is the total number of observations across all groups
    • ,
    • is the average of all the .
  3. Notice that the denominator of the expression for is exactly and . Thus

    Notice that the last formula only contains the squares of the average ranks.
  4. A correction for ties can be made by dividing by , where G is the number of groupings of different tied ranks, and ti is the number of tied values within group i that are tied at a particular value. This correction usually makes little difference in the value of K unless there are a large number of ties.
  5. Finally, the p-value is approximated by . If some values are small (i.e., less than 5) the probability distribution of K can be quite different from this chi-square distribution. If a table of the chi-square probability distribution is available, the critical value of chi-square, , can be found by entering the table at g − 1 degrees of freedom and looking under the desired significance or alpha level. The null hypothesis of equal population medians would then be rejected if . Appropriate multiple comparisons would then be performed on the group medians.

See also

References

  • Gwet, Kilem Li (2011). "The Practical Guide to Statistis: Applications with Excel, R, and Calc" (section 10.3, page 364). Advanced Analytics, LLC ISBN: 978-0970806291.
  • Corder and Foreman. (2009). "Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach". New York: Wiley.
  • Kruskal and Wallis. Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association 47 (260): 583–621, December 1952. [1]
  • Siegel and Castellan. (1988). "Nonparametric Statistics for the Behavioral Sciences" (second edition). New York: McGraw–Hill.


References