Jump to content

Kolmogorov–Smirnov test: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
 
Larry_Sanger (talk)
mNo edit summary
Line 23: Line 23:




Note that when the underlying independent variable is cyclic as with day of the year or day of the week, then [[Kuiper Test|Kuiper's test]] is more appropriate. Numerical Recipes is again a good source of information on this. ''Note: Kuiper is pronounced in Dutch the way an English persion would pronounce Cowper''.
See http://www.itl.nist.gov/div898/handbook/eda/section3/eda35g.htm for a lovely explanation of the one-sided KS test and http://www.io.com/~ritter/JAVASCRP/NORMCHIK.HTM for Javascript code that implements both the one-sided and two-sided tests. As always, Numerical Recipes ISBN: 0521431085 is a prime resource for this sort of thing (see http://www.nr.com/nronline_switcher.html for a discussion.






Links:
Note that when the underlying independent variable is cyclic as with day of the year or day of the week, then [[Kuiper Test|Kuiper's test]] is more appropriate. Numerical Recipes is again a good source of information on this. ''Note: Kuiper is pronounced Cowper in Dutch''.



See http://www.itl.nist.gov/div898/handbook/eda/section3/eda35g.htm for a lovely explanation of the one-sided KS test and http://www.io.com/~ritter/JAVASCRP/NORMCHIK.HTM for Javascript code that implements both the one-sided and two-sided tests. As always, Numerical Recipes ISBN: 0521431085 is a prime resource for this sort of thing (see http://www.nr.com/nronline_switcher.html for a discussion.



Revision as of 12:50, 29 June 2001

The Kolmogorov-Smirnov test is used to determine whether two empirical distributions are different or whether an empirical distribution differs from a theoretical distribution.


The empirical cumulative distribution for N observations yi is defined as E(x) = Σ i (yi < x). The two one-sided Kolmogorov-Smirnov test statistics statistics are given by


DN+ = MAX( E(x) - F(x) )


DN- = MAX( F(x) - E(x) )


where F(x) is the theoretical distribution or another empirical distribution. Knuth gives a detailed description of how to analyze the significance of this pair of statistics. Many people use MAX(DN+, DN-) instead, but the distribution of this statistic is more difficult to deal with.


Note that when the underlying independent variable is cyclic as with day of the year or day of the week, then Kuiper's test is more appropriate. Numerical Recipes is again a good source of information on this. Note: Kuiper is pronounced in Dutch the way an English persion would pronounce Cowper.


Links:


See http://www.itl.nist.gov/div898/handbook/eda/section3/eda35g.htm for a lovely explanation of the one-sided KS test and http://www.io.com/~ritter/JAVASCRP/NORMCHIK.HTM for Javascript code that implements both the one-sided and two-sided tests. As always, Numerical Recipes ISBN: 0521431085 is a prime resource for this sort of thing (see http://www.nr.com/nronline_switcher.html for a discussion.