= Anderson–Darling test =

The Anderson–Darling test is a statistical test of whether a given sample of data is drawn from a given probability distribution. In its basic form, the test assumes that there are no parameters to be estimated in the distribution being tested, in which case the test and its set of critical values are distribution-free. However, the test is most often used where a family of distributions is being tested, in which case the parameters of that family need to be estimated, and this must be accounted for by adjusting either the test statistic or its critical values. When applied to testing whether a normal distribution adequately describes a set of data, it is one of the most powerful statistical tools for detecting most departures from normality.
K-sample Anderson–Darling tests are available for testing whether several collections of observations can be modelled as coming from a single population, where the distribution function does not have to be specified.

In addition to its use as a test of fit for distributions, the Anderson–Darling statistic can be used in parameter estimation as the basis for a form of minimum distance estimation procedure.

The test is named after Theodore Wilbur Anderson (1918–2016) and Donald A. Darling (1915–2014), who invented it in 1952.

==The single-sample test==

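The Anderson–Darling and Cramér–von Mises statistics belong to the class of quadratic EDF statistics (tests based on the empirical distribution function). If the hypothesized distribution is $F$, and the empirical (sample) cumulative distribution function is $F_n$, then the quadratic EDF statistics measure the distance between $F$ and $F_n$ by
$n \int_{-\infty}^{\infty} {\left(F_n(x) - F(x)\right)}^2 \, w(x) \, dF(x),$
where $n$ is the number of elements in the sample, and $w(x)$ is a weighting function. When the weighting function is $w(x)=1$, the statistic is the Cramér–von Mises statistic. The Anderson–Darling (1954) test is based on the distance
$A^2 = n \int_{-\infty}^{\infty} \frac{{\left(F_n(x) - F(x)\right)}^2}{F(x)\left(1 - F(x)\right)} \, dF(x),$
which is obtained when the weighting function is $w(x) = \left[F(x)\left(1 - F(x)\right)\right]^{-1}$. Compared with the Cramér–von Mises distance, the Anderson–Darling distance therefore places more weight on observations in the tails of the distribution.

For a test of normality, the sorted observations $X_1 \le \cdots \le X_n$ are standardized as
$Y_i = \frac{X_i - \hat{\mu}}{\hat{\sigma}},$
where $\hat{\mu}$ and $\hat{\sigma}$ are the mean and standard deviation, either known or estimated from the sample.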
With the standard normal CDF $\Phi$, $A^2$ is calculated using
$A^2 = -n -\frac{1}{n} \sum_{i=1}^n \left(2i - 1\right) \left[\ln \Phi(Y_i) + \ln(1-\Phi(Y_{n+1-i}))\right].$
An alternative expression in which only a single observation is dealt with at each step of the summation is:
$A^2 = -n -\frac{1}{n} \sum_{i=1}^n\left[(2i-1)\ln\Phi(Y_i)+(2(n-i)+1)\ln(1-\Phi(Y_i))\right].$
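The two expressions for $A^2$ can be checked against each other numerically. The following is a minimal Python sketch, assuming the values passed in are already standardized; the function names `normal_cdf`, `a2_paired` and `a2_single` are hypothetical and chosen only for this illustration.

```python
import math


def normal_cdf(y):
    """Standard normal CDF, Phi(y), computed from the error function."""
    return 0.5 * (1.0 + math.erf(y / math.sqrt(2.0)))


def a2_paired(values):
    """A^2 using the form that pairs Y_i with Y_{n+1-i}."""
    y = sorted(values)
    n = len(y)
    total = 0.0
    for i in range(1, n + 1):
        # y[i - 1] is Y_i and y[n - i] is Y_{n+1-i} in 1-based notation.
        total += (2 * i - 1) * (
            math.log(normal_cdf(y[i - 1]))
            + math.log(1.0 - normal_cdf(y[n - i]))
        )
    return -n - total / n


def a2_single(values):
    """A^2 using the form that touches one observation per term."""
    y = sorted(values)
    n = len(y)
    total = 0.0
    for i in range(1, n + 1):
        p = normal_cdf(y[i - 1])
        total += (2 * i - 1) * math.log(p)
        total += (2 * (n - i) + 1) * math.log(1.0 - p)
    return -n - total / n
```

Both functions raise a `ValueError` from `math.log` if any $\Phi(Y_i)$ rounds to exactly 0 or 1, which corresponds to the undefined case of the statistic.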
A modified statistic can be calculated using
$A^{*2} =
\begin{cases}
A^2\left(1+\frac{0.75}{n}+\frac{2.25}{n^2}\right), & \text{if the variance and the mean are both unknown.} \\
A^2, & \text{otherwise.}
\end{cases}$

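If $A^{2}$ or $A^{*2}$ exceeds a given critical value, then the hypothesis of normality is rejected at the corresponding significance level. The critical values for $A^{*2}$ are given in the table below. Following Stephens, Case 0 means that the mean and variance are both known, Case 1 that the variance is known and the mean is estimated, Case 2 that the mean is known and the variance is estimated, and Case 3 that both are estimated from the data.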

| Case | n | 15% | 10% | 5% | 2.5% | 1% |
| 0 | ≥ 5 | 1.621 | 1.933 | 2.492 | 3.070 | 3.857 |
| 1 | ≥ 20 | 0.782 | 0.894 | 1.087 | 1.285 | 1.551 |
| 2 | ≥ 20 | 1.430 | 1.743 | 2.308 | 2.898 | 3.702 |
| 3 | | 0.561 | 0.631 | 0.752 | 0.873 | 1.035 |
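As an illustration of how the modified statistic and the table fit together, here is a hedged Python sketch for the situation in which both the mean and the variance are estimated from the data. It assumes that the sample standard deviation with the $n-1$ denominator is used for $\hat{\sigma}$ (sources differ on this point), and the names `CASE3_CRITICAL`, `adjusted_a2_case3` and `rejects_normality` are hypothetical.

```python
import math
import statistics

# Critical values for A*^2, copied from the Case 3 row of the table above.
CASE3_CRITICAL = {0.15: 0.561, 0.10: 0.631, 0.05: 0.752, 0.025: 0.873, 0.01: 1.035}


def adjusted_a2_case3(x):
    """Modified statistic A*^2 with mean and variance estimated from x."""
    n = len(x)
    mu = statistics.mean(x)
    sigma = statistics.stdev(x)  # assumption: n - 1 denominator
    if sigma == 0:
        raise ValueError("A^2 is undefined when the estimated sigma is 0")
    phi = statistics.NormalDist().cdf  # standard normal CDF
    y = sorted((v - mu) / sigma for v in x)
    total = sum(
        (2 * i - 1) * (math.log(phi(y[i - 1])) + math.log(1.0 - phi(y[n - i])))
        for i in range(1, n + 1)
    )
    a2 = -n - total / n
    # Small-sample adjustment for the case where both parameters are unknown.
    return a2 * (1.0 + 0.75 / n + 2.25 / n ** 2)


def rejects_normality(x, alpha=0.05):
    """True if A*^2 exceeds the tabulated critical value at level alpha."""
    return adjusted_a2_case3(x) > CASE3_CRITICAL[alpha]
```

Only the five tabulated significance levels are supported in this sketch; any other `alpha` raises a `KeyError` rather than interpolating.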

Note 1: If $\hat{\sigma} = 0$ or any $\Phi(Y_i)$ equals 0 or 1, then $A^2$ cannot be calculated and is undefined.

Note 2: The above formula for the modified statistic $A^{*2}$ is taken from D'Agostino (1986, p. 123). Care is required in comparisons across different sources as often the specific adjustment formula is not stated.

Note 3: Stephens notes that the test becomes better when the parameters are computed from the data, even if they are known.

Note 4: Marsaglia & Marsaglia provide a more accurate result for Case 0 at 85% and 99%.

===Tests for other distributions===

Above, it was assumed that the variable $X_i$ was being tested for a normal distribution. Any other family of distributions can be tested, but the test for each family is implemented by using a different modification of the basic test statistic, with reference to critical values specific to that family of distributions. The modifications of the statistic and tables of critical values are given by Stephens (1986) for the exponential, extreme-value, Weibull, gamma, logistic, Cauchy, and von Mises distributions. Tests for the (two-parameter) log-normal distribution can be implemented by transforming the data using a logarithm and using the above test for normality. Details of the required modifications to the test statistic and of the critical values for the normal distribution and the exponential distribution have been published by Pearson & Hartley (1972, Table 54). Details for these distributions, with the addition of the Gumbel distribution, are also given by Shorack & Wellner (1986, p. 239). Details for the logistic distribution are given by Stephens (1979). A test for the (two-parameter) Weibull distribution can be obtained by making use of the fact that the logarithm of a Weibull variate has a Gumbel distribution.
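The log-normal case can be sketched in Python as a thin wrapper around the normality statistic: take logarithms, then proceed as for a normal sample with both parameters estimated. This is an illustrative sketch only. The function names are hypothetical, and the $n-1$ denominator for the standard deviation is an assumption.

```python
import math
import statistics


def adjusted_a2_normal(x):
    """A*^2 for normality with mean and variance both estimated from x."""
    n = len(x)
    mu = statistics.mean(x)
    sigma = statistics.stdev(x)  # assumption: n - 1 denominator
    phi = statistics.NormalDist().cdf  # standard normal CDF
    y = sorted((v - mu) / sigma for v in x)
    total = sum(
        (2 * i - 1) * (math.log(phi(y[i - 1])) + math.log(1.0 - phi(y[n - i])))
        for i in range(1, n + 1)
    )
    return (-n - total / n) * (1.0 + 0.75 / n + 2.25 / n ** 2)


def adjusted_a2_lognormal(x):
    """Test statistic for a two-parameter log-normal fit via a log transform."""
    if min(x) <= 0:
        raise ValueError("log-normal data must be strictly positive")
    return adjusted_a2_normal([math.log(v) for v in x])
```

The value returned by `adjusted_a2_lognormal` is compared with the same critical values as in the normality test for estimated mean and variance, since after the transform the hypothesis being tested is normality of the logarithms.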

==Non-parametric k-sample tests==
Fritz Scholz and Michael A. Stephens (1987) discuss a test, based on the Anderson–Darling measure of agreement between distributions, for whether a number of random samples with possibly different sample sizes may have arisen from the same distribution, where this distribution is unspecified. The R package kSamples and the Python package SciPy implement this rank test for comparing k samples, among several other such rank tests.

For $k$ samples, the statistic can be computed as follows, under the assumption that the distribution function $F_i$ of the $i$-th sample is continuous:

$A^2_{kN} = \frac{1}{N} \sum_{i=1}^k \frac{1}{n_i} \sum_{j=1}^{N-1} \frac{(NM_{ij} - jn_i)^2}{j(N-j)}$

where
- $n_i$ is the number of observations in the $i$-th sample
- $N$ is the total number of observations in all samples
- $Z_1 < \cdots < Z_N$ is the pooled ordered sample
- $M_{ij}$ is the number of observations in the $i$-th sample that are not greater than $Z_j$.
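The formula above translates directly into a double loop over samples and pooled order statistics. The following Python sketch assumes there are no ties in the pooled sample, in line with the continuity assumption; the function name `anderson_darling_k_sample` is hypothetical, and the sketch returns only the raw statistic $A^2_{kN}$ without any standardization or p-value.

```python
import bisect


def anderson_darling_k_sample(samples):
    """Raw k-sample statistic A^2_{kN} for samples without ties."""
    pooled = sorted(v for s in samples for v in s)
    total_n = len(pooled)  # N, the total number of observations
    if len(set(pooled)) != total_n:
        raise ValueError("this sketch assumes no ties in the pooled sample")
    outer = 0.0
    for s in samples:
        n_i = len(s)
        s_sorted = sorted(s)
        inner = 0.0
        for j in range(1, total_n):  # j = 1, ..., N - 1
            z_j = pooled[j - 1]
            # M_ij: observations in sample i that are not greater than Z_j.
            m_ij = bisect.bisect_right(s_sorted, z_j)
            inner += (total_n * m_ij - j * n_i) ** 2 / (j * (total_n - j))
        outer += inner / n_i
    return outer / total_n
```

For example, the fully separated samples `[1, 2]` and `[3, 4]` give $5/3$, whereas the interleaved samples `[1, 3]` and `[2, 4]` give $2/3$, so larger values indicate stronger disagreement between the samples.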

==See also==
- Kolmogorov–Smirnov test
- Kuiper's test
- Shapiro–Wilk test
- Jarque–Bera test
- Goodness of fit
