# Nonparametric skew

In statistics and probability theory, the nonparametric skew is a statistic occasionally used with random variables that take real values. It is a measure of the skewness of a random variable's distribution—that is, the distribution's tendency to "lean" to one side or the other of the mean. Its calculation does not require any knowledge of the form of the underlying distribution—hence the name nonparametric. It has some desirable properties: it is zero for any symmetric distribution; it is unaffected by a scale shift; and it reveals either left- or right-skewness equally well. In some statistical samples it has been shown to be less powerful than the usual measures of skewness in detecting departures of the population from normality.

## Properties

### Definition

The nonparametric skew is defined as

$S={\frac {\mu -\nu }{\sigma }}$ where the mean (µ), median (ν) and standard deviation (σ) of the population have their usual meanings.

### Properties

The nonparametric skew is one third of the Pearson 2 skewness coefficient and lies between −1 and +1 for any distribution. This range is implied by the fact that the mean lies within one standard deviation of any median.

Under an affine transformation of the variable (X), the value of S does not change except for a possible change in sign. In symbols

$S(aX+b)=\operatorname {sign} (a)\,S(X)$ where a ≠ 0 and b are constants and S( X ) is the nonparametric skew of the variable X.

## Sharper bounds

The bounds of this statistic ( ±1 ) were sharpened by Majindar who showed that its absolute value is bounded by

${\frac {2(pq)^{1/2}}{(p+q)^{1/2}}}$ with

$p=\Pr(X>\operatorname {E} (X))$ and

$q=\Pr(X<\operatorname {E} (X)),$ where X is a random variable with finite variance, E() is the expectation operator and Pr() is the probability of the event occurring.

When p = q = 0.5 the absolute value of this statistic is bounded by 1. With p = 0.1 and p = 0.01, the statistic's absolute value is bounded by 0.6 and 0.199 respectively.

## Extensions

It is also known that

$|\mu -\nu _{0}|\leq \operatorname {E} (|X-\nu _{0}|)\leq \operatorname {E} (|X-\mu |)\leq \sigma ,$ where ν0 is any median and E(.) is the expectation operator.

It has been shown that

${\frac {|\mu -x_{q}|}{\sigma }}\leq \max \left({\sqrt {\frac {(1-q)}{q}}},{\sqrt {\frac {q}{(1-q)}}}\right)$ where xq is the qth quantile. Quantiles lie between 0 and 1: the median (the 0.5 quantile) has q = 0.5. This inequality has also been used to define a measure of skewness.

This latter inequality has been sharpened further.

$\mu -\sigma {\sqrt {\frac {1-q}{q}}}\leq x_{q}\leq \mu +\sigma {\sqrt {\frac {q}{1-q}}}$ Another extension for a distribution with a finite mean has been published:

$\mu -{\frac {1}{2q}}\operatorname {E} |X-\mu |\leq x_{q}\leq \mu +{\frac {1}{(2-2q)}}\operatorname {E} |X-\mu |$ The bounds in this last pair of inequalities are attained when $\Pr(X=a)=q$ and $\Pr(X=b)=1-q$ for fixed numbers a < b.

### Finite samples

For a finite sample with sample size n ≥ 2 with xr is the rth order statistic, m the sample mean and s the sample standard deviation corrected for degrees of freedom,

${\frac {|m-x_{r}|}{s}}\leq {\text{max}}\left[{\sqrt {\frac {(n-1)(r-1)}{n(n-r+1)}}},{\sqrt {\frac {(n-1)(n-r)}{nr}}}\right]$ Replacing r with n / 2 gives the result appropriate for the sample median:

${\frac {|m-a|}{s}}\leq {\sqrt {\frac {n^{2}-n}{n^{2}}}}={\sqrt {\frac {n-1}{n}}}$ where a is the sample median.

## Statistical tests

Hotelling and Solomons considered the distribution of the test statistic

$D={\frac {n(m-a)}{s}}$ where n is the sample size, m is the sample mean, a is the sample median and s is the sample's standard deviation.

Statistical tests of D have assumed that the null hypothesis being tested is that the distribution is symmetric .

Gastwirth estimated the asymptotic variance of n−1/2D. If the distribution is unimodal and symmetric about 0, the asymptotic variance lies between 1/4 and 1. Assuming a conservative estimate (putting the variance equal to 1) can lead to a true level of significance well below the nominal level.

Assuming that the underlying distribution is symmetric Cabilio and Masaro have shown that the distribution of S is asymptotically normal. The asymptotic variance depends on the underlying distribution: for the normal distribution, the asymptotic variance of Sn is 0.5708...

Assuming that the underlying distribution is symmetric, by considering the distribution of values above and below the median Zheng and Gastwirth have argued that

${\sqrt {2n}}\left({\frac {m-a}{s}}\right)$ where n is the sample size, is distributed as a t distribution.

## Related statistics

Mira studied the distribution of the difference between the mean and the median.

$\gamma _{1}=2(m-a),$ where m is the sample mean and a is the median. If the underlying distribution is symmetrical γ1 itself is asymptotically normal. This statistic had been earlier suggested by Bonferroni.

Assuming a symmetric underlying distribution, a modification of S was studied by Miao, Gel and Gastwirth who modified the standard deviation to create their statistic.

$J={\frac {1}{n}}{\sqrt {\frac {\pi }{2}}}\sum {|X_{i}-a|}$ where Xi are the sample values, || is the absolute value and the sum is taken over all n sample values.

The test statistic was

$T={\frac {m-a}{J}}.$ The scaled statistic Tn is asymptotically normal with a mean of zero for a symmetric distribution. Its asymptotic variance depends on the underlying distribution: the limiting values are, for the normal distribution var(Tn) = 0.5708... and, for the t distribution with three degrees of freedom, var(Tn) = 0.9689...

## Values for individual distributions

### Symmetric distributions

For symmetric probability distributions the value of the nonparametric skew is 0.

### Asymmetric distributions

It is positive for right skewed distributions and negative for left skewed distributions. Absolute values ≥ 0.2 indicate marked skewness.

It may be difficult to determine S for some distributions. This is usually because a closed form for the median is not known: examples of such distributions include the gamma distribution, inverse-chi-squared distribution, the inverse-gamma distribution and the scaled inverse chi-squared distribution.

The following values for S are known:

• Beta distribution: 1 < α < β where α and β are the parameters of the distribution, then to a good approximation
$S={\frac {1}{3}}{\frac {(\alpha -2\beta )(\alpha +\beta +1)^{1/2}}{(\alpha +\beta -2/3)(\alpha \beta )^{1/2}}}$ If 1 < β < α then the positions of α and β are reversed in the formula. S is always < 0.
$S={\frac {2}{\beta ^{2}(4+5\alpha ^{2})}}$ where α is the shape parameter and β is the location parameter.
${\frac {-4}{3}}\leq S\leq {\frac {4}{3}}$ $S\approx {\frac {1-(1-{\frac {2}{k}})^{3}}{2}}$ $S=1-\log _{e}(2)\approx 0.31$ $S=1-\log _{e}(2)\approx 0.31$ $S=-{\frac {polylog(2,1-p)+\ln(1+{\sqrt {p}})\ln p}{\sqrt {-[2polylog(3,1-p)+polylog^{2}(2,1-p)]}}}$ Here S is always > 0.
$0\leq S\leq 1-\log _{e}(2)$ $S=n^{-3/2}{\sqrt {\frac {n-4}{n-2}}}+O(n^{-5/2})$ $S={\frac {\Gamma \left(1-{\frac {1}{\alpha }}\right)-{\frac {1}{{\sqrt {\alpha }}\log _{e}(2)}}}{\sqrt {\Gamma \left(1-{\frac {2}{\alpha }}\right)-\left(\Gamma \left(1-{\frac {1}{\alpha }}\right)\right)^{2}}}}$ • Gamma distribution: The median can only be determined approximately for this distribution. If the shape parameter α is ≥ 1 then
$S\approx {\frac {\beta }{3\alpha +0.2}}$ where β > 0 is the rate parameter. Here S is always > 0.
$S=-{\frac {\exp({\frac {-k^{2}}{2}})-1}{\sqrt {\exp({\frac {k^{2}}{2}})-1}}}$ S is always < 0.
$S=\left({\frac {2^{k}-1}{k}}-2^{k}\right)(1-2k)^{0.5}$ ${\frac {{\sqrt {6}}[\gamma +\log _{e}(\log _{e}(2))]}{\pi }}\approx 0.1643$ where γ is Euler's constant.
$S\approx {\frac {{\sqrt {2}}-0.6745{\sqrt {\pi }}}{\sqrt {\pi -2}}}\approx 0.36279$ $S={\frac {b-\sin(b)}{\sqrt {b\tan(b)-b^{2}}}}$ The standard deviation does not exist for values of b > 4.932 (approximately). For values for which the standard deviation is defined, S is > 0.
$S={\frac {1}{(e^{\frac {\sigma ^{2}}{2}}+1)(e^{\mu +\sigma ^{2}})}}$ $S\approx {\frac {[\log _{e}(\log _{e}(2))-0.5772]{\sqrt {6}}}{\pi }}\approx -0.1643$ $S={\frac {(\alpha -1)(\alpha -2)(1-(\alpha -1)(2^{1/\alpha }-1))}{\alpha ^{1/2}}}$ $S\approx {\frac {{\sqrt {2}}-1.5382\Gamma ({\frac {3}{2}})}{\sqrt {2(\Gamma ({\frac {5}{2}})-\Gamma ({\frac {3}{2}}))}}}\approx 0.0854$ $S=-1$ $S=(\alpha -2^{1/\alpha }[\alpha -1])({\frac {\alpha -2}{\alpha }})^{1/2},$ and S is always > 0.
${\frac {-\log _{e}(2)}{\lambda ^{\frac {1}{2}}}}\leq S\leq {\frac {1}{3\lambda ^{\frac {1}{2}}}}$ where λ is the parameter of the distribution.
$S={\sqrt {\frac {2}{4-\pi }}}[({\frac {\pi }{2}})^{0.5}-\log _{e}(4)]\approx 0.1251$ $S={\frac {\Gamma (1+1/k)-\log _{e}(2)^{1/k}}{(\Gamma (1+2/k)-\Gamma (1+1/k))^{1/2}}},$ where k is the shape parameter of the distribution. Here S is always > 0.

## History

In 1895 Pearson first suggested measuring skewness by standardizing the difference between the mean and the mode, giving

${\frac {\mu -\theta }{\sigma }},$ where μ, θ and σ is the mean, mode and standard deviation of the distribution respectively. Estimates of the population mode from the sample data may be difficult but the difference between the mean and the mode for many distributions is approximately three times the difference between the mean and the median which suggested to Pearson a second skewness coefficient:

${\frac {3(\mu -\nu )}{\sigma }},$ where ν is the median of the distribution. Bowley dropped the factor 3 from this formula in 1901 leading to the nonparametric skew statistic.

The relationship between the median, the mean and the mode was first noted by Pearson when he was investigating his type III distributions.

## Relationships between the mean, median and mode

For an arbitrary distribution the mode, median and mean may appear in any order.

Analyses have been made of some of the relationships between the mean, median, mode and standard deviation. and these relationships place some restrictions of the sign and magnitude of the nonparametric skew.

A simple example illustrating these relationships is the binomial distribution with n = 10 and p = 0.09. This distribution when plotted has a long right tail. The mean (0.9) is to the left of the median (1) but the skew (0.906) as defined by the third standardized moment is positive. In contrast the nonparametric skew is -0.110.

### Pearson's rule

The rule that for some distributions the difference between the mean and the mode is three times that between the mean and the median is due to Pearson who discovered it while investigating his Type 3 distributions. It is often applied to slightly asymmetric distributions that resemble a normal distribution but it is not always true.

In 1895 Pearson noted that for what is now known as the gamma distribution that the relation

$\nu -\theta =2(\mu -\nu )$ where θ, ν and µ are the mode, median and mean of the distribution respectively was approximately true for distributions with a large shape parameter.

Doodson in 1917 proved that the median lies between the mode and the mean for moderately skewed distributions with finite fourth moments. This relationship holds for all the Pearson distributions and all of these distributions have a positive nonparametric skew.

Doodson also noted that for this family of distributions to a good approximation,

$\theta =3\nu -2\mu ,$ where θ, ν and µ are the mode, median and mean of the distribution respectively. Doodson's approximation was further investigated and confirmed by Haldane. Haldane noted that in samples with identical and independent variates with a third cumulant had sample means that obeyed Pearson's relationship for large sample sizes. Haldane required a number of conditions for this relationship to hold including the existence of an Edgeworth expansion and the uniqueness of both the median and the mode. Under these conditions he found that mode and the median converged to 1/2 and 1/6 of the third moment respectively. This result was confirmed by Hall under weaker conditions using characteristic functions.

Doodson's relationship was studied by Kendall and Stuart in the log-normal distribution for which they found an exact relationship close to it.

Hall also showed that for a distribution with regularly varying tails and exponent α that[clarification needed]

$\mu -\theta =\alpha (\mu -\nu )$ ### Unimodal distributions

Gauss showed in 1823 that for a unimodal distribution

$\sigma \leq \omega \leq 2\sigma$ and

$|\nu -\mu |\leq {\sqrt {\frac {3}{4}}}\omega ,$ where ω is the root mean square deviation from the mode.

For a large class of unimodal distributions that are positively skewed the mode, median and mean fall in that order. Conversely for a large class of unimodal distributions that are negatively skewed the mean is less than the median which in turn is less than the mode. In symbols for these positively skewed unimodal distributions

$\theta \leq \nu \leq \mu$ and for these negatively skewed unimodal distributions

$\mu \leq \nu \leq \theta$ This class includes the important F, beta and gamma distributions.

This rule does not hold for the unimodal Weibull distribution.

For a unimodal distribution the following bounds are known and are sharp:

${\frac {|\theta -\mu |}{\sigma }}\leq {\sqrt {3}},$ ${\frac {|\nu -\mu |}{\sigma }}\leq {\sqrt {0.6}},$ ${\frac {|\theta -\nu |}{\sigma }}\leq {\sqrt {3}},$ where μ,ν and θ are the mean, median and mode respectively.

The middle bound limits the nonparametric skew of a unimodal distribution to approximately ±0.775.

### van Zwet condition

The following inequality,

$\theta \leq \nu \leq \mu ,$ where θ, ν and µ is the mode, median and mean of the distribution respectively, holds if

$F(\nu -x)+F(\nu +x)\geq 1{\text{ for all }}x,$ where F is the cumulative distribution function of the distribution. These conditions have since been generalised and extended to discrete distributions. Any distribution for which this holds has either a zero or a positive nonparametric skew.