Let be the measured kth moment, the corresponding corrected moment, and the class interval (bin width). No correction is necessary for the mean (first moment about zero). The first few measured and corrected moments about the mean are then related as follows:
When the data come from a normally distributed population, then binning and using the midpoint of the bin as the observed value results in an overestimate of the variance. That is why the correction to the variance is negative. The reason why the uncorrected estimate of the variance is an overestimate is that the error is negatively correlated with the observation. For the uniform distribution, the error is uncorrelated with the observation, so a correction should be +c2/12, which is the variance of the error itself rather than −c2/12. Thus Sheppard's correction is biased in favor of population distributions in which the error is negatively correlated with the observation.
The cumulants of the sum of the grouped variable and the uniform variable are the sums of the cumulants. As odd cumulants of a uniform distribution are zero; only even moments are affected.
The second and fourth cumulants of the uniform distribution on (−0.5c, 0.5c) are respectively, c2/12 and −c4/120.
The correction to moments can be derived from the relation between cumulants and moments.
- Weisstein, Eric W. "Sheppard's Correction". MathWorld—A Wolfram Web Resource. Retrieved March 2, 2014.
- Weatherburn, C.E. (1949), A first course in mathematical statistics, Cambridge University Press
|This statistics-related article is a stub. You can help Wikipedia by expanding it.|