Freedman–Diaconis rule

In statistics, the Freedman–Diaconis rule can be used to select the width of the bins to be used in a histogram.^[1] It is named after David A. Freedman and Persi Diaconis.

For a set of empirical measurements sampled from some probability distribution, the Freedman–Diaconis rule is designed to approximately minimize the integral of the squared difference between the histogram (that is, the relative frequency density) and the density of the theoretical probability distribution.

The general equation for the rule is:

$$\text{Bin width} = 2\,\frac{\operatorname{IQR}(x)}{\sqrt[3]{n}}$$

where $\operatorname {IQR} (x)$ is the interquartile range of the data and $n$ is the number of observations in the sample $x.$
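The formula can be illustrated with a short Python sketch using only the standard library (Python 3.8 or later). The function names `freedman_diaconis_width` and `bin_count`, the sample data, and the quartile convention (linear interpolation between order statistics) are assumptions made for this illustration; other quartile definitions give slightly different widths. Converting the width into a number of bins by dividing the data range by it is a common follow-up step, not part of the rule itself.

```python
import math
import statistics


def freedman_diaconis_width(data):
    """Return the bin width 2 * IQR(x) / n**(1/3) for a sequence of numbers."""
    n = len(data)
    if n < 2:
        raise ValueError("need at least two observations")
    # Assumption: quartiles by linear interpolation ("inclusive" method).
    q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return 2 * (q3 - q1) / n ** (1 / 3)


def bin_count(data, width):
    """Hypothetical helper: number of equal-width bins covering the data range."""
    if width <= 0:
        raise ValueError("bin width must be positive (the IQR may be zero)")
    return max(1, math.ceil((max(data) - min(data)) / width))


sample = [1, 2, 3, 4, 5, 6, 7, 10]  # made-up data for illustration
width = freedman_diaconis_width(sample)
bins = bin_count(sample, width)
print(f"bin width = {width:.3f}, bins = {bins}")
```

Because the width depends on the interquartile range rather than the standard deviation, a single extreme value such as the 10 in the sample has little effect on it. When the interquartile range is zero the rule yields a zero width, which the sketch rejects rather than handling.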

Other approaches

Another approach is to use Sturges' rule: choose a bin width large enough that there are about $1+\log _{2}n$ non-empty bins (Scott, 2009).^[2] This works well for $n$ under 200, but has been found to be inaccurate for large $n$.^[3] For a discussion and an alternative approach, see Birgé and Rozenholc.^[4]
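As a rough illustration of Sturges' rule, the following Python sketch evaluates $1+\log _{2}n$ for a few sample sizes. The function name `sturges_bin_count` is hypothetical, and rounding up to a whole number of bins is an assumption, since the rule only says "about" that many bins.

```python
import math


def sturges_bin_count(n):
    """Return about 1 + log2(n) bins, rounded up (rounding is an assumption)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return math.ceil(1 + math.log2(n))


# The bin count grows only logarithmically with the sample size.
for n in (50, 200, 1024, 1_000_000):
    print(n, sturges_bin_count(n))
```

The count grows very slowly with $n$. By contrast, for a fixed data range and interquartile range, the number of bins implied by the Freedman–Diaconis width grows in proportion to $\sqrt[3]{n}$.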

References

[1] Freedman, David; Diaconis, Persi (December 1981). "On the histogram as a density estimator: L₂ theory". Probability Theory and Related Fields. 57 (4): 453–476. CiteSeerX 10.1.1.650.2473. doi:10.1007/BF01025868. ISSN 0178-8051.
[2] Scott, D.W. (2009). "Sturges' rule". WIREs Computational Statistics. 1 (3): 303–306. doi:10.1002/wics.35.
[3] Hyndman, R.J. (1995). "The problem with Sturges' rule for constructing histograms" (PDF).
[4] Birgé, L.; Rozenholc, Y. (2006). "How many bins should be put in a regular histogram". ESAIM: Probability and Statistics. 10: 24–45. CiteSeerX 10.1.1.3.220. doi:10.1051/ps:2006001.

