Hodges' estimator

In statistics, Hodges' estimator^[1] (or the Hodges–Le Cam estimator^[2]), named for Joseph Hodges, is a famous counterexample of an estimator which is "superefficient",^[3] i.e. it attains smaller asymptotic variance than regular efficient estimators. The existence of such a counterexample is the reason for the introduction of the notion of regular estimators.

Hodges' estimator improves upon a regular estimator at a single point. In general, any superefficient estimator may surpass a regular estimator at most on a set of Lebesgue measure zero.^[4]

Although Hodges discovered the estimator, he never published it; the first publication was in the doctoral thesis of Lucien Le Cam.^[5]

Construction

Suppose ${\hat {\theta }}_{n}$ is a "common" estimator for some parameter $\theta$ : it is consistent, and converges to some asymptotic distribution $L_{\theta }$ (usually this is a normal distribution with mean zero and variance which may depend on $\theta$ ) at the ${\sqrt {n}}$ -rate:

$${\sqrt {n}}({\hat {\theta }}_{n}-\theta )\ {\xrightarrow {d}}\ L_{\theta }.$$

Then Hodges' estimator ${\hat {\theta }}_{n}^{H}$ is defined as^[6]

$${\hat {\theta }}_{n}^{H}={\begin{cases}{\hat {\theta }}_{n},&{\text{if }}|{\hat {\theta }}_{n}|\geq n^{-1/4},{\text{ and}}\\0,&{\text{if }}|{\hat {\theta }}_{n}|<n^{-1/4}.\end{cases}}$$

This estimator is equal to ${\hat {\theta }}_{n}$ except when ${\hat {\theta }}_{n}$ falls in the small interval $(-n^{-1/4},n^{-1/4})$ , where it is set to zero. It is not difficult to see that this estimator is consistent for $\theta$ , and its asymptotic distribution is^[7]

$${\begin{aligned}&n^{\alpha }({\hat {\theta }}_{n}^{H}-\theta )\ {\xrightarrow {d}}\ 0,\qquad {\text{when }}\theta =0,\\&{\sqrt {n}}({\hat {\theta }}_{n}^{H}-\theta )\ {\xrightarrow {d}}\ L_{\theta },\quad {\text{when }}\theta \neq 0,\end{aligned}}$$

for any $\alpha \in \mathbb {R}$ . Thus this estimator has the same asymptotic distribution as ${\hat {\theta }}_{n}$ for all $\theta \neq 0$ , whereas for $\theta =0$ the rate of convergence becomes arbitrarily fast. This estimator is superefficient, as it surpasses the asymptotic behavior of the efficient estimator ${\hat {\theta }}_{n}$ at least at one point $\theta =0$ .
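The step behind these two limits can be sketched as follows. This is an outline that uses only the assumptions already stated (consistency of ${\hat {\theta }}_{n}$ and convergence in distribution of ${\sqrt {n}}({\hat {\theta }}_{n}-\theta )$), not a complete proof:

$${\begin{aligned}\theta =0:&\quad P({\hat {\theta }}_{n}^{H}\neq 0)=P(|{\hat {\theta }}_{n}|\geq n^{-1/4})=P(|{\sqrt {n}}\,{\hat {\theta }}_{n}|\geq n^{1/4})\to 0,\\\theta \neq 0:&\quad P({\hat {\theta }}_{n}^{H}\neq {\hat {\theta }}_{n})\leq P(|{\hat {\theta }}_{n}|<n^{-1/4})\to 0.\end{aligned}}$$

In the first line, ${\sqrt {n}}\,{\hat {\theta }}_{n}$ converges in distribution and is therefore bounded in probability, while the threshold $n^{1/4}$ diverges; hence ${\hat {\theta }}_{n}^{H}$ is exactly zero with probability tending to one, and so is $n^{\alpha }{\hat {\theta }}_{n}^{H}$ for every $\alpha$ . In the second line, ${\hat {\theta }}_{n}$ converges in probability to $\theta \neq 0$ while the threshold $n^{-1/4}$ shrinks to zero, so the two estimators coincide with probability tending to one and share the same limit distribution.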

Hodges' estimator should not be interpreted as "equivalent to the sample mean, but much better when the true mean is 0". For finite $n$ , the truncation can lead to a worse squared error than that of the sample mean when the true mean $E[X]$ is close to 0, as is shown in the example in the following section.^[8]

Le Cam shows that this behaviour is typical: superefficiency at the point $\theta$ implies the existence of a sequence $\theta _{n}\rightarrow \theta$ such that $\liminf \operatorname {E} _{\theta _{n}}\ell ({\sqrt {n}}({\hat {\theta }}_{n}-\theta _{n}))$ , where $\ell$ is a loss function, is strictly larger than the Cramér–Rao bound. For the extreme case where the asymptotic risk at $\theta$ is zero, the $\liminf$ is even infinite for a sequence $\theta _{n}\rightarrow \theta$ .^[9]

In general, superefficiency may only be attained on a subset of Lebesgue measure zero of the parameter space $\Theta$ .^[10]

Example

Suppose $x_{1},\ldots ,x_{n}$ is an independent and identically distributed (IID) random sample from the normal distribution $N(\theta ,1)$ with unknown mean but known variance. Then the common estimator for the population mean $\theta$ is the arithmetic mean of all observations, ${\bar {x}}$ . The corresponding Hodges' estimator is ${\hat {\theta }}_{n}^{H}={\bar {x}}\cdot \mathbf {1} \{|{\bar {x}}|\geq n^{-1/4}\}$ , where $\mathbf {1} \{\cdot \}$ denotes the indicator function.
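A minimal Python sketch of this estimator may make the thresholding concrete. The function name `hodges_estimate` is an illustrative choice, not taken from any library, and the code assumes the sample is supplied as a non-empty sequence of numbers:

```python
import statistics


def hodges_estimate(sample):
    """Hodges' estimator built on the sample mean (illustrative helper).

    Returns the sample mean when its absolute value is at least n**(-1/4),
    and 0.0 otherwise.
    """
    n = len(sample)
    if n == 0:
        raise ValueError("sample must be non-empty")
    mean = statistics.fmean(sample)
    threshold = n ** -0.25  # the n^{-1/4} cutoff from the definition
    return mean if abs(mean) >= threshold else 0.0
```

For a sample of size 4 the cutoff is $4^{-1/4}\approx 0.71$ , so a sample mean of 2 is returned unchanged, whereas a sample mean of 0.1 is replaced by 0.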

The mean square error (scaled by $n$ ) associated with the regular estimator ${\bar {x}}$ is constant and equal to 1 for all $\theta$ . At the same time, the scaled mean square error of Hodges' estimator ${\hat {\theta }}_{n}^{H}$ behaves erratically in the vicinity of zero, and its maximum over $\theta$ even becomes unbounded as $n\to \infty$ . This demonstrates that Hodges' estimator is not regular, and that its asymptotic properties are not adequately described by limits of the form ( $\theta$ fixed, $n\to \infty$ ).
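The behaviour of the scaled mean square error in this normal example can be computed in closed form. The sketch below is one way to do so and is not taken from the cited sources. It writes $Z={\sqrt {n}}({\bar {x}}-\theta )\sim N(0,1)$ , $h={\sqrt {n}}\,\theta$ and $a=n^{1/4}$ , so that the scaled error equals $Z$ when $|Z+h|\geq a$ and $-h$ otherwise. The helper names (`std_normal_pdf`, `std_normal_cdf`, `scaled_mse_hodges`) are illustrative.

```python
import math


def std_normal_pdf(x):
    """Density of the standard normal distribution."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def std_normal_cdf(x):
    """Distribution function of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def scaled_mse_hodges(theta, n):
    """n * E[(Hodges estimate - theta)^2] for an IID N(theta, 1) sample of size n."""
    a = n ** 0.25               # threshold on the sqrt(n) scale
    h = math.sqrt(n) * theta    # true mean on the sqrt(n) scale
    lower, upper = -a - h, a - h  # truncation happens when lower < Z < upper

    # Probability that the estimator is set to zero.
    p_zero = std_normal_cdf(upper) - std_normal_cdf(lower)

    # E[Z^2 ; lower < Z < upper], obtained by integration by parts.
    z2_truncated = p_zero - (upper * std_normal_pdf(upper)
                             - lower * std_normal_pdf(lower))

    # Outside the truncation region the error is Z; inside it is -h.
    return (1.0 - z2_truncated) + h * h * p_zero


if __name__ == "__main__":
    n = 10_000
    for theta in (0.0, 0.5 * n ** -0.25, n ** -0.25, 1.0):
        print(f"theta={theta:.4f}  scaled MSE={scaled_mse_hodges(theta, n):.4f}")
```

Under these assumptions the scaled risk is essentially 0 at $\theta =0$ and essentially 1 for $\theta$ well away from zero, while at $\theta =n^{-1/4}$ it is approximately $(1+{\sqrt {n}})/2$ , which grows without bound as $n$ increases.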

Notes

[1] Vaart (1998, p. 109)
[2] Kale (1985)
[3] Bickel et al. (1998, p. 21)
[4] Vaart (1998, p. 116)
[5] Le Cam, Lucien M. (1953). On some asymptotic properties of maximum likelihood estimates and related Bayes' estimates. University of California Publications in Statistics, v. 1, no. 11. Berkeley: University of California Press.
[6] Stoica & Ottersten (1996, p. 135)
[7] Vaart (1998, p. 109)
[8] Vaart (1998)
[9] van der Vaart, A. W.; Wellner, J. A. (1996). Weak Convergence and Empirical Processes. Springer Series in Statistics. New York: Springer. https://doi.org/10.1007/978-1-4757-2545-2
[10] Vaart (1998)
[11] Vaart (1998, p. 110)

References

Bickel, Peter J.; Klaassen, Chris A.J.; Ritov, Ya’acov; Wellner, Jon A. (1998). Efficient and adaptive estimation for semiparametric models. Springer: New York. ISBN 0-387-98473-9.
Kale, B.K. (1985). "A note on the super efficient estimator". Journal of Statistical Planning and Inference. 12: 259–263. doi:10.1016/0378-3758(85)90074-6.
Stoica, P.; Ottersten, B. (1996). "The evil of superefficiency". Signal Processing. 55: 133–136. doi:10.1016/S0165-1684(96)00159-4.
Vaart, A. W. van der (1998). Asymptotic statistics. Cambridge University Press. ISBN 978-0-521-78450-4.
