Asymptotic theory (statistics)
In statistics, asymptotic theory, or large sample theory, is a framework for assessing properties of estimators and statistical tests. Within this framework, it is often assumed that the sample size n may grow indefinitely; the properties of estimators and tests are then evaluated under the limit of n → ∞. In practice, a limit evaluation is considered to be approximately valid for large finite sample sizes too.[1]
Overview
[edit]Most statistical problems begin with a dataset of size n. The asymptotic theory proceeds by assuming that it is possible (in principle) to keep collecting additional data, thus that the sample size grows infinitely, i.e. n → ∞. Under the assumption, many results can be obtained that are unavailable for samples of finite size. An example is the weak law of large numbers. The law states that for a sequence of independent and identically distributed (IID) random variables X1, X2, ..., if one value is drawn from each random variable and the average of the first n values is computed as Xn, then the Xn converge in probability to the population mean E[Xi] as n → ∞.[2]
In asymptotic theory, the standard approach is n → ∞. For some statistical models, slightly different approaches of asymptotics may be used. For example, with panel data, it is commonly assumed that one dimension in the data remains fixed, whereas the other dimension grows: T = constant and N → ∞, or vice versa.[2]
Besides the standard approach to asymptotics, other alternative approaches exist:
- Within the local asymptotic normality framework, it is assumed that the value of the "true parameter" in the model varies slightly with n, such that the n-th model corresponds to θn = θ + h/√n . This approach lets us study the regularity of estimators.
- When statistical tests are studied for their power to distinguish against the alternatives that are close to the null hypothesis, it is done within the so-called "local alternatives" framework: the null hypothesis is H0: θ = θ0 and the alternative is H1: θ = θ0 + h/√n . This approach is especially popular for the unit root tests.
- There are models where the dimension of the parameter space Θn slowly expands with n, reflecting the fact that the more observations there are, the more structural effects can be feasibly incorporated in the model.
- In kernel density estimation and kernel regression, an additional parameter is assumed—the bandwidth h. In those models, it is typically taken that h → 0 as n → ∞. The rate of convergence must be chosen carefully, though, usually h ∝ n−1/5.
In many cases, highly accurate results for finite samples can be obtained via numerical methods (i.e. computers); even in such cases, though, asymptotic analysis can be useful. This point was made by Small (2010, §1.4), as follows.
A primary goal of asymptotic analysis is to obtain a deeper qualitative understanding of quantitative tools. The conclusions of an asymptotic analysis often supplement the conclusions which can be obtained by numerical methods.
Modes of convergence of random variables
[edit]Asymptotic properties
[edit]Point estimators
[edit]In the following, will be an arbitrary sequence of estimators.
The sequence is said to be (weakly) consistent, if it converges in probability to the true value of the parameter being estimated:
This means that the probability that the estimated value gets arbitrarily close to converges to zero as tends to infinity.[2] The sequence of estimates is said be strongly consistent, if it even converges almost surely to the true parameter:The subtle difference to a weakly consistent estimator is that a strongly consistent estimator is guaranteed to get arbitrarily close to as tends to infinity, eventually.
If it is possible to find sequences of non-random constants , (possibly depending on the value ), and a non-degenerate distribution such that
then the sequence of estimators is said to have the asymptotic distribution .
Most often, the estimators encountered in practice are asymptotically normal, meaning their asymptotic distribution is the normal distribution, with , , and for some covariance matrix :
Asymptotic unbiasedness
[edit]The sequence is called asymptotically unbiased for the parameter , if converges in distribution to a random variable with for some sequence of real numbers.[3]
Limiting variance
[edit]If there exists a sequence of real numbers such that , then is called the limiting variance of . Typically, it is chosen .[4]
Asymptotic variance
[edit]If the sequence is asymptotically normal for , that is, , then is called the asymptotic variance of .[5]
In the typical (univariate) case, the asymptotic variance will coincide with the limiting variance. In general, however, the two concepts might differ and, if , it always holds .[6]
The sequence is called asymptotically efficient, if is asymptotically normal for and its asymptotic variance is with the Fisher information , that is, the asymptotic variance achieves the Cramér–Rao bound.[5]
Statistical tests
[edit]Asymptotic confidence regions
[edit]Asymptotic theorems
[edit]Asymptotic properties of a sequence of estimators (for a parameter ) often result from general theorems from probability theory. In the following, we let be independent and identically distributed observations from the true distribution .
For consistency
[edit]Strong law of large numbers
[edit]If the estimators are of the form for some measurable function such that , then by the strong law of large numbers
Thus, if , then is strongly consistent.
Continuous mapping theorem
[edit]If the estimators are of the form for some continuous function and a sequence of estimators , then by the continuous mapping theorem . The same holds true if almost sure convergence is replaced by convergence in probability.
Thus, if , then is (strongly) consistent.
For asymptotic distribution
[edit]- Central limit theorem
- Fisher–Tippet–Gnedenko theorem
- Glivenko–Cantelli theorem
- Slutsky's theorem
- Delta method
See also
[edit]References
[edit]- ^ Höpfner, R. (2014), Asymptotic Statistics, Walter de Gruyter. 286 pag. ISBN 3110250241, ISBN 978-3110250244
- ^ a b c A. DasGupta (2008), Asymptotic Theory of Statistics and Probability, Springer. ISBN 0387759700, ISBN 978-0387759708
- ^ Lehmann & Casella (1998), p.438
- ^ Casella & Berger (2002), p.470
- ^ a b Casella & Berger (2002), p.471
- ^ Lehmann & Casella (1998), p.437
Bibliography
[edit]- Balakrishnan, N.; Ibragimov, I. A. V. B.; Nevzorov, V. B., eds. (2001), Asymptotic Methods in Probability and Statistics with Applications, Birkhäuser, ISBN 9781461202097
- Borovkov, A. A.; Borovkov, K. A. (2010), Asymptotic Analysis of Random Walks, Cambridge University Press
- Buldygin, V. V.; Solntsev, S. (1997), Asymptotic Behaviour of Linearly Transformed Sums of Random Variables, Springer, ISBN 9789401155687
- Le Cam, Lucien; Yang, Grace Lo (2000), Asymptotics in Statistics (2nd ed.), Springer
- Casella, G.; Berger, R.L. (2002), Statistical Inference (2nd ed.), Duxbury
- Dawson, D.; Kulik, R.; Ould Haye, M.; Szyszkowicz, B.; Zhao, Y., eds. (2015), Asymptotic Laws and Methods in Stochastics, Springer-Verlag
- Höpfner, R. (2014), Asymptotic Statistics, Walter de Gruyter
- Lehmann, E.L.; Casella, G. (1998), Theory of Point Estimation (2nd ed.), Springer
- Lin'kov, Yu. N. (2001), Asymptotic Statistical Methods for Stochastic Processes, American Mathematical Society
- Oliveira, P. E. (2012), Asymptotics for Associated Random Variables, Springer
- Petrov, V. V. (1995), Limit Theorems of Probability Theory, Oxford University Press
- Sen, P. K.; Singer, J. M.; Pedroso de Lima, A. C. (2009), From Finite Sample to Asymptotic Methods in Statistics, Cambridge University Press
- Shiryaev, A. N.; Spokoiny, V. G. (2000), Statistical Experiments and Decisions: Asymptotic theory, World Scientific
- Small, C. G. (2010), Expansions and Asymptotics for Statistics, Chapman & Hall
- van der Vaart, A. W. (1998), Asymptotic Statistics, Cambridge University Press