Relative likelihood

In statistics, when selecting a statistical model for given data, the relative likelihood compares the relative plausibilities of different candidate models or of different values of a parameter of a single model.

Relative likelihood of parameter values[edit]

Assume that we are given some data $x$ for which we have a statistical model with parameter $θ$ . Suppose that the maximum likelihood estimate for $θ$ is ${\hat {\theta }}$ . Relative plausibilities of other $θ$ values may be found by comparing the likelihoods of those other values with the likelihood of ${\hat {\theta }}$ . The relative likelihood of $θ$ is defined to be^[1]^[2]^[3]^[4]^[5]

{\frac {~{\mathcal {L}}(\theta \mid x)~}{~{\mathcal {L}}({\hat {\theta }}\mid x)~}},

where ${\mathcal {L}}(\theta \mid x)$ denotes the likelihood function. Thus, the relative likelihood is the likelihood ratio with fixed denominator ${\mathcal {L}}({\hat {\theta }}\mid x)$ .

The function

\theta \mapsto {\frac {~{\mathcal {L}}(\theta \mid x)~}{~{\mathcal {L}}({\hat {\theta }}\mid x)~}}

is the relative likelihood function.

Likelihood region[edit]

A likelihood region is the set of all values of $θ$ whose relative likelihood is greater than or equal to a given threshold. In terms of percentages, a $p$ % likelihood region for $θ$ is defined to be.^[1]^[3]^[6]

\left\{\theta :{\frac {{\mathcal {L}}(\theta \mid x)}{{\mathcal {L}}({\hat {\theta \,}}\mid x)}}\geq {\frac {p}{100}}\right\}.

If $θ$ is a single real parameter, a $p$ % likelihood region will usually comprise an interval of real values. If the region does comprise an interval, then it is called a likelihood interval.^[1]^[3]^[7]

Likelihood intervals, and more generally likelihood regions, are used for interval estimation within likelihood-based statistics ("likelihoodist" statistics): They are similar to confidence intervals in frequentist statistics and credible intervals in Bayesian statistics. Likelihood intervals are interpreted directly in terms of relative likelihood, not in terms of coverage probability (frequentism) or posterior probability (Bayesianism).

Given a model, likelihood intervals can be compared to confidence intervals. If $θ$ is a single real parameter, then under certain conditions, a 14.65% likelihood interval (about 1:7 likelihood) for $θ$ will be the same as a 95% confidence interval (19/20 coverage probability).^[1]^[6] In a slightly different formulation suited to the use of log-likelihoods (see Wilks' theorem), the test statistic is twice the difference in log-likelihoods and the probability distribution of the test statistic is approximately a chi-squared distribution with degrees-of-freedom (df) equal to the difference in df-s between the two models (therefore, the $e$ ⁻² likelihood interval is the same as the 0.954 confidence interval; assuming difference in df-s to be 1).^[6]^[7]

Relative likelihood of models[edit]

The definition of relative likelihood can be generalized to compare different statistical models. This generalization is based on AIC (Akaike information criterion), or sometimes AICc (Akaike Information Criterion with correction).

Suppose that for some given data we have two statistical models, $M 1$ and $M 2$ . Also suppose that $AIC(M 1) \leq AIC(M 2)$ . Then the relative likelihood of $M 2$ with respect to $M 1$ is defined as follows.^[8]

\exp \left({\frac {\operatorname {AIC} (M_{1})-\operatorname {AIC} (M_{2})}{2}}\right)

To see that this is a generalization of the earlier definition, suppose that we have some model $M$ with a (possibly multivariate) parameter $θ$ . Then for any $θ$ , set $M 2 = M (θ)$ , and also set $M 1 = M ($ ${\hat {\theta }}$ $)$ . The general definition now gives the same result as the earlier definition.

Notes[edit]

^ ^a ^b ^c ^d Kalbfleisch, J.G. (1985), Probability and Statistical Inference, Springer, §9.3
^ Azzalini, A. (1996), Statistical Inference — Based on the likelihood, Chapman & Hall, §1.4.2, ISBN 9780412606502
^ ^a ^b ^c Sprott, D.A. (2000), Statistical Inference in Science, Springer, chap. 2
^ Davison, A.C. (2008), Statistical Models, Cambridge University Press, §4.1.2
^ Held, L.; Sabanés Bové, D.S. (2014), Applied Statistical Inference — Likelihood and Bayes, Springer, §2.1
^ ^a ^b ^c Rossi, R.J. (2018), Mathematical Statistics, Wiley, p. 267
^ ^a ^b Hudson, D.J. (1971), "Interval estimation from the likelihood function", Journal of the Royal Statistical Society, Series B, 33: 256–262
^ Burnham, K. P.; Anderson, D. R. (2002), Model Selection and Multimodel Inference: A practical information-theoretic approach, Springer, §2.8

[Kalbfleisch-1] Kalbfleisch, J.G. (1985), Probability and Statistical Inference, Springer, §9.3

[2] Azzalini, A. (1996), Statistical Inference — Based on the likelihood, Chapman & Hall, §1.4.2, ISBN 9780412606502

[Sprott-3] Sprott, D.A. (2000), Statistical Inference in Science, Springer, chap. 2

[4] Davison, A.C. (2008), Statistical Models, Cambridge University Press, §4.1.2

[5] Held, L.; Sabanés Bové, D.S. (2014), Applied Statistical Inference — Likelihood and Bayes, Springer, §2.1

[Rossi2018-6] Rossi, R.J. (2018), Mathematical Statistics, Wiley, p. 267

[Hudson-7] Hudson, D.J. (1971), "Interval estimation from the likelihood function", Journal of the Royal Statistical Society, Series B, 33: 256–262

[8] Burnham, K. P.; Anderson, D. R. (2002), Model Selection and Multimodel Inference: A practical information-theoretic approach, Springer, §2.8

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

Relative likelihood of parameter values[edit]

Likelihood region[edit]

Relative likelihood of models[edit]

See also[edit]

Notes[edit]