Bayes' rule
In probability theory and applications, Bayes' rule relates the odds of event A1 to event A2, before and after conditioning on event B. The relationship is expressed in terms of the Bayes factor, Λ. Bayes' rule is derived from and closely related to Bayes' theorem. Bayes' rule may be preferred over Bayes' theorem when the relative probability (that is, the odds) of two events matters, but the individual probabilities do not. This is because in Bayes' rule, P(B) is eliminated and need not be calculated (see Derivation). It is commonly used in science and engineering, notably for model selection.
Under the frequentist interpretation of probability, Bayes' rule is a general relationship between O(A1:A2) and O(A1:A2 | B), for any events A1, A2 and B in the same event space. In this case, Λ represents the impact of the conditioning on the odds.
Under the Bayesian interpretation of probability, Bayes' rule relates the odds on probability models A1 and A2 before and after evidence B is observed. In this case, Λ represents the impact of the evidence on the odds. This is a form of Bayesian inference - the quantity O(A1:A2) is called the prior odds, and O(A1:A2 | B) the posterior odds. By analogy to the prior and posterior probability terms in Bayes' theorem, Bayes' rule can be seen as Bayes' theorem in odds form. For more detail on the application of Bayes' rule under the Bayesian interpretation of probability, see Bayesian model selection.
Contents |
[edit] The rule
[edit] Single event
Given events A1, A2 and B, Bayes' rule states that the conditional odds of A1:A2 given B are equal to the marginal odds of A1:A2 multiplied by the Bayes factor Λ:
where
In the special case that A1 = A and
, this may be written as
[edit] Multiple events
Bayes' rule may be conditioned on an arbitrary number of events. For two events B and C,
where
In this special case, the equivalent notation is
[edit] Derivation
Consider two instances of Bayes' theorem:
Combining these gives
Now defining
this implies
A similar derivation applies for conditioning on multiple events, using the appropriate extension of Bayes' theorem
[edit] Examples
[edit] Frequentist example
Consider the drug testing example in the article on Bayes' theorem.
The same results may be obtained using Bayes' rule. The prior odds on an individual being a drug-user are 199 to 1 against, as
and
. The Bayes factor when an individual tests positive is
in favour of being a drug-user: this is the ratio of the probability of a drug-user testing positive, to the probability of a non-drug user testing positive. The posterior odds on being a drug user are therefore
, which is very close to
. In round numbers, only one in three of those testing positive are actually drug-users.
[edit] Model selection
| This article does not cite any references or sources. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. (October 2011) |
[edit] External links
- The on-line textbook: Information Theory, Inference, and Learning Algorithms, by David J.C. MacKay, discusses Bayesian model comparison in Chapters 3 and 28.













