# Lottery mathematics

Lottery mathematics is used to calculate probabilities of winning or losing a lottery game. It is based heavily on combinatorics, particularly the twelvefold way and combinations without replacement.

## Choosing 6 from 49

In a typical 6/49 game, each player chooses six distinct numbers from a range of 1 to 49. If the six numbers on a ticket match the numbers drawn by the lottery, the ticket holder is a jackpot winner, regardless of the order of the numbers. The probability of this happening is 1 in 13,983,816.

The chance of winning can be demonstrated as follows: The first number drawn has a 1 in 49 chance of matching. When the draw comes to the second number, there are now only 48 balls left in the bag, because the balls are drawn without replacement. So there is now a 1 in 48 chance of predicting this number.

Thus for each of the 49 ways of choosing the first number there are 48 different ways of choosing the second. This means that the probability of correctly predicting 2 numbers drawn from 49 in the correct order is 1 in 49 × 48. On drawing the third number there are only 47 ways of choosing the number, but we could have arrived at this point in any of 49 × 48 ways, so the chance of correctly predicting 3 numbers drawn from 49, again in the correct order, is 1 in 49 × 48 × 47. This continues until the sixth number has been drawn, giving the final calculation, 49 × 48 × 47 × 46 × 45 × 44, which can also be written as ${\displaystyle {49! \over (49-6)!}}$, or 49 factorial divided by 43 factorial. This works out to 10,068,347,520, which is much larger than the roughly 14 million stated above.

However, the order of the 6 numbers is not significant. That is, if a ticket has the numbers 1, 2, 3, 4, 5, and 6, it wins as long as all the numbers 1 through 6 are drawn, no matter what order they come out in. Accordingly, given any set of 6 numbers, there are 6 × 5 × 4 × 3 × 2 × 1 = 6! = 720 orders in which they could be drawn. Dividing 10,068,347,520 by 720 gives 13,983,816, also written as ${\displaystyle {49! \over 6!\,(49-6)!}}$, or more generally as

${\displaystyle {n \choose k}={n! \over k!(n-k)!}}$, where n is the number of alternatives and k is the number of choices. Further information is available at binomial coefficient and multinomial coefficient.

This function is called the combination function; in Microsoft Excel, it is implemented as COMBIN(n, k). For example, COMBIN(49, 6), the calculation shown above, returns 13,983,816. For the rest of this article, we will use the notation ${\displaystyle {n \choose k}}$. "Combination" means the group of numbers selected, irrespective of the order in which they are drawn.
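The derivation above can be checked numerically. The following is a minimal sketch in Python, assuming Python 3.8 or later (where `math.perm` and `math.comb` are available); the variable names are illustrative only.

```python
import math

# Ordered draws of 6 balls from 49 without replacement: 49 * 48 * ... * 44.
ordered = math.perm(49, 6)

# Each set of 6 numbers can be drawn in 6! different orders.
orderings = math.factorial(6)

# Order does not matter for the jackpot, so divide out the orderings.
combinations = ordered // orderings

print(ordered)       # 10068347520
print(orderings)     # 720
print(combinations)  # 13983816

# The built-in binomial coefficient gives the same count.
print(combinations == math.comb(49, 6))  # True
```

`math.comb(49, 6)` plays the same role here as COMBIN(49, 6) in a spreadsheet.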

An alternative method of calculating the odds is to note that the probability of the first ball corresponding to one of the six chosen is 6/49; the probability of the second ball corresponding to one of the remaining five chosen is 5/48; and so on. This yields a final formula of

${\displaystyle {n \choose k}={49 \choose 6}={49 \over 6}\times {48 \over 5}\times {47 \over 4}\times {46 \over 3}\times {45 \over 2}\times {44 \over 1}}$
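The ball-by-ball method can be sketched with exact rational arithmetic. This is an illustrative example only; the function name `jackpot_probability` is an assumption introduced here, not part of any library.

```python
from fractions import Fraction
import math


def jackpot_probability(n: int, k: int) -> Fraction:
    """Multiply k/n * (k-1)/(n-1) * ... * 1/(n-k+1) exactly.

    Each factor is the chance that the next ball drawn is one of the
    player's numbers that has not yet been matched.
    """
    prob = Fraction(1)
    for i in range(k):
        prob *= Fraction(k - i, n - i)
    return prob


p = jackpot_probability(49, 6)
print(p)  # 1/13983816

# The reciprocal is the product 49/6 * 48/5 * ... * 44/1, i.e. C(49, 6).
print(1 / p == math.comb(49, 6))  # True
```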

The range of possible combinations for a given lottery can be referred to as the "number space". "Coverage" is the percentage of a lottery's number space that is in play for a given drawing.

## Odds of getting other possibilities in choosing 6 from 49

One must divide the number of combinations producing the given result by the total number of possible combinations (for example, ${\displaystyle {49 \choose 6}=13,983,816}$ ). The numerator equates to the number of ways to select the winning numbers multiplied by the number of ways to select the losing numbers.

For a score of n (for example, if 3 choices match three of the 6 balls drawn, then n = 3), ${\displaystyle {6 \choose n}}$ is the number of ways of selecting n winning numbers from the 6 winning numbers. The ticket then carries 6 - n losing numbers, which are chosen from the 43 losing numbers in ${\displaystyle {43 \choose 6-n}}$ ways. The total number of combinations giving that result is, as stated above, the first number multiplied by the second. The probability is therefore ${\displaystyle {6 \choose n}{43 \choose 6-n} \over {49 \choose 6}}$.

This can be written in a general form for all lotteries as:

${\displaystyle {K \choose B}{N-K \choose K-B} \over {N \choose K}}$

where ${\displaystyle N}$ is the number of balls in the lottery, ${\displaystyle K}$ is the number of balls chosen on a single ticket (and drawn), and ${\displaystyle B}$ is the number of matching balls on the ticket.

The generalisation of this formula is called the hypergeometric distribution.

This gives the following results:

| Score | Calculation | Exact probability | Approximate decimal probability | Approximate 1/probability |
|---|---|---|---|---|
| 0 | ${\displaystyle {6 \choose 0}{43 \choose 6} \over {49 \choose 6}}$ | 435,461/998,844 | 0.436 | 2.2938 |
| 1 | ${\displaystyle {6 \choose 1}{43 \choose 5} \over {49 \choose 6}}$ | 68,757/166,474 | 0.413 | 2.4212 |
| 2 | ${\displaystyle {6 \choose 2}{43 \choose 4} \over {49 \choose 6}}$ | 44,075/332,948 | 0.132 | 7.5541 |
| 3 | ${\displaystyle {6 \choose 3}{43 \choose 3} \over {49 \choose 6}}$ | 8,815/499,422 | 0.0177 | 56.66 |
| 4 | ${\displaystyle {6 \choose 4}{43 \choose 2} \over {49 \choose 6}}$ | 645/665,896 | 0.000969 | 1,032.4 |
| 5 | ${\displaystyle {6 \choose 5}{43 \choose 1} \over {49 \choose 6}}$ | 43/2,330,636 | 0.0000184 | 54,200.8 |
| 6 | ${\displaystyle {6 \choose 6}{43 \choose 0} \over {49 \choose 6}}$ | 1/13,983,816 | 0.0000000715 | 13,983,816 |
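The rows of this table can be reproduced with a short script. The sketch below assumes Python 3.8 or later and uses an illustrative function name, `match_probability`, with the symbols N, K and B as defined for the general formula.

```python
from fractions import Fraction
from math import comb


def match_probability(N: int, K: int, B: int) -> Fraction:
    """Probability of matching exactly B of the K drawn balls in an N-ball lottery."""
    winning_ways = comb(K, B)          # ways to pick B of the K drawn numbers
    losing_ways = comb(N - K, K - B)   # ways to pick the rest from undrawn numbers
    return Fraction(winning_ways * losing_ways, comb(N, K))


# Print exact and approximate values for every score in a 6/49 game.
for score in range(7):
    p = match_probability(49, 6, score)
    print(score, p, f"{float(p):.3g}", f"1 in {float(1 / p):,.4f}")

# The seven scores are mutually exclusive and exhaustive, so they sum to 1.
total = sum(match_probability(49, 6, s) for s in range(7))
print(total == 1)  # True
```

Using `Fraction` keeps the results exact, so the reduced fractions can be compared directly with the "exact probability" column.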

When a bonus number is included, the adjusted odds are:[1]

| Score | Calculation | Exact probability | Approximate decimal probability | Approximate 1/probability |
|---|---|---|---|---|
| 5, bonus not won | ${\displaystyle {6 \choose 5}{(43-1) \choose 1} \over {49 \choose 6}}$ | 3/166,474 | 0.0000180208 | 55,491.33 |
| 5, bonus won | ${\displaystyle {6 \choose 5}{(43-1) \choose (1-1)} \over {49 \choose 6}}$ | 1/2,330,636 | 0.0000004291 | 2,330,636 |

## Powerballs and bonus balls

Many lotteries have a powerball (or "bonus ball"). If the powerball is drawn from a pool of numbers separate from the main lottery, the odds against winning are multiplied by the number of balls in the powerball pool. For example, in the 6 from 49 lottery with 10 powerball numbers, the odds of getting a score of 3 and the powerball would be 1 in 56.66 × 10, or 1 in 566.6 (the probability is divided by 10, giving an exact value of ${\textstyle {\frac {8815}{4994220}}}$). Another example of such a game is Mega Millions, albeit with different jackpot odds.

Where more than 1 powerball is drawn from a separate pool of balls to the main lottery (for example, in the EuroMillions game), the odds of the different possible powerball matching scores are calculated using the method shown in the "other scores" section above (in other words, the powerballs are like a mini-lottery in their own right), and then multiplied by the odds of achieving the required main-lottery score.
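As a rough sketch of this "mini-lottery" approach, the example below multiplies two independent match probabilities. The game parameters (5 main numbers from 50 and 2 powerballs from 12) are assumptions chosen only for illustration and are not taken from the text; the function names are likewise hypothetical.

```python
from fractions import Fraction
from math import comb


def match_probability(pool: int, picks: int, matches: int) -> Fraction:
    """Probability of matching exactly `matches` of `picks` numbers drawn from `pool`."""
    ways = comb(picks, matches) * comb(pool - picks, picks - matches)
    return Fraction(ways, comb(pool, picks))


def combined_probability(main: tuple, bonus: tuple) -> Fraction:
    """Multiply the main-draw and powerball-draw probabilities.

    Each argument is a (pool, picks, matches) tuple; the two draws use
    separate pools, so they are treated as independent.
    """
    return match_probability(*main) * match_probability(*bonus)


# Hypothetical game: match all 5 of 50 main numbers and both of 2 powerballs from 12.
p = combined_probability((50, 5, 5), (12, 2, 2))
print(p)  # 1/139838160
```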

If the powerball is drawn from the same pool of numbers as the main lottery, then, for a given target score, the number of winning combinations includes the powerball. For games based on the Canadian lottery (such as the lottery of the United Kingdom), after the 6 main balls are drawn, an extra ball is drawn from the same pool of balls, and this becomes the powerball (or "bonus ball"). An extra prize is given for matching 5 balls and the bonus ball. As described in the "other scores" section above, the number of ways one can obtain a score of 5 from a single ticket is ${\textstyle {6 \choose 5}{43 \choose 1}=258}$. Since the number of remaining balls is 43, and the ticket has 1 unmatched number remaining, 1/43 of these 258 combinations will match the next ball drawn (the powerball), leaving 258/43 = 6 ways of achieving it. Therefore, the odds of getting a score of 5 and the powerball are ${\textstyle {6 \over {49 \choose 6}}={1 \over 2,330,636}}$.

Of the 258 combinations that match 5 of the main 6 balls, in 42/43 of them the remaining number will not match the powerball, giving odds of ${\textstyle {{258\cdot {\frac {42}{43}}} \over {49 \choose 6}}={\frac {3}{166,474}}\approx 1.802\times 10^{-5}}$ for obtaining a score of 5 without matching the powerball.

Using the same principle, the odds of getting a score of 2 and the powerball are found by taking the ${\textstyle {6 \choose 2}{43 \choose 4}=1,\!851,\!150}$ ways of scoring 2 and multiplying by the probability that one of the remaining four numbers on the ticket matches the bonus ball, which is 4/43. Since ${\textstyle 1,\!851,\!150\cdot {\frac {4}{43}}=172,\!200}$, the probability of obtaining a score of 2 and the bonus ball is ${\textstyle {\frac {172,200}{49 \choose 6}}={\frac {1025}{83237}}\approx 1.231\%}$, or approximately 1 in 81.2.

The general formula for matching ${\displaystyle B}$ balls and the bonus ball in an ${\displaystyle N}$ choose ${\displaystyle K}$ lottery, where one bonus ball is drawn from the same pool of ${\displaystyle N}$ balls, is:

${\displaystyle {\frac {{\frac {K-B}{N-K}}{K \choose B}{N-K \choose K-B}}{N \choose K}}}$

The general formula for matching ${\displaystyle B}$ balls but not the bonus ball in an ${\displaystyle N}$ choose ${\displaystyle K}$ lottery, where one bonus ball is drawn from the same pool of ${\displaystyle N}$ balls, is:

${\displaystyle {N-K-K+B \over N-K}{K \choose B}{N-K \choose K-B} \over {N \choose K}}$
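The two same-pool formulas can be checked against the worked 6/49 figures. This is a minimal sketch with illustrative function names; it assumes exactly one bonus ball drawn from the balls remaining after the main draw.

```python
from fractions import Fraction
from math import comb


def base_probability(N: int, K: int, B: int) -> Fraction:
    """Probability of matching exactly B of the K main balls."""
    return Fraction(comb(K, B) * comb(N - K, K - B), comb(N, K))


def score_with_bonus(N: int, K: int, B: int) -> Fraction:
    """Match B main balls AND the bonus ball (bonus drawn from the same pool)."""
    # K - B unmatched ticket numbers remain among the N - K undrawn balls.
    return Fraction(K - B, N - K) * base_probability(N, K, B)


def score_without_bonus(N: int, K: int, B: int) -> Fraction:
    """Match B main balls but NOT the bonus ball."""
    return Fraction(N - 2 * K + B, N - K) * base_probability(N, K, B)


print(score_with_bonus(49, 6, 5))     # 1/2330636
print(score_without_bonus(49, 6, 5))  # 3/166474
print(score_with_bonus(49, 6, 2))     # 1025/83237
```

The two cases partition each score, so for any B their sum equals the plain probability of matching B main balls.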

The general formula for matching ${\displaystyle B}$ balls and the bonus ball in an ${\displaystyle N}$ choose ${\displaystyle K}$ lottery, where one bonus ball is drawn from a separate pool of ${\displaystyle P}$ balls, is:

${\displaystyle {1 \over P}{K \choose B}{N-K \choose K-B} \over {N \choose K}}$

The general formula for matching ${\displaystyle B}$ balls but not the bonus ball in an ${\displaystyle N}$ choose ${\displaystyle K}$ lottery, where one bonus ball is drawn from a separate pool of ${\displaystyle P}$ balls, is:

${\displaystyle {P-1 \over P}{K \choose B}{N-K \choose K-B} \over {N \choose K}}$
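The separate-pool formulas can be sketched in the same way. The function names below are illustrative assumptions; the check uses the 6 from 49 example with 10 powerball numbers described earlier in this section.

```python
from fractions import Fraction
from math import comb


def base_probability(N: int, K: int, B: int) -> Fraction:
    """Probability of matching exactly B of the K main balls."""
    return Fraction(comb(K, B) * comb(N - K, K - B), comb(N, K))


def score_with_powerball(N: int, K: int, B: int, P: int) -> Fraction:
    """Match B main balls AND the single powerball drawn from a separate pool of P."""
    return Fraction(1, P) * base_probability(N, K, B)


def score_without_powerball(N: int, K: int, B: int, P: int) -> Fraction:
    """Match B main balls but NOT the powerball."""
    return Fraction(P - 1, P) * base_probability(N, K, B)


p = score_with_powerball(49, 6, 3, 10)
print(p == Fraction(8815, 4994220))  # True
print(f"1 in {float(1 / p):.1f}")
```

Note that `Fraction` reduces its result to lowest terms, so the printed fraction may look different from 8815/4994220 while being equal to it.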

## Minimum number of tickets for a match

It is a hard (and often open) problem to calculate the minimum number of tickets one needs to purchase to guarantee that at least one of these tickets matches at least 2 numbers. In the 5-from-90 lotto, the minimum number of tickets that can guarantee a ticket with at least 2 matches is 100.[2]

## Information theoretic results

Because a lottery is a discrete probability space, every particular lottery outcome is atomic, meaning its probability is greater than zero. Therefore, the probability of any event is the sum of the probabilities of the outcomes that make up the event. This makes it easy to calculate quantities of interest from information theory. For example, the information content of any event is given by the formula

${\displaystyle \operatorname {I} (E):=-\log {\left[\Pr {\left(E\right)}\right]}=-\log {\left(P\right)}.}$

In particular, the information content of outcome ${\displaystyle x}$ of discrete random variable ${\displaystyle X}$ is

${\displaystyle \operatorname {I} _{X}(x):=-\log {\left[p_{X}{\left(x\right)}\right]}=\log {\left({\frac {1}{p_{X}{\left(x\right)}}}\right)}.}$

For example, winning in the example § Choosing 6 from 49 above is a Bernoulli-distributed random variable ${\displaystyle X}$ with a 1/13,983,816 chance of winning ("success"). We write ${\textstyle X\sim \mathrm {Bernoulli} \!\left(p\right)=\mathrm {B} \!\left(1,p\right)}$ with ${\textstyle p={\tfrac {1}{13,983,816}}}$ and ${\textstyle q={\tfrac {13,983,815}{13,983,816}}}$. The information content of winning is

${\displaystyle \operatorname {I} _{X}({\text{win}})=-\log _{2}{p_{X}{({\text{win}})}}=-\log _{2}\!{\tfrac {1}{13,983,816}}\approx 23.73725}$
shannons or bits of information. (See units of information for further explanation of terminology.) The information content of losing is

${\displaystyle {\begin{aligned}\operatorname {I} _{X}({\text{lose}})&=-\log _{2}{p_{X}{({\text{lose}})}}=-\log _{2}\!{\tfrac {13,983,815}{13,983,816}}\\&\approx 1.0317\times 10^{-7}{\text{ shannons}}.\end{aligned}}}$
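Both information contents can be reproduced numerically. The sketch below is illustrative and assumes Python 3.8 or later; it uses `math.log1p` for the losing case because the probability of losing is so close to 1 that subtracting it directly would lose floating-point precision.

```python
import math

# Probability of winning the 6/49 jackpot with one ticket.
p_win = 1 / math.comb(49, 6)

# Information content of winning, in shannons (bits).
info_win = -math.log2(p_win)

# Information content of losing: -log2(1 - p_win).
# log1p(-p) computes ln(1 - p) accurately for tiny p; divide by ln 2 for base 2.
info_lose = -math.log1p(-p_win) / math.log(2)

print(f"{info_win:.5f}")   # 23.73725
print(f"{info_lose:.4e}")  # 1.0317e-07
```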

The information entropy of a lottery probability distribution is also easy to calculate as the expected value of the information content.

${\displaystyle {\begin{alignedat}{2}\mathrm {H} (X)&=\sum _{x}{-p_{X}{\left(x\right)}\log {p_{X}{\left(x\right)}}}\ &=\sum _{x}{p_{X}{\left(x\right)}\operatorname {I} _{X}(x)}\\&{\overset {\underset {\mathrm {def} }{}}{=}}\ \mathbb {E} {\left[\operatorname {I} _{X}(x)\right]}\end{alignedat}}}$

Oftentimes the random variable of interest in the lottery is a Bernoulli trial. In this case, the Bernoulli entropy function may be used. Using ${\displaystyle X}$ representing winning the 6-of-49 lottery, the Shannon entropy of 6-of-49 above is

${\displaystyle {\begin{aligned}\mathrm {H} (X)&=-p\log _{2}(p)-q\log _{2}(q)=-{\tfrac {1}{13,983,816}}\log _{2}\!{\tfrac {1}{13,983,816}}-{\tfrac {13,983,815}{13,983,816}}\log _{2}\!{\tfrac {13,983,815}{13,983,816}}\\&\approx 1.80065\times 10^{-6}{\text{ shannons.}}\end{aligned}}}$
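The entropy value can be verified with a few lines of code. This is a minimal sketch; `bernoulli_entropy_bits` is a hypothetical helper name, and base-2 logarithms are assumed so that the result is in shannons.

```python
import math


def bernoulli_entropy_bits(p: float) -> float:
    """Shannon entropy, in bits, of a Bernoulli trial with success probability p."""
    if p in (0.0, 1.0):
        return 0.0  # a certain outcome carries no information
    q = 1.0 - p
    # log1p(-p) evaluates ln(q) accurately when p is very small.
    log2_q = math.log1p(-p) / math.log(2)
    return -(p * math.log2(p) + q * log2_q)


p_win = 1 / math.comb(49, 6)
entropy = bernoulli_entropy_bits(p_win)
print(f"{entropy:.5e}")
```

For comparison, `bernoulli_entropy_bits(0.5)` returns 1.0, the maximum for a Bernoulli trial, which shows how little uncertainty a single 6/49 ticket carries.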

## References

1. Zabrocki, Mike (2003-03-01). "Calculating the Probabilities of Winning Lotto 6/49, Version 3" (PDF). Retrieved 2016-08-14.
2. Füredi, Z.; Székely, G. J.; Zubor, Z. (1996). "On the lottery problem". Journal of Combinatorial Designs. 4 (1): 5–10. doi:10.1002/(sici)1520-6610(1996)4:1<5::aid-jcd2>3.3.co;2-w.