# Kelly criterion

In probability theory and intertemporal portfolio choice, the Kelly criterion, Kelly strategy, Kelly formula, or Kelly bet, is a formula used to determine the optimal size of a series of bets. In most gambling scenarios, and some investing scenarios under some simplifying assumptions, the Kelly strategy will do better than any essentially different strategy in the long run (that is, over a span of time in which the observed fraction of bets that are successful equals the probability that any given bet will be successful). It was described by J. L. Kelly, Jr in 1956.[1] The practical use of the formula has been demonstrated.[2][3][4]

Although the Kelly strategy's promise of doing better than any other strategy in the long run seems compelling, some economists have argued strenuously against it, mainly because an individual's specific investing constraints may override the desire for optimal growth rate.[5] The conventional alternative is expected utility theory which says bets should be sized to maximize the expected utility of the outcome (to an individual with logarithmic utility, the Kelly bet maximizes expected utility, so there is no conflict; moreover, Kelly's original paper clearly states the need for a utility function in the case of gambling games which are played finitely many times[1]). Even Kelly supporters usually argue for fractional Kelly (betting a fixed fraction of the amount recommended by Kelly) for a variety of practical reasons, such as wishing to reduce volatility, or protecting against non-deterministic errors in their advantage (edge) calculations.[6]

In recent years, Kelly has become a part of mainstream investment theory[7] and the claim has been made that well-known successful investors including Warren Buffett[8] and Bill Gross[9] use Kelly methods. William Poundstone wrote an extensive popular account of the history of Kelly betting.[5]

## Statement

For simple bets with two outcomes, one involving losing the entire amount bet, and the other involving winning the bet amount multiplied by the payoff odds, the Kelly bet is:

${\displaystyle f^{*}={\frac {bp-q}{b}}={\frac {p(b+1)-1}{b}},\!}$

where:

• f* is the fraction of the current bankroll to wager, i.e. how much to bet;
• b is the net odds received on the wager ("b to 1"); that is, you could win $b (on top of getting back your$1 wagered) for a $1 bet • p is the probability of winning; • q is the probability of losing, which is 1 − p. As an example, if a gamble has a 60% chance of winning (p = 0.60, q = 0.40), and the gambler receives 1-to-1 odds on a winning bet (b = 1), then the gambler should bet 20% of his bankroll at each opportunity (f* = 0.20), in order to maximize the long-run growth rate of the bankroll. If the gambler has zero edge, i.e. if b = q / p, then the criterion recommends the gambler bets nothing. If the edge is negative (b < q / p) the formula gives a negative result, indicating that the gambler should take the other side of the bet. For example, in standard American roulette, the bettor is offered an even money payoff (b = 1) on red, when there are 18 red numbers and 20 non-red numbers on the wheel (p = 18/38). The Kelly bet is -1/19, meaning the gambler should bet one-nineteenth of his bankroll that red will not come up. Unfortunately, the casino doesn't allow betting against something coming up, so a Kelly gambler cannot place a bet. The top of the first fraction is the expected net winnings from a$1 bet, since the two outcomes are that you either win $b with probability p, or lose the$1 wagered, i.e. win $-1, with probability q. Hence: ${\displaystyle f^{*}={\frac {\text{expected net winnings}}{\text{net winnings if you win}}}\!}$ For even-money bets (i.e. when b = 1), the first formula can be simplified to: ${\displaystyle f^{*}=p-q.\!}$ Since q = 1-p, this simplifies further to ${\displaystyle f^{*}=2p-1.\!}$ A more general problem relevant for investment decisions is the following: 1. The probability of success is ${\displaystyle p}$. 2. If you succeed, the value of your investment increases from ${\displaystyle 1}$ to ${\displaystyle 1+b}$. 3. If you fail (for which the probability is ${\displaystyle q=1-p}$) the value of your investment decreases from ${\displaystyle 1}$ to ${\displaystyle 1-a}$. (Note that the previous description above assumes that a is 1). In this case, the Kelly criterion turns out to be the relatively simple expression ${\displaystyle f^{*}=p/a-q/b.\!}$ Note that this reduces to the original expression for the special case above (${\displaystyle f^{*}=p-q}$) for ${\displaystyle b=a=1}$. Clearly, in order to decide in favor of investing at least a small amount ${\displaystyle (f^{*}>0)}$, you must have ${\displaystyle pb>qa.\!}$ which obviously is nothing more than the fact that your expected profit must exceed the expected loss for the investment to make any sense. The general result clarifies why leveraging (taking a loan to invest) decreases the optimal fraction to be invested, as in that case ${\displaystyle a>1}$. Obviously, no matter how large the probability of success, ${\displaystyle p}$, is, if ${\displaystyle a}$ is sufficiently large, the optimal fraction to invest is zero. Thus, using too much margin is not a good investment strategy, no matter how good an investor you are. ## Proof Heuristic proofs of the Kelly criterion are straightforward.[10] For a symbolic verification with Python and SymPy one would set the derivative y'(x) of the expected value of the logarithmic bankroll y(x) to 0 and solve for x: >>> from sympy import * >>> x,b,p = symbols('x b p') >>> y = p*log(1+b*x) + (1-p)*log(1-x) >>> solve(diff(y,x), x) [-(1 - p - b*p)/b]  For a rigorous and general proof, see Kelly's original paper[1] or some of the other references listed below. Some corrections have been published.[11] We give the following non-rigorous argument for the case b = 1 (a 50:50 "even money" bet) to show the general idea and provide some insights.[1] When b = 1, the Kelly bettor bets 2p - 1 times initial wealth, W, as shown above. If he wins, he has 2pW. If he loses, he has 2(1 - p)W. Suppose he makes N bets like this, and wins K of them. The order of the wins and losses doesn't matter, he will have: ${\displaystyle 2^{N}p^{K}(1-p)^{N-K}W\!.}$ Suppose another bettor bets a different amount, (2p - 1 + ${\displaystyle \Delta }$)W for some positive or negative ${\displaystyle \Delta }$. He will have (2p + ${\displaystyle \Delta }$)W after a win and [2(1 - p)- ${\displaystyle \Delta }$]W after a loss. After the same wins and losses as the Kelly bettor, he will have: ${\displaystyle (2p+\Delta )^{K}[2(1-p)-\Delta ]^{N-K}W\!}$ Take the derivative of this with respect to ${\displaystyle \Delta }$ and get: ${\displaystyle K(2p+\Delta )^{K-1}[2(1-p)-\Delta ]^{N-K}W-(N-K)(2p+\Delta )^{K}[2(1-p)-\Delta ]^{N-K-1}W\!}$ The turning point of the original function occurs when this derivative equals zero, which occurs at: ${\displaystyle K[2(1-p)-\Delta ]=(N-K)(2p+\Delta )\!}$ which implies: ${\displaystyle \Delta =2({\frac {K}{N}}-p)\!}$ but: ${\displaystyle \lim _{N\to +\infty }{\frac {K}{N}}=p\!}$ so in the long run, final wealth is maximized by setting ${\displaystyle \Delta }$ to zero, which means following the Kelly strategy. This illustrates that Kelly has both a deterministic and a stochastic component. If one knows K and N and wishes to pick a constant fraction of wealth to bet each time (otherwise one could cheat and, for example, bet zero after the Kth win knowing that the rest of the bets will lose), one will end up with the most money if one bets: ${\displaystyle \left(2{\frac {K}{N}}-1\right)W\!}$ each time. This is true whether N is small or large. The "long run" part of Kelly is necessary because K is not known in advance, just that as N gets large, K will approach pN. Someone who bets more than Kelly can do better if K > pN for a stretch; someone who bets less than Kelly can do better if K < pN for a stretch, but in the long run, Kelly always wins. The heuristic proof for the general case proceeds as follows.[citation needed] In a single trial, if you invest the fraction ${\displaystyle f}$ of your capital, if your strategy succeeds, your capital at the end of the trial increases by the factor ${\displaystyle 1-f+f(1+b)=1+fb}$, and, likewise, if the strategy fails, you end up having your capital decreased by the factor ${\displaystyle 1-fa}$. Thus at the end of ${\displaystyle N}$ trials (with ${\displaystyle pN}$ successes and ${\displaystyle qN}$ failures ), the starting capital of$1 yields

${\displaystyle C_{N}=(1+fb)^{pN}(1-fa)^{qN}.}$

Maximizing ${\displaystyle \log(C_{N})/N}$, and consequently ${\displaystyle C_{N}}$, with respect to ${\displaystyle f}$ leads to the desired result

${\displaystyle f^{*}=p/a-q/b.}$

For a more detailed discussion of this formula for the general case, see.[12] There, it can be seen that the substitution of ${\displaystyle p}$ for the ratio of the number of "successes" to the number of trials implies that the number of trials must be very large, since ${\displaystyle p}$ is defined as the limit of this ratio as the number of trials goes to infinity. In brief, betting ${\displaystyle f^{*}}$ each time will likely maximize the wealth growth rate only in the case where the number of trials is very large, and ${\displaystyle p}$ and ${\displaystyle b}$ are the same for each trial. In practice, this is a matter of playing the same game over and over, where the probability of winning and the payoff odds are always the same. In the heuristic proof above, ${\displaystyle pN}$ successes and ${\displaystyle qN}$ failures are highly likely only for very large ${\displaystyle N}$.

## Bernoulli

In a 1738 article, Daniel Bernoulli suggested that, when one has a choice of bets or investments, one should choose that with the highest geometric mean of outcomes. This is mathematically equivalent to the Kelly criterion, although the motivation is entirely different (Bernoulli wanted to resolve the St. Petersburg paradox).

The Bernoulli article was not translated into English until 1954,[13] but the work was well-known among mathematicians and economists.

## Multiple horses

Kelly's criterion may be generalized [14] on gambling on many mutually exclusive outcomes, like in horse races. Suppose there are several mutually exclusive outcomes. The probability that the k-th horse wins the race is ${\displaystyle p_{k}}$, the total amount of bets placed on k-th horse is ${\displaystyle B_{k}}$, and

${\displaystyle \beta _{k}={\frac {B_{k}}{\sum _{i}B_{i}}}={\frac {1}{1+Q_{k}}},}$

where ${\displaystyle Q_{k}}$ are the pay-off odds. ${\displaystyle D=1-tt}$, is the dividend rate where ${\displaystyle tt}$ is the track take or tax, ${\displaystyle {\frac {D}{\beta _{k}}}}$ is the revenue rate after deduction of the track take when k-th horse wins. The fraction of the bettor's funds to bet on k-th horse is ${\displaystyle f_{k}}$. Kelly's criterion for gambling with multiple mutually exclusive outcomes gives an algorithm for finding the optimal set ${\displaystyle S^{o}}$ of outcomes on which it is reasonable to bet and it gives explicit formula for finding the optimal fractions ${\displaystyle f_{k}^{o}}$ of bettor's wealth to be bet on the outcomes included in the optimal set ${\displaystyle S^{o}}$. The algorithm for the optimal set of outcomes consists of four steps.[14]

Step 1 Calculate the expected revenue rate for all possible (or only for several of the most promising) outcomes: ${\displaystyle er_{k}={\frac {D}{\beta _{k}}}p_{k}=D(1+Q_{k})p_{k}.}$

Step 2 Reorder the outcomes so that the new sequence ${\displaystyle er_{k}}$ is non-increasing. Thus ${\displaystyle er_{1}}$ will be the best bet.

Step 3 Set ${\displaystyle S=\varnothing }$ (the empty set), ${\displaystyle k=1}$, ${\displaystyle R(S)=1}$. Thus the best bet ${\displaystyle er_{k}=er_{1}}$ will be considered first.

Step 4 Repeat:

If ${\displaystyle er_{k}={\frac {D}{\beta _{k}}}p_{k}>R(S)}$ then insert k-th outcome into the set: ${\displaystyle S=S\cup \{k\}}$, recalculate ${\displaystyle R(S)}$ according to the formula: ${\displaystyle R(S)={\frac {1-\sum _{i\in S}{p_{i}}}{1-\sum _{i\in S}{\frac {\beta _{i}}{D}}}}}$ and then set ${\displaystyle k=k+1}$,

Else set ${\displaystyle S^{o}=S}$ and then stop the repetition.

If the optimal set ${\displaystyle S^{o}}$ is empty then do not bet at all. If the set ${\displaystyle S^{o}}$ of optimal outcomes is not empty then the optimal fraction ${\displaystyle f_{k}^{o}}$ to bet on k-th outcome may be calculated from this formula: ${\displaystyle f_{k}^{o}={\frac {er_{k}-R(S^{o})}{\frac {D}{\beta _{k}}}}=p_{k}-{\frac {R(S^{o})}{\frac {D}{\beta _{k}}}}}$.

One may prove[14] that

${\displaystyle R(S^{o})=1-\sum _{i\in S^{o}}{f_{i}^{o}}}$

where the right hand-side is the reserve rate[clarification needed]. Therefore the requirement ${\displaystyle er_{k}={\frac {D}{\beta _{k}}}p_{k}>R(S)}$ may be interpreted[14] as follows: k-th outcome is included in the set ${\displaystyle S^{o}}$ of optimal outcomes if and only if its expected revenue rate is greater than the reserve rate. The formula for the optimal fraction ${\displaystyle f_{k}^{o}}$ may be interpreted as the excess of the expected revenue rate of k-th horse over the reserve rate divided by the revenue after deduction of the track take when k-th horse wins or as the excess of the probability of k-th horse winning over the reserve rate divided by revenue after deduction of the track take when k-th horse wins. The binary growth exponent is

${\displaystyle G^{o}=\sum _{i\in S}{p_{i}\log _{2}{(er_{i})}}+(1-\sum _{i\in S}{p_{i}})\log _{2}{(R(S^{o}))},}$

and the doubling time is

${\displaystyle T_{d}={\frac {1}{G^{o}}}.}$

This method of selection of optimal bets may be applied also when probabilities ${\displaystyle p_{k}}$ are known only for several most promising outcomes, while the remaining outcomes have no chance to win. In this case it must be that ${\displaystyle \sum _{i}{p_{i}}<1}$ and ${\displaystyle \sum _{i}{\beta _{i}}<1}$.

## Application to the stock market

### Single Asset

Considering a single asset (Stock, index etc.) and a risk-free rate, it is easy to obtain the optimal fraction to invest through the Geometric Brownian Motion. Taking the value of asset ${\displaystyle S}$ at time ${\displaystyle t}$ (${\displaystyle S_{t}}$):

${\displaystyle \mathbb {E} \log(S_{t})=\log(S_{0})+(\mu -{\frac {\sigma ^{2}}{2}})t}$

Then the expected log return ${\displaystyle R_{s}}$ is

${\displaystyle R_{s}=\left(\mu -{\frac {\sigma ^{2}}{2}}\,\right)t}$

For a portfolio made of an asset ${\displaystyle S}$ and a bond paying risk-free rate ${\displaystyle r}$. With a fraction ${\displaystyle f}$ invested in ${\displaystyle S}$ and ${\displaystyle (1-f)}$ in the bond. The Expected rate of return ${\displaystyle G(f)}$ is given by:

${\displaystyle G(f)=f\mu -{\frac {(f\sigma )^{2}}{2}}+((1-f)\ r)}$

Solving ${\displaystyle \max(G(f))}$ we obtain:

${\displaystyle f^{*}={\frac {\mu -r}{\sigma ^{2}}}}$

Being ${\displaystyle f^{*}}$ the fraction that maximizes the return, and so, the Kelly fraction.

Thorp[15] arrive at the same result but through a different deduction / derivation.

Worth to remember that ${\displaystyle \mu }$ is different from the asset log return ${\displaystyle R_{s}}$. Confusing than is a very common mistake made by websites and articles talking about the Kelly Criterion.

### Many Assets

Consider a market with ${\displaystyle n}$ correlated stocks ${\displaystyle S_{k}}$ with stochastic returns ${\displaystyle r_{k}}$, ${\displaystyle k=1,...,n}$ and a riskless bond with return ${\displaystyle r}$. An investor puts a fraction ${\displaystyle u_{k}}$ of his capital in ${\displaystyle S_{k}}$ and the rest is invested in bond. Without loss of generality, assume that investor's starting capital is equal to 1. According to Kelly criterion one should maximize ${\displaystyle \mathbb {E} \left[\ln \left((1+r)+\sum \limits _{k=1}^{n}u_{k}(r_{k}-r)\right)\right]}$
Expanding it to the Taylor series around ${\displaystyle {\vec {u_{0}}}=(0,\ldots ,0)}$ we obtain
${\displaystyle \mathbb {E} \left[\ln(1+r)+\sum \limits _{k=1}^{n}{\frac {u_{k}(r_{k}-r)}{1+r}}-{\frac {1}{2}}\sum \limits _{k=1}^{n}\sum \limits _{j=1}^{n}u_{k}u_{j}{\frac {(r_{k}-r)(r_{j}-r)}{(1+r)^{2}}}\right]}$
Thus we reduce the optimization problem to quadratic programming and the unconstrained solution is ${\displaystyle {\vec {u^{\star }}}=(1+r)({\widehat {\Sigma }})^{-1}({\widehat {\vec {r}}}-r)}$
where ${\displaystyle {\widehat {\vec {r}}}}$ and ${\displaystyle {\widehat {\Sigma }}}$ are the vector of means and the matrix of second mixed noncentral moments of the excess returns.[16] There are also numerical algorithms for the fractional Kelly strategies and for the optimal solution under no leverage and no short selling constraints.

## References

1. ^ a b c d Kelly, J. L. (1956). "A New Interpretation of Information Rate" (PDF). Bell System Technical Journal. 35 (4): 917–926. doi:10.1002/j.1538-7305.1956.tb03809.x.
2. ^ Thorp, E. O. (January 1961), "Fortune's Formula: The Game of Blackjack", American Mathematical Society
3. ^ Thorp, E. O. (1962), Beat the dealer: a winning strategy for the game of twenty-one. A scientific analysis of the world-wide game known variously as blackjack, twenty-one, vingt-et-un, pontoon or Van John, Blaisdell Pub. Co
4. ^ Thorp, Edward O.; Kassouf, Sheen T. (1967), Beat the Market: A Scientific Stock Market System (PDF), Random House, ISBN 0-394-42439-5[page needed]
5. ^ a b Poundstone, William (2005), Fortune's Formula: The Untold Story of the Scientific Betting System That Beat the Casinos and Wall Street, New York: Hill and Wang, ISBN 0-8090-4637-7
6. ^ Thorp, E. O. (May 2008), "The Kelly Criterion: Part I", Wilmott Magazine
7. ^ Zenios, S. A.; Ziemba, W. T. (2006), Handbook of Asset and Liability Management, North Holland, ISBN 978-0-444-50875-1
8. ^ Pabrai, Mohnish (2007), The Dhandho Investor: The Low-Risk Value Method to High Returns, Wiley, ISBN 978-0-470-04389-9
9. ^ Thorp, E. O. (September 2008), "The Kelly Criterion: Part II", Wilmott Magazine
10. ^ Press, WH; Teukolsky, SA; Vetterling, WT; Flannery, BP (2007), "Section 14.7 (Example 2.)", Numerical Recipes: The Art of Scientific Computing (3rd ed.), New York: Cambridge University Press, ISBN 978-0-521-88068-8
11. ^ Thorp, E. O. (1969). "Optimal Gambling Systems for Favorable Games". Revue de l'Institut International de Statistique / Review of the International Statistical Institute. International Statistical Institute (ISI). 37 (3): 273–293. JSTOR 1402118. MR 0135630.
12. ^ THE KELLY CRITERION IN BLACKJACK, SPORTS BETTING, AND THE STOCK MARKET by Edward O. Thorp Paper presented at: The 10th International Conference on Gambling and Risk Taking Montreal, June 1997
13. ^ Bernoulli, Daniel (1954) [1738]. "Exposition of a New Theory on the Measurement of Risk". Econometrica. The Econometric Society. 22 (1): 22–36. JSTOR 1909829.
14. ^ a b c d Smoczynski, Peter; Tomkins, Dave (2010) "An explicit solution to the problem of optimizing the allocations of a bettor’s wealth when wagering on horse races", Mathematical Scientist", 35 (1), 10-17
15. ^ THE KELLY CRITERION IN BLACKJACK, SPORTS BETTING, AND THE STOCK MARKET by Edward O. Thorp Paper presented at: The 10th International Conference on Gambling and Risk Taking Montreal, June 1997
16. ^ Nekrasov, Vasily(2013) "Kelly Criterion for Multivariate Portfolios: A Model-Free Approach"