# Price equation

In the theory of evolution and natural selection, the Price equation (also known as Price's equation or Price's theorem) describes how a trait or allele changes in frequency over time. The equation uses a covariance between a trait and fitness, to give a mathematical description of evolution and natural selection. It provides a way to understand the effects that gene transmission and natural selection have on the frequency of alleles within each new generation of a population. The Price equation was derived by George R. Price, working in London to re-derive W.D. Hamilton's work on kin selection. Examples of the Price equation have been constructed for various evolutionary cases. The Price equation also has applications in economics.[1]

It is important to note that the Price equation is not a physical or biological law. It is not a concise or general expression of experimentally validated results. It is rather a purely mathematical relationship between various statistical descriptors of population dynamics. It is mathematically valid, and therefore not subject to experimental verification. In simple terms, it is a mathematical restatement of the expression "survival of the fittest" which is actually self-evident, given the mathematical definitions of "survival" and "fittest".

## Statement

The Price equation shows that a change in the average amount ${\displaystyle z}$ of a trait in a population from one generation to the next (${\displaystyle \Delta z}$) is determined by the covariance between the amounts ${\displaystyle z_{i}}$ of the trait for subpopulation ${\displaystyle i}$ and the fitnesses ${\displaystyle w_{i}}$ of the subpopulations, together with the expected change in the amount of the trait value due to fitness, namely ${\displaystyle \mathrm {E} (w_{i}\Delta z_{i})}$:

${\displaystyle \Delta {z}={\frac {1}{w}}\operatorname {cov} (w_{i},z_{i})+{\frac {1}{w}}\operatorname {E} (w_{i}\,\Delta z_{i}).}$

Here ${\displaystyle w}$ is the average fitness over the population, and ${\displaystyle \operatorname {E} }$ and ${\displaystyle \operatorname {cov} }$ represent the population mean and covariance respectively. 'Fitness' ${\displaystyle w}$ is the ratio of the average number of offspring for the whole population per the number of adult individuals in the population, and ${\displaystyle w_{i}}$ is that same ratio only for subpopulation ${\displaystyle i}$.

If the covariance between fitness (${\displaystyle w_{i}}$) and trait value (${\displaystyle z_{i}}$) is positive, the trait value is expected to rise on average across population ${\displaystyle i}$. If the covariance is negative, the characteristic is harmful, and its frequency is expected to drop.

The second term, ${\displaystyle \mathrm {E} (w_{i}\Delta z_{i})}$, represents the portion of ${\displaystyle \Delta z}$ due to all factors other than direct selection which can affect trait evolution. This term can encompass genetic drift, mutation bias, or meiotic drive. Additionally, this term can encompass the effects of multi-level selection or group selection. Price (1972) referred to this as the "environment change" term, and denoted both terms using partial derivative notation (∂NS and ∂EC). This concept of environment includes interspecies and ecological effects. Price describes this as follows:

Fisher adopted the somewhat unusual point of view of regarding dominance and epistasis as being environment effects. For example, he writes (1941): ‘A change in the proportion of any pair of genes itself constitutes a change in the environment in which individuals of the species find themselves.’ Hence he regarded the natural selection effect on M as being limited to the additive or linear effects of changes in gene frequencies, while everything else – dominance, epistasis, population pressure, climate, and interactions with other species – he regarded as a matter of the environment.

— G.R. Price (1972), Fisher's fundamental theorem made clear[2]

## Proof

Suppose we are given four equal-length lists of real numbers[3] ${\displaystyle n_{i}}$, ${\displaystyle z_{i}}$, ${\displaystyle n_{i}'}$, ${\displaystyle z_{i}'}$ from which we may define ${\displaystyle w_{i}=n_{i}'/n_{i}}$. ${\displaystyle n_{i}}$ and ${\displaystyle z_{i}}$ will be called the parent population numbers and characteristics associated with each index i. Likewise ${\displaystyle n_{i}'}$ and ${\displaystyle z_{i}'}$ will be called the child population numbers and characteristics, and ${\displaystyle w_{i}'}$ will be called the fitness associated with index i. (Equivalently, we could have been given ${\displaystyle n_{i}}$, ${\displaystyle z_{i}}$, ${\displaystyle w_{i}}$, ${\displaystyle z_{i}'}$ with ${\displaystyle n_{i}'=w_{i}n_{i}}$.) Define the parent and child population totals:

 ${\displaystyle n\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}n_{i}}$ ${\displaystyle n'\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}n_{i}'}$

and the probabilities (or frequencies):[4]

 ${\displaystyle q_{i}\;{\stackrel {\mathrm {def} }{=}}\;n_{i}/n}$ ${\displaystyle q_{i}'\;{\stackrel {\mathrm {def} }{=}}\;n_{i}'/n'}$

Note that these are of the form of probability mass functions in that ${\displaystyle \sum _{i}q_{i}=\sum _{i}q_{i}'=1}$ and are in fact the probabilities that a random individual drawn from the parent or child population has a characteristic ${\displaystyle z_{i}}$. Define the fitnesses:

${\displaystyle w_{i}\;{\stackrel {\mathrm {def} }{=}}\;n_{i}'/n_{i}}$

The average of any list ${\displaystyle x_{i}}$ is given by:

${\displaystyle E(x_{i})=\sum _{i}q_{i}x_{i}}$

so the average characteristics are defined as:

 ${\displaystyle z\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}z_{i}}$ ${\displaystyle z'\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}'z_{i}'}$

and the average fitness is:

${\displaystyle w\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}w_{i}}$

A simple theorem can be proved: ${\displaystyle q_{i}w_{i}=\left({\frac {n_{i}}{n}}\right)\left({\frac {n_{i}'}{n_{i}}}\right)=\left({\frac {n_{i}'}{n'}}\right)\left({\frac {n'}{n}}\right)=q_{i}'\left({\frac {n'}{n}}\right)}$ so that:

${\displaystyle w={\frac {n'}{n}}\sum _{i}q_{i}'={\frac {n'}{n}}}$

and

${\displaystyle q_{i}w_{i}=w\,q_{i}'}$

The covariance of ${\displaystyle w_{i}}$ and ${\displaystyle z_{i}}$ is defined by:

${\displaystyle \operatorname {cov} (w_{i},z_{i})\;{\stackrel {\mathrm {def} }{=}}\;E(w_{i}z_{i})-E(w_{i})E(z_{i})=\sum _{i}q_{i}w_{i}z_{i}-wz}$

Defining ${\displaystyle \Delta z_{i}\;{\stackrel {\mathrm {def} }{=}}\;z_{i}'-z_{i}}$, the expectation value of ${\displaystyle w_{i}\Delta z_{i}}$ is

${\displaystyle E(w_{i}\Delta z_{i})=\sum q_{i}w_{i}(z_{i}'-z_{i})=\sum _{i}q_{i}w_{i}z_{i}'-\sum _{i}q_{i}w_{i}z_{i}}$

The sum of the two terms is:

${\displaystyle \operatorname {cov} (w_{i},z_{i})+E(w_{i}\Delta z_{i})=\sum _{i}q_{i}w_{i}z_{i}-wz+\sum _{i}q_{i}w_{i}z_{i}'-\sum _{i}q_{i}w_{i}z_{i}=\sum _{i}q_{i}w_{i}z_{i}'-wz}$

Using the above mentioned simple theorem, the sum becomes

${\displaystyle \operatorname {cov} (w_{i},z_{i})+E(w_{i}\Delta z_{i})=w\sum _{i}q_{i}'z_{i}'-wz=wz'-wz=w\Delta z}$

where ${\displaystyle \Delta z\;{\stackrel {\mathrm {def} }{=}}\;z'-z}$.

### Derivation of the continuous-time Price equation

Consider a set of groups with ${\displaystyle i=1,...,n}$ that are characterized by a particular trait, denoted by ${\displaystyle x_{i}}$. The number ${\displaystyle n_{i}}$ of individuals belonging to group ${\displaystyle i}$ experiences exponential growth:

${\displaystyle {dn_{i} \over {dt}}=f_{i}n_{i}}$
where ${\displaystyle f_{i}}$ corresponds to the fitness of the group. We want to derive an equation describing the time-evolution of the expected value of the trait:
${\displaystyle \mathbb {E} (x)=\sum _{i}p_{i}x_{i}\equiv \mu ,\quad p_{i}={n_{i} \over {\sum _{i}n_{i}}}}$
Based on the chain rule, we may derive an ordinary differential equation:
{\displaystyle {\begin{aligned}{d\mu \over {dt}}&=\sum _{i}{\partial \mu \over {\partial p_{i}}}{dp_{i} \over {dt}}+\sum _{i}{\partial \mu \over {\partial x_{i}}}{dx_{i} \over {dt}}\\&=\sum _{i}x_{i}{dp_{i} \over {dt}}+\sum _{i}p_{i}{dx_{i} \over {dt}}\\&=\sum _{i}x_{i}{dp_{i} \over {dt}}+\mathbb {E} \left({dx \over {dt}}\right)\end{aligned}}}
A further application of the chain rule for ${\displaystyle dp_{i}/dt}$ gives us:
${\displaystyle {dp_{i} \over {dt}}=\sum _{j}{\partial p_{i} \over {\partial n_{j}}}{dn_{j} \over {dt}},\quad {\partial p_{i} \over {\partial n_{j}}}={\begin{cases}-p_{i}/N,\quad &i\neq j\\(1-p_{i})/N,\quad &i=j\end{cases}}}$
Summing up the components gives us that:
{\displaystyle {\begin{aligned}{dp_{i} \over {dt}}&=p_{i}\left(f_{i}-\sum _{j}p_{j}f_{j}\right)\\&=p_{i}\left[f_{i}-\mathbb {E} (f)\right]\end{aligned}}}

which is also known as the replicator equation. Now, note that:

{\displaystyle {\begin{aligned}\sum _{i}x_{i}{dp_{i} \over {dt}}&=\sum _{i}p_{i}x_{i}\left[f_{i}-\mathbb {E} (f)\right]\\&=\mathbb {E} \left\{x_{i}\left[f_{i}-\mathbb {E} (f)\right]\right\}\\&={\text{Cov}}(x,f)\end{aligned}}}
Therefore, putting all of these components together, we arrive at the continuous-time Price equation:
${\displaystyle {d \over {dt}}\mathbb {E} (x)=\underbrace {{\text{Cov}}(x,f)} _{\text{Selection effect}}+\underbrace {\mathbb {E} ({\dot {x}})} _{\text{Dynamic effect}}}$

## Simple Price equation

When the characteristic values ${\displaystyle z_{i}}$ do not change from the parent to the child generation, the second term in the Price equation becomes zero resulting in a simplified version of the Price equation:

${\displaystyle w\,\Delta z=\operatorname {cov} \left(w_{i},z_{i}\right)}$

which can be restated as:

${\displaystyle \Delta z=\operatorname {cov} \left(v_{i},z_{i}\right)}$

where ${\displaystyle v_{i}}$ is the fractional fitness: ${\displaystyle v_{i}=w_{i}/w}$.

This simple Price equation can be proven using the definition in Equation (2) above. It makes this fundamental statement about evolution: "If a certain inheritable characteristic is correlated with an increase in fractional fitness, the average value of that characteristic in the child population will be increased over that in the parent population."

### Applications

The Price equation can describe any system that changes over time, but is most often applied in evolutionary biology. The evolution of sight provides an example of simple directional selection. The evolution of sickle cell anemia shows how a heterozygote advantage can affect trait evolution. The Price equation can also be applied to population context dependent traits such as the evolution of sex ratios. Additionally, the Price equation is flexible enough to model second order traits such as the evolution of mutability. The Price equation also provides an extension to Founder effect which shows change in population traits in different settlements

### Dynamical sufficiency and the simple Price equation

Sometimes the genetic model being used encodes enough information into the parameters used by the Price equation to allow the calculation of the parameters for all subsequent generations. This property is referred to as dynamical sufficiency. For simplicity, the following looks at dynamical sufficiency for the simple Price equation, but is also valid for the full Price equation.

Referring to the definition in Equation (2), the simple Price equation for the character ${\displaystyle z}$ can be written:

${\displaystyle w(z'-z)=\langle w_{i}z_{i}\rangle -wz}$

For the second generation:

${\displaystyle w'(z''-z')=\langle w'_{i}z'_{i}\rangle -w'z'}$

The simple Price equation for ${\displaystyle z}$ only gives us the value of ${\displaystyle z'}$ for the first generation, but does not give us the value of ${\displaystyle w'}$ and ${\displaystyle \langle w_{i}z_{i}\rangle }$, which are needed to calculate ${\displaystyle z''}$ for the second generation. The variables ${\displaystyle w_{i}}$ and ${\displaystyle \langle w_{i}z_{i}\rangle }$ can both be thought of as characteristics of the first generation, so the Price equation can be used to calculate them as well:

{\displaystyle {\begin{aligned}w(w'-w)&=\langle w_{i}^{2}\rangle -w^{2}\\w\left(\langle w'_{i}z'_{i}\rangle -\langle w_{i}z_{i}\rangle \right)&=\langle w_{i}^{2}z_{i}\rangle -w\langle w_{i}z_{i}\rangle \end{aligned}}}

The five 0-generation variables ${\displaystyle w}$, ${\displaystyle z}$, ${\displaystyle \langle w_{i}z_{i}\rangle }$, ${\displaystyle \langle w_{i}^{2}\rangle }$, and ${\displaystyle \langle w_{i}^{2}z_{i}}$ must be known before proceeding to calculate the three first generation variables ${\displaystyle w'}$, ${\displaystyle z'}$, and ${\displaystyle \langle w'_{i}z'_{i}\rangle }$, which are needed to calculate ${\displaystyle z''}$ for the second generation. It can be seen that in general the Price equation cannot be used to propagate forward in time unless there is a way of calculating the higher moments ${\displaystyle \langle w_{i}^{n}\rangle }$ and ${\displaystyle \langle w_{i}^{n}z_{i}\rangle }$ from the lower moments in a way that is independent of the generation. Dynamical sufficiency means that such equations can be found in the genetic model, allowing the Price equation to be used alone as a propagator of the dynamics of the model forward in time.

## Full Price equation

The simple Price equation was based on the assumption that the characters ${\displaystyle z_{i}}$ do not change over one generation. If it is assumed that they do change, with ${\displaystyle z_{i}}$ being the value of the character in the child population, then the full Price equation must be used. A change in character can come about in a number of ways. The following two examples illustrate two such possibilities, each of which introduces new insight into the Price equation.

### Genotype fitness

We focus on the idea of the fitness of the genotype. The index ${\displaystyle i}$ indicates the genotype and the number of type ${\displaystyle i}$ genotypes in the child population is:

${\displaystyle n'_{i}=\sum _{j}w_{ji}n_{j}\,}$

which gives fitness:

${\displaystyle w_{i}={\frac {n'_{i}}{n_{i}}}}$

Since the individual mutability ${\displaystyle z_{i}}$ does not change, the average mutabilities will be:

{\displaystyle {\begin{aligned}z&={\frac {1}{n}}\sum _{i}z_{i}n_{i}\\z'&={\frac {1}{n'}}\sum _{i}z_{i}n'_{i}\end{aligned}}}

with these definitions, the simple Price equation now applies.

### Lineage fitness

In this case we want to look at the idea that fitness is measured by the number of children an organism has, regardless of their genotype. Note that we now have two methods of grouping, by lineage, and by genotype. It is this complication that will introduce the need for the full Price equation. The number of children an ${\displaystyle i}$-type organism has is:

${\displaystyle n'_{i}=n_{i}\sum _{j}w_{ij}\,}$

which gives fitness:

${\displaystyle w_{i}={\frac {n'_{i}}{n_{i}}}=\sum _{j}w_{ij}}$

We now have characters in the child population which are the average character of the ${\displaystyle i}$-th parent.

${\displaystyle z'_{j}={\frac {\sum _{i}n_{i}z_{i}w_{ij}}{\sum _{i}n_{i}w_{ij}}}}$

with global characters:

{\displaystyle {\begin{aligned}z&={\frac {1}{n}}\sum _{i}z_{i}n_{i}\\z'&={\frac {1}{n'}}\sum _{i}z_{i}n'_{i}\end{aligned}}}

with these definitions, the full Price equation now applies.

## Criticism

The use of the change in average characteristic (${\displaystyle z'-z}$) per generation as a measure of evolutionary progress is not always appropriate. There may be cases where the average remains unchanged (and the covariance between fitness and characteristic is zero) while evolution is nevertheless in progress. For example, if we have ${\displaystyle z_{i}=(1,2,3)}$, ${\displaystyle n_{i}=(1,1,1)}$, and ${\displaystyle w_{i}=(1,4,1)}$, then for the child population, ${\displaystyle n_{i}'=(1,4,1)}$ showing that the peak fitness at ${\displaystyle w_{2}=4}$ is in fact fractionally increasing the population of individuals with ${\displaystyle z_{i}=2}$. However, the average characteristics are z=2 and z'=2 so that ${\displaystyle \Delta z=0}$. The covariance ${\displaystyle \mathrm {cov} (z_{i},w_{i})}$ is also zero. The simple Price equation is required here, and it yields 0=0. In other words, it yields no information regarding the progress of evolution in this system.

A critical discussion of the use of the Price equation can be found in van Veelen (2005),[5] van Veelen et al. (2012),[6] and van Veelen (2020).[7] Frank (2012) discusses the criticism in van Veelen et al. (2012).[8]

## Cultural references

Price's equation features in the plot and title of the 2008 thriller film WΔZ.

The Price equation also features in posters in the computer game BioShock 2, in which a consumer of a "Brain Boost" tonic is seen deriving the Price equation while simultaneously reading a book. The game is set in the 1950s, substantially before Price's work.

## References

1. ^ Knudsen, Thorbjørn (2004). "General selection theory and economic evolution: The Price equation and the replicator/interactor distinction". Journal of Economic Methodology. 11 (2): 147–173. doi:10.1080/13501780410001694109. S2CID 154197796. Retrieved 2011-10-22.
2. ^ Price, G.R. (1972). "Fisher's "fundamental theorem" made clear". Annals of Human Genetics. 36 (2): 129–140. doi:10.1111/j.1469-1809.1972.tb00764.x. PMID 4656569. S2CID 20757537.
3. ^ The lists may in fact be members of any field (i.e. a set on which addition, subtraction, multiplication, and division are defined and behave as the corresponding operations on rational and real numbers do
4. ^ Frank, Steven A. (1995). "George Price's Contributions to Evolutionary Genetics". J. Theor. Biol. 175 (3): 373–388. Bibcode:1995JThBi.175..373F. doi:10.1006/jtbi.1995.0148. PMID 7475081. Retrieved Mar 19, 2023.
5. ^ van Veelen, M. (December 2005). "On the use of the Price equation". Journal of Theoretical Biology. 237 (4): 412–426. Bibcode:2005JThBi.237..412V. doi:10.1016/j.jtbi.2005.04.026. PMID 15953618.
6. ^ van Veelen, M.; García, J.; Sabelis, M.W.; Egas, M. (April 2012). "Group selection and inclusive fitness are not equivalent; the Price equation vs. models and statistics". Journal of Theoretical Biology. 299: 64–80. Bibcode:2012JThBi.299...64V. doi:10.1016/j.jtbi.2011.07.025. PMID 21839750.
7. ^ van Veelen, M. (March 2020). "The problem with the Price equation". Philosophical Transactions of the Royal Society B. 375 (1797): 1–13. doi:10.1098/rstb.2019.0355. PMC 7133513. PMID 32146887.
8. ^ Frank, S.A. (2012). "Natural Selection IV: The Price equation". Journal of Evolutionary Biology. 25 (6): 1002–1019. arXiv:1204.1515. doi:10.1111/j.1420-9101.2012.02498.x. PMC 3354028. PMID 22487312.