Price equation

In the theory of evolution and natural selection, the Price equation (also known as Price's equation or Price's theorem) describes how a trait or allele changes in frequency over time. The equation uses a covariance between a trait and fitness, to give a mathematical description of evolution and natural selection. It provides a way to understand the effects that gene transmission and natural selection have on the frequency of alleles within each new generation of a population. The Price equation was derived by George R. Price, working in London to re-derive W.D. Hamilton's work on kin selection. Examples of the Price equation have been constructed for various evolutionary cases. The Price equation also has applications in economics.^[1]

The Price equation is a mathematical relationship between various statistical descriptors of population dynamics, rather than a physical or biological law, and as such is not subject to experimental verification. In simple terms, it is a mathematical statement of the expression "survival of the fittest".

Statement

The Price equation shows that a change in the average amount $z$ of a trait in a population from one generation to the next ( $\Delta z$ ) is determined by the covariance between the amounts $z_{i}$ of the trait for subpopulation $i$ and the fitnesses $w_{i}$ of the subpopulations, together with the expected change in the amount of the trait value due to fitness, namely $\mathrm {E} (w_{i}\Delta z_{i})$ :

\Delta {z}={\frac {1}{w}}\operatorname {cov} (w_{i},z_{i})+{\frac {1}{w}}\operatorname {E} (w_{i}\,\Delta z_{i}).

Here $w$ is the average fitness over the population, and $\operatorname {E}$ and $\operatorname {cov}$ represent the population mean and covariance respectively. 'Fitness' $w$ is the ratio of the average number of offspring for the whole population per the number of adult individuals in the population, and $w_{i}$ is that same ratio only for subpopulation $i$ .

If the covariance between fitness ( $w_{i}$ ) and trait value ( $z_{i}$ ) is positive, the trait value is expected to rise on average across population $i$ . If the covariance is negative, the characteristic is harmful, and its frequency is expected to drop.

The second term, $\mathrm {E} (w_{i}\Delta z_{i})$ , represents the portion of $\Delta z$ due to all factors other than direct selection which can affect trait evolution. This term can encompass genetic drift, mutation bias, or meiotic drive. Additionally, this term can encompass the effects of multi-level selection or group selection. Price (1972) referred to this as the "environment change" term, and denoted both terms using partial derivative notation (∂_NS and ∂_EC). This concept of environment includes interspecies and ecological effects. Price describes this as follows:

Fisher adopted the somewhat unusual point of view of regarding dominance and epistasis as being environment effects. For example, he writes (1941): ‘A change in the proportion of any pair of genes itself constitutes a change in the environment in which individuals of the species find themselves.’ Hence he regarded the natural selection effect on $M$ as being limited to the additive or linear effects of changes in gene frequencies, while everything else – dominance, epistasis, population pressure, climate, and interactions with other species – he regarded as a matter of the environment.
— G.R. Price (1972), Fisher's fundamental theorem made clear^[2]

Proof

Suppose we are given four equal-length lists of real numbers^[3] $n_{i}$ , $z_{i}$ , $n_{i}'$ , $z_{i}'$ from which we may define $w_{i}=n_{i}'/n_{i}$ . $n_{i}$ and $z_{i}$ will be called the parent population numbers and characteristics associated with each index i. Likewise $n_{i}'$ and $z_{i}'$ will be called the child population numbers and characteristics, and $w_{i}'$ will be called the fitness associated with index i. (Equivalently, we could have been given $n_{i}$ , $z_{i}$ , $w_{i}$ , $z_{i}'$ with $n_{i}'=w_{i}n_{i}$ .) Define the parent and child population totals:

n\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}n_{i}

n'\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}n_{i}'

and the probabilities (or frequencies):^[4]

q_{i}\;{\stackrel {\mathrm {def} }{=}}\;n_{i}/n

q_{i}'\;{\stackrel {\mathrm {def} }{=}}\;n_{i}'/n'

Note that these are of the form of probability mass functions in that $\sum _{i}q_{i}=\sum _{i}q_{i}'=1$ and are in fact the probabilities that a random individual drawn from the parent or child population has a characteristic $z_{i}$ . Define the fitnesses:

w_{i}\;{\stackrel {\mathrm {def} }{=}}\;n_{i}'/n_{i}

The average of any list $x_{i}$ is given by:

E(x_{i})=\sum _{i}q_{i}x_{i}

so the average characteristics are defined as:

z\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}z_{i}

z'\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}'z_{i}'

and the average fitness is:

w\;{\stackrel {\mathrm {def} }{=}}\;\sum _{i}q_{i}w_{i}

A simple theorem can be proved: $q_{i}w_{i}=\left({\frac {n_{i}}{n}}\right)\left({\frac {n_{i}'}{n_{i}}}\right)=\left({\frac {n_{i}'}{n'}}\right)\left({\frac {n'}{n}}\right)=q_{i}'\left({\frac {n'}{n}}\right)$ so that:

w={\frac {n'}{n}}\sum _{i}q_{i}'={\frac {n'}{n}}

and

q_{i}w_{i}=w\,q_{i}'

The covariance of $w_{i}$ and $z_{i}$ is defined by:

\operatorname {cov} (w_{i},z_{i})\;{\stackrel {\mathrm {def} }{=}}\;E(w_{i}z_{i})-E(w_{i})E(z_{i})=\sum _{i}q_{i}w_{i}z_{i}-wz

Defining $\Delta z_{i}\;{\stackrel {\mathrm {def} }{=}}\;z_{i}'-z_{i}$ , the expectation value of $w_{i}\Delta z_{i}$ is

E(w_{i}\Delta z_{i})=\sum q_{i}w_{i}(z_{i}'-z_{i})=\sum _{i}q_{i}w_{i}z_{i}'-\sum _{i}q_{i}w_{i}z_{i}

The sum of the two terms is:

\operatorname {cov} (w_{i},z_{i})+E(w_{i}\Delta z_{i})=\sum _{i}q_{i}w_{i}z_{i}-wz+\sum _{i}q_{i}w_{i}z_{i}'-\sum _{i}q_{i}w_{i}z_{i}=\sum _{i}q_{i}w_{i}z_{i}'-wz

Using the above mentioned simple theorem, the sum becomes

\operatorname {cov} (w_{i},z_{i})+E(w_{i}\Delta z_{i})=w\sum _{i}q_{i}'z_{i}'-wz=wz'-wz=w\Delta z

where $\Delta z\;{\stackrel {\mathrm {def} }{=}}\;z'-z$ .

Derivation of the continuous-time Price equation

Consider a set of groups with $i=1,...,n$ that are characterized by a particular trait, denoted by $x_{i}$ . The number $n_{i}$ of individuals belonging to group $i$ experiences exponential growth: ${dn_{i} \over {dt}}=f_{i}n_{i}$ where $f_{i}$ corresponds to the fitness of the group. We want to derive an equation describing the time-evolution of the expected value of the trait: $\mathbb {E} (x)=\sum _{i}p_{i}x_{i}\equiv \mu ,\quad p_{i}={n_{i} \over {\sum _{i}n_{i}}}$ Based on the chain rule, we may derive an ordinary differential equation: ${\begin{aligned}{d\mu \over {dt}}&=\sum _{i}{\partial \mu \over {\partial p_{i}}}{dp_{i} \over {dt}}+\sum _{i}{\partial \mu \over {\partial x_{i}}}{dx_{i} \over {dt}}\\&=\sum _{i}x_{i}{dp_{i} \over {dt}}+\sum _{i}p_{i}{dx_{i} \over {dt}}\\&=\sum _{i}x_{i}{dp_{i} \over {dt}}+\mathbb {E} \left({dx \over {dt}}\right)\end{aligned}}$ A further application of the chain rule for $dp_{i}/dt$ gives us: ${dp_{i} \over {dt}}=\sum _{j}{\partial p_{i} \over {\partial n_{j}}}{dn_{j} \over {dt}},\quad {\partial p_{i} \over {\partial n_{j}}}={\begin{cases}-p_{i}/N,\quad &i\neq j\\(1-p_{i})/N,\quad &i=j\end{cases}}$ Summing up the components gives us that: ${\begin{aligned}{dp_{i} \over {dt}}&=p_{i}\left(f_{i}-\sum _{j}p_{j}f_{j}\right)\\&=p_{i}\left[f_{i}-\mathbb {E} (f)\right]\end{aligned}}$

which is also known as the replicator equation. Now, note that: ${\begin{aligned}\sum _{i}x_{i}{dp_{i} \over {dt}}&=\sum _{i}p_{i}x_{i}\left[f_{i}-\mathbb {E} (f)\right]\\&=\mathbb {E} \left\{x_{i}\left[f_{i}-\mathbb {E} (f)\right]\right\}\\&={\text{Cov}}(x,f)\end{aligned}}$ Therefore, putting all of these components together, we arrive at the continuous-time Price equation: ${d \over {dt}}\mathbb {E} (x)=\underbrace {{\text{Cov}}(x,f)} _{\text{Selection effect}}+\underbrace {\mathbb {E} ({\dot {x}})} _{\text{Dynamic effect}}$

Simple Price equation

When the characteristic values $z_{i}$ do not change from the parent to the child generation, the second term in the Price equation becomes zero resulting in a simplified version of the Price equation:

w\,\Delta z=\operatorname {cov} \left(w_{i},z_{i}\right)

which can be restated as:

\Delta z=\operatorname {cov} \left(v_{i},z_{i}\right)

where $v_{i}$ is the fractional fitness: $v_{i}=w_{i}/w$ .

This simple Price equation can be proven using the definition in Equation (2) above. It makes this fundamental statement about evolution: "If a certain inheritable characteristic is correlated with an increase in fractional fitness, the average value of that characteristic in the child population will be increased over that in the parent population."

Applications

The Price equation can describe any system that changes over time, but is most often applied in evolutionary biology. The evolution of sight provides an example of simple directional selection. The evolution of sickle cell anemia shows how a heterozygote advantage can affect trait evolution. The Price equation can also be applied to population context dependent traits such as the evolution of sex ratios. Additionally, the Price equation is flexible enough to model second order traits such as the evolution of mutability. The Price equation also provides an extension to Founder effect which shows change in population traits in different settlements

Dynamical sufficiency and the simple Price equation

Sometimes the genetic model being used encodes enough information into the parameters used by the Price equation to allow the calculation of the parameters for all subsequent generations. This property is referred to as dynamical sufficiency. For simplicity, the following looks at dynamical sufficiency for the simple Price equation, but is also valid for the full Price equation.

Referring to the definition in Equation (2), the simple Price equation for the character $z$ can be written:

w(z'-z)=\langle w_{i}z_{i}\rangle -wz

For the second generation:

w'(z''-z')=\langle w'_{i}z'_{i}\rangle -w'z'

The simple Price equation for $z$ only gives us the value of $z'$ for the first generation, but does not give us the value of $w'$ and $\langle w_{i}z_{i}\rangle$ , which are needed to calculate $z''$ for the second generation. The variables $w_{i}$ and $\langle w_{i}z_{i}\rangle$ can both be thought of as characteristics of the first generation, so the Price equation can be used to calculate them as well:

{\begin{aligned}w(w'-w)&=\langle w_{i}^{2}\rangle -w^{2}\\w\left(\langle w'_{i}z'_{i}\rangle -\langle w_{i}z_{i}\rangle \right)&=\langle w_{i}^{2}z_{i}\rangle -w\langle w_{i}z_{i}\rangle \end{aligned}}

The five 0-generation variables $w$ , $z$ , $\langle w_{i}z_{i}\rangle$ , $\langle w_{i}^{2}\rangle$ , and $\langle w_{i}^{2}z_{i}$ must be known before proceeding to calculate the three first generation variables $w'$ , $z'$ , and $\langle w'_{i}z'_{i}\rangle$ , which are needed to calculate $z''$ for the second generation. It can be seen that in general the Price equation cannot be used to propagate forward in time unless there is a way of calculating the higher moments $\langle w_{i}^{n}\rangle$ and $\langle w_{i}^{n}z_{i}\rangle$ from the lower moments in a way that is independent of the generation. Dynamical sufficiency means that such equations can be found in the genetic model, allowing the Price equation to be used alone as a propagator of the dynamics of the model forward in time.

Full Price equation

The simple Price equation was based on the assumption that the characters $z_{i}$ do not change over one generation. If it is assumed that they do change, with $z_{i}$ being the value of the character in the child population, then the full Price equation must be used. A change in character can come about in a number of ways. The following two examples illustrate two such possibilities, each of which introduces new insight into the Price equation.

Genotype fitness

We focus on the idea of the fitness of the genotype. The index $i$ indicates the genotype and the number of type $i$ genotypes in the child population is:

n'_{i}=\sum _{j}w_{ji}n_{j}\,

which gives fitness:

w_{i}={\frac {n'_{i}}{n_{i}}}

Since the individual mutability $z_{i}$ does not change, the average mutabilities will be:

{\begin{aligned}z&={\frac {1}{n}}\sum _{i}z_{i}n_{i}\\z'&={\frac {1}{n'}}\sum _{i}z_{i}n'_{i}\end{aligned}}

with these definitions, the simple Price equation now applies.

Lineage fitness

In this case we want to look at the idea that fitness is measured by the number of children an organism has, regardless of their genotype. Note that we now have two methods of grouping, by lineage, and by genotype. It is this complication that will introduce the need for the full Price equation. The number of children an $i$ -type organism has is:

n'_{i}=n_{i}\sum _{j}w_{ij}\,

which gives fitness:

w_{i}={\frac {n'_{i}}{n_{i}}}=\sum _{j}w_{ij}

We now have characters in the child population which are the average character of the $i$ -th parent.

z'_{j}={\frac {\sum _{i}n_{i}z_{i}w_{ij}}{\sum _{i}n_{i}w_{ij}}}

with global characters:

{\begin{aligned}z&={\frac {1}{n}}\sum _{i}z_{i}n_{i}\\z'&={\frac {1}{n'}}\sum _{i}z_{i}n'_{i}\end{aligned}}

with these definitions, the full Price equation now applies.

Criticism

The use of the change in average characteristic ( $z'-z$ ) per generation as a measure of evolutionary progress is not always appropriate. There may be cases where the average remains unchanged (and the covariance between fitness and characteristic is zero) while evolution is nevertheless in progress. For example, if we have $z_{i}=(1,2,3)$ , $n_{i}=(1,1,1)$ , and $w_{i}=(1,4,1)$ , then for the child population, $n_{i}'=(1,4,1)$ showing that the peak fitness at $w_{2}=4$ is in fact fractionally increasing the population of individuals with $z_{i}=2$ . However, the average characteristics are z=2 and z'=2 so that $\Delta z=0$ . The covariance $\mathrm {cov} (z_{i},w_{i})$ is also zero. The simple Price equation is required here, and it yields 0=0. In other words, it yields no information regarding the progress of evolution in this system.

A critical discussion of the use of the Price equation can be found in van Veelen (2005),^[5] van Veelen et al. (2012),^[6] and van Veelen (2020).^[7] Frank (2012) discusses the criticism in van Veelen et al. (2012).^[8]

Cultural references

Price's equation features in the plot and title of the 2008 thriller film WΔZ.

The Price equation also features in posters in the computer game BioShock 2, in which a consumer of a "Brain Boost" tonic is seen deriving the Price equation while simultaneously reading a book. The game is set in the 1950s, substantially before Price's work.

References

^ Knudsen, Thorbjørn (2004). "General selection theory and economic evolution: The Price equation and the replicator/interactor distinction". Journal of Economic Methodology. 11 (2): 147–173. doi:10.1080/13501780410001694109. S2CID 154197796. Retrieved 2011-10-22.
^ Price, G.R. (1972). "Fisher's "fundamental theorem" made clear". Annals of Human Genetics. 36 (2): 129–140. doi:10.1111/j.1469-1809.1972.tb00764.x. PMID 4656569. S2CID 20757537.
^ The lists may in fact be members of any field (i.e. a set on which addition, subtraction, multiplication, and division are defined and behave as the corresponding operations on rational and real numbers do
^ Frank, Steven A. (1995). "George Price's Contributions to Evolutionary Genetics". J. Theor. Biol. 175 (3): 373–388. Bibcode:1995JThBi.175..373F. doi:10.1006/jtbi.1995.0148. PMID 7475081. Retrieved Mar 19, 2023.
^ van Veelen, M. (December 2005). "On the use of the Price equation". Journal of Theoretical Biology. 237 (4): 412–426. Bibcode:2005JThBi.237..412V. doi:10.1016/j.jtbi.2005.04.026. PMID 15953618.
^ van Veelen, M.; García, J.; Sabelis, M.W.; Egas, M. (April 2012). "Group selection and inclusive fitness are not equivalent; the Price equation vs. models and statistics". Journal of Theoretical Biology. 299: 64–80. Bibcode:2012JThBi.299...64V. doi:10.1016/j.jtbi.2011.07.025. PMID 21839750.
^ van Veelen, M. (March 2020). "The problem with the Price equation". Philosophical Transactions of the Royal Society B. 375 (1797): 1–13. doi:10.1098/rstb.2019.0355. PMC 7133513. PMID 32146887.
^ Frank, S.A. (2012). "Natural Selection IV: The Price equation". Journal of Evolutionary Biology. 25 (6): 1002–1019. arXiv:1204.1515. doi:10.1111/j.1420-9101.2012.02498.x. PMC 3354028. PMID 22487312.

v t e Population genetics
Key concepts	Hardy–Weinberg principle Genetic linkage Identity by descent Linkage disequilibrium Fisher's fundamental theorem Neutral theory Shifting balance theory Price equation Coefficient of inbreeding Coefficient of relationship Selection coefficient Fitness Heritability Population structure Constructive neutral evolution
Selection	Natural Artificial Sexual Ecological
Effects of selection on genomic variation	Genetic hitchhiking Background selection
Genetic drift	Small population size Population bottleneck Founder effect Coalescence Balding–Nichols model
Founders	R. A. Fisher J. B. S. Haldane Sewall Wright
Related topics	Biogeography Evolution Evolutionary game theory Fitness landscape Genetic genealogy Landscape genetics and genomics Microevolution Population genomics Phylogeography Quantitative genetics
Index of evolutionary biology articles