# Neutral theory of molecular evolution

The neutral theory of molecular evolution holds that most evolutionary changes occur at the molecular level, and most of the variation within and between species are due to random genetic drift of mutant alleles that are selectively neutral. The theory applies only for evolution at the molecular level, and is compatible with phenotypic evolution being shaped by natural selection as postulated by Charles Darwin. The neutral theory allows for the possibility that most mutations are deleterious, but holds that because these are rapidly removed by natural selection, they do not make significant contributions to variation within and between species at the molecular level. A neutral mutation is one that does not affect an organism's ability to survive and reproduce. The neutral theory assumes that most mutations that are not deleterious are neutral rather than beneficial. Because only a fraction of gametes are sampled in each generation of a species, the neutral theory suggests that a mutant allele can arise within a population and reach fixation by chance, rather than by selective advantage.[1]

The theory was introduced by the Japanese biologist Motoo Kimura in 1968, and independently by two American biologists Jack Lester King and Thomas Hughes Jukes in 1969, and described in detail by Kimura in his 1983 monograph The Neutral Theory of Molecular Evolution. The proposal of the neutral theory was followed by an extensive "neutralist-selectionist" controversy over the interpretation of patterns of molecular divergence and gene polymorphism, peaking in the 1970s and 1980s.

Neutral theory is frequently used as the null hypothesis, as opposed to adaptive explanations, for describing the emergence of morphological or genetic features in organisms and populations. This has been suggested in a number of areas, including in explaining genetic variation between populations of one nominal species,[2] the emergence of complex subcellular machinery,[3] and the convergent emergence of several typical microbial morphologies.[4]

## Origins

While some scientists, such as Freese (1962)[5] and Freese and Yoshida (1965),[6] had suggested that neutral mutations were probably widespread, the original mathematical derivation of the theory had been published by R.A. Fisher in 1930.[7] Fisher however gave a reasoned argument for believing that in practice neutral gene substitutions would be very rare.[8] A coherent theory of neutral evolution was first proposed by Motoo Kimura in 1968,[9] and by King and Jukes independently in 1969.[10] Kimura initially focused on differences among species, King and Jukes on differences within species.

Many molecular biologists and population geneticists also contributed to the development of the neutral theory.[1][11][12] Principles of population genetics, established by J.B.S. Haldane, R.A. Fisher and Sewall Wright, created a mathematical approach to analyzing gene frequencies that contributed to the development of Kimura's theory.

Haldane's dilemma regarding the cost of selection was used as motivation by Kimura. Haldane estimated that it takes about 300 generations for a beneficial mutation to become fixed in a mammalian lineage, meaning that the number of substitutions (1.5 per year) in the evolution between humans and chimpanzees was too high to be explained by beneficial mutations.

## Functional constraint

The neutral theory holds that as functional constraint diminishes, the probability that a mutation is neutral rises, and so should the rate of sequence divergence.

When comparing various proteins, extremely high evolutionary rates were observed in proteins such as fibrinopeptides and the C chain of the proinsulin molecule, which both have little to no functionality compared to their active molecules. Kimura and Ohta also estimated that the alpha and beta chains on the surface of a hemoglobin protein evolve at a rate almost ten times faster than the inside pockets, which would imply that the overall molecular structure of hemoglobin is less significant than the inside where the iron-containing heme groups reside.[13]

There is evidence that rates of nucleotide substitution are particularly high in the third position of a codon, where there is little functional constraint.[14] This view is based in part on the degenerate genetic code, in which sequences of three nucleotides (codons) may differ and yet encode the same amino acid (GCC and GCA both encode alanine, for example). Consequently, many potential single-nucleotide changes are in effect "silent" or "unexpressed" (see synonymous or silent substitution). Such changes are presumed to have little or no biological effect.[15]

## Quantitative theory

Kimura also developed the infinite sites model (ISM) to provide insight into evolutionary rates of mutant alleles. If ${\displaystyle v}$ were to represent the rate of mutation of gametes per generation of ${\displaystyle N}$ individuals, each with two sets of chromosomes, the total number of new mutants in each generation is ${\displaystyle 2Nv}$. Now let ${\displaystyle k}$ represent the evolution rate in terms of a mutant allele ${\displaystyle \mu }$ becoming fixed in a population.[16]

${\displaystyle k=2Nv\mu }$

According to ISM, selectively neutral mutations appear at rate ${\displaystyle \mu }$ in each of the ${\displaystyle 2N}$ copies of a gene, and fix with probability ${\displaystyle 1/(2N)}$. Because any of the ${\displaystyle 2N}$ genes have the ability to become fixed in a population, ${\displaystyle 1/2N}$ is equal to ${\displaystyle \mu }$, resulting in the rate of evolutionary rate equation:

${\displaystyle k=v}$

This means that if all mutations were neutral, the rate at which fixed differences accumulate between divergent populations is predicted to be equal to the per-individual mutation rate, independent of population size. When the proportion of mutations that are neutral is constant, so is the divergence rate between populations. This provides a rationale for the molecular clock - which predated neutral theory.[17] The ISM also demonstrates a constancy that is observed in molecular lineages.

This stochastic process is assumed to obey equations describing random genetic drift by means of accidents of sampling, rather than for example genetic hitchhiking of a neutral allele due to genetic linkage with non-neutral alleles. After appearing by mutation, a neutral allele may become more common within the population via genetic drift. Usually, it will be lost, or in rare cases it may become fixed, meaning that the new allele becomes standard in the population.

According to the neutral theory of molecular evolution, the amount of genetic variation within a species should be proportional to the effective population size.

## The "neutralist–selectionist" debate

A heated debate arose when Kimura's theory was published, largely revolving around the relative percentages of polymorphic and fixed alleles that are "neutral" versus "non-neutral".

A genetic polymorphism means that different forms of particular genes, and hence of the proteins that they produce, are co-existing within a species. Selectionists claimed that such polymorphisms are maintained by balancing selection, while neutralists view the variation of a protein as a transient phase of molecular evolution.[1] Studies by Richard K. Koehn and W. F. Eanes demonstrated a correlation between polymorphism and molecular weight of their molecular subunits.[18] This is consistent with the neutral theory assumption that larger subunits should have higher rates of neutral mutation. Selectionists, on the other hand, contribute environmental conditions to be the major determinants of polymorphisms rather than structural and functional factors.[16]

According to the neutral theory of molecular evolution, the amount of genetic variation within a species should be proportional to the effective population size. Levels of genetic diversity vary much less than census population sizes, giving rise to the "paradox of variation" .[19] While high levels of genetic diversity were one of the original arguments in favor of neutral theory, the paradox of variation has been one of the strongest arguments against neutral theory.

There are a large number of statistical methods for testing whether neutral theory is a good description of evolution (e.g., McDonald-Kreitman test[20]), and many authors claimed detection of selection (Fay et al. 2002,[21] Begun et al. 2007,[22] Shapiro et al. 2007,[23] Hahn 2008,[24] Akey 2009,[25] Kern 2018[26]). Some researchers have nevertheless argued that the neutral theory still stands, while expanding the definition of neutral theory to include background selection at linked sites.[27]

## Nearly neutral theory

Tomoko Ohta also emphasized the importance of nearly neutral mutations, in particularly slightly deleterious mutations.[28] The population dynamics of nearly neutral mutations are only slightly different from those of neutral mutations unless the absolute magnitude of the selection coefficient is greater than 1/N, where N is the effective population size in respect of selection.[1][11][12] The value of N may therefore affect how many mutations can be treated as neutral and how many as deleterious.[citation needed]

## Constructive Neutral Evolution

The groundworks for the theory of constructive neutral evolution (CNE) was laid by two papers in the 1990s.[29][30][31] Constructive neutral evolution is a theory which suggests that complex structures and processes can emerge through neutral transitions. Although a separate theory altogether, the emphasis on neutrality as a process whereby neutral alleles are randomly fixed by genetic drift finds some inspiration from the earlier attempt by the neutral theory to invoke its importance in evolution.[31] Conceptually, there are two components A and B (which may represent two proteins) which interact with each other. A, which performs a function for the system, does not depend on its interaction with B for its functionality, and the interaction itself may have randomly arisen in an individual with the ability to disappear without an effect on the fitness of A. This present yet currently unnecessary interaction is therefore called an "excess capacity" of the system. However, a mutation may occur which compromises the ability of A to perform its function independently. However, the A:B interaction that has already emerged sustains the capacity of A to perform its initial function. Therefore, the emergence of the A:B interaction "presuppresses" the deleterious nature of the mutation, making it a neutral change in the genome that is capable of spreading through the population via random genetic drift. Hence, A has gained a dependency on its interaction with B.[32] In this case, the loss of B or the A:B interaction would have a negative effect on fitness and so purifying selection would eliminate individuals where this occurs. While each of these steps are individually reversible (for example, A may regain the capacity to function independently or the A:B interaction may be lost), a random sequence of mutations tends to further reduce the capacity of A to function independently and a random walk through the dependency space may very well result in a configuration in which a return to functional independence of A is far too unlikely to occur, which makes CNE a one-directional or "ratchet-like" process.[33] CNE, which does not invoke adaptationist mechanisms for the origins of more complex systems (which involve more parts and interactions contributing to the whole), has seen application in the understanding of the evolutionary origins of the spliceosomal eukaryotic complex, RNA editing, additional ribosomal proteins beyond the core, the emergence of long-noncoding RNA from junk DNA, and so forth.[34][35][36][37] In some cases, ancestral sequence reconstruction techniques have afforded the ability for experimental demonstration of some proposed examples of CNE, as in heterooligomeric ring protein complexes in some fungal lineages.[38]

CNE has also been put forwards as the null hypothesis for explaining complex structures, and thus adaptationist explanations for the emergence of complexity must be rigorously tested on a case-by-case basis against this null hypothesis prior to acceptance. Grounds for invoking CNE as a null include that it does not presume that changes offered an adaptive benefit to the host or that they were directionally selected for, while maintaining the importance of more rigorous demonstrations of adaptation when invoked so as to avoid the excessive flaws of adaptationism criticized by Gould and Lewontin.[39][3][40]

## Empirical evidence for the neutral theory

One of corollaries of the neutral theory is that the efficiency of positive selection is higher in population or species with higher effective population size.[41] This relationship between the effective population size and selection efficiency was evidenced by genomic studies of species including chimpanzee and human[41] and domesticated species.[42]

## References

1. ^ a b c d Kimura, Motoo (1983). The neutral theory of molecular evolution. Cambridge University Press. ISBN 978-0-521-31793-1.
2. ^ Fenchel, Tom (2005-11-11). "Cosmopolitan microbes and their 'cryptic' species". Aquatic Microbial Ecology. 41 (1): 49–54. doi:10.3354/ame041049. ISSN 0948-3055.
3. ^ a b Koonin, Eugene V. (2016). "Splendor and misery of adaptation, or the importance of neutral null for understanding evolution". BMC Biology. 14 (1): 114. doi:10.1186/s12915-016-0338-2. ISSN 1741-7007. PMC 5180405. PMID 28010725.
4. ^ Lahr, Daniel J. G.; Laughinghouse, Haywood Dail; Oliverio, Angela M.; Gao, Feng; Katz, Laura A. (2014). "How discordant morphological and molecular evolution among microorganisms can revise our notions of biodiversity on Earth: Prospects & Overviews". BioEssays. 36 (10): 950–959. doi:10.1002/bies.201400056. PMC 4288574. PMID 25156897.
5. ^ Freese, E. (July 1962). "On the evolution of the base composition of DNA". Journal of Theoretical Biology. 3 (1): 82–101. Bibcode:1962JThBi...3...82F. doi:10.1016/S0022-5193(62)80005-8.
6. ^ Freese, E.; Yoshida, A. (1965). "The role of mutations in evolution.". In Bryson, V.; Vogel, H. J. (eds.). Evolving Genes and Proteins. New York: Academic. pp. 341–355.
7. ^ Fisher R.A. 1930. The distribution of gene ratios for rare mutations. Proceedings of the Royal Society of Edinburgh volume 50, pages 205-230.
8. ^ R.J. Berry, T.J. Crawford, G.M. Hewitt 1992. Genes in Ecology. Blackwell Scientific Publications, Oxford. pp.29-54 J.R.G.Turner: Stochastic processes in populations: the horse behind the cart?.
9. ^ Kimura, Motoo (February 1968). "Evolutionary rate at the molecular level". Nature. 217 (5129): 624–6. Bibcode:1968Natur.217..624K. doi:10.1038/217624a0. PMID 5637732. S2CID 4161261.
10. ^ King, J. L.; Jukes, T. H. (May 1969). "Non-Darwinian evolution". Science. 164 (3881): 788–98. Bibcode:1969Sci...164..788L. doi:10.1126/science.164.3881.788. PMID 5767777.
11. ^ a b Nei, Masatoshi (December 2005). "Selectionism and neutralism in molecular evolution". Molecular Biology and Evolution. 22 (12): 2318–2342. doi:10.1093/molbev/msi242. PMC 1513187. PMID 16120807.
12. ^ a b Nei, Masatoshi (2013). Mutation-driven evolution. Oxford University Press.
13. ^ Kimura, M. (1969-08-01). "The Rate of Molecular Evolution Considered from the Standpoint of Population Genetics". Proceedings of the National Academy of Sciences. 63 (4): 1181–1188. Bibcode:1969PNAS...63.1181K. doi:10.1073/pnas.63.4.1181. ISSN 0027-8424. PMC 223447. PMID 5260917.
14. ^ Bofkin, L.; Goldman, N. (2006-11-13). "Variation in Evolutionary Processes at Different Codon Positions". Molecular Biology and Evolution. 24 (2): 513–521. doi:10.1093/molbev/msl178. ISSN 0737-4038. PMID 17119011.
15. ^ Crick, F.H.C. (1989), "Codon—Anticodon Pairing: The Wobble Hypothesis", Molecular Biology, Elsevier, pp. 370–377, doi:10.1016/b978-0-12-131200-8.50026-5, ISBN 978-0-12-131200-8, retrieved 2021-04-03
16. ^ a b Kimura, Motoo (November 1979). "The neutral theory of molecular evolution". Scientific American. 241 (5): 98–100, 102, 108 passim. Bibcode:1979SciAm.241e..98K. doi:10.1038/scientificamerican1179-98. JSTOR 24965339. PMID 504979.
17. ^ Zuckerkandl, Emile; Pauling, Linus B. (1962). "Molecular disease, evolution, and genetic heterogeneity". In Kasha, M.; Pullman, B. (eds.). Horizons in Biochemistry. Academic Press. pp. 189–225.
18. ^ Eanes, Walter F. (November 1999). "Analysis of Selection on Enzyme Polymorphisms". Annual Review of Ecology and Systematics. 30 (1): 301–326. doi:10.1146/annurev.ecolsys.30.1.301.
19. ^ Lewontin, Richard C. (1973). The genetic basis of evolutionary change (4th printing ed.). Columbia University Press. ISBN 978-0231033923.
20. ^ Kreitman, M. (2000). "Methods to detect selection in populations with applications to the human". Annual Review of Genomics and Human Genetics. 1 (1): 539–59. doi:10.1146/annurev.genom.1.1.539. PMID 11701640.
21. ^ Fay, J. C.; Wyckoff, G. J.; Wu, C. I. (February 2002). "Testing the neutral theory of molecular evolution with genomic data from Drosophila". Nature. 415 (6875): 1024–6. Bibcode:2002Natur.415.1024F. doi:10.1038/4151024a. PMID 11875569. S2CID 4420010.
22. ^ Begun, D. J.; Holloway, A. K.; Stevens, K.; Hillier, L. W.; Poh, Y. P.; Hahn, M. W.; Nista, P. M.; Jones, C. D.; Kern, A. D.; Dewey, C. N.; Pachter, L.; Myers, E.; Langley, C. H. (November 2007). "Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans". PLOS Biology. 5 (11): e310. doi:10.1371/journal.pbio.0050310. PMC 2062478. PMID 17988176.
23. ^ Shapiro, J. A.; Huang, W.; Zhang, C.; Hubisz, M. J.; Lu, J.; Turissini, D. A.; Fang, S.; Wang, H. Y.; Hudson, RR; Nielsen, R.; Chen, Z.; Wu, C. I. (February 2007). "Adaptive genic evolution in the Drosophila genomes". PNAS. 104 (7): 2271–6. Bibcode:2007PNAS..104.2271S. doi:10.1073/pnas.0610385104. PMC 1892965. PMID 17284599.
24. ^ Hahn, M. W. (February 2008). "Toward a selection theory of molecular evolution". Evolution; International Journal of Organic Evolution. 62 (2): 255–65. doi:10.1111/j.1558-5646.2007.00308.x. PMID 18302709.
25. ^ Akey, J. M. (May 2009). "Constructing genomic maps of positive selection in humans: where do we go from here?". Genome Research. 19 (5): 711–22. doi:10.1101/gr.086652.108. PMC 3647533. PMID 19411596.
26. ^ Kern, A. D.; Hahn, M. W. (June 2018). "The Neutral Theory in Light of Natural Selection". Molecular Biology and Evolution. 35 (6): 1366–1371. doi:10.1093/molbev/msy092. PMC 5967545. PMID 29722831.
27. ^ Jensen, J.D.; Payseur, B. A.; Stephan, W.; Aquadro C. F.; Lynch, M. Charlesworth, D.; Charlesworth, B. (January 2019). "The importance of the Neutral Theory in 1968 and 50 years on: A response to Kern and Hahn 2018". Evolution; International Journal of Organic Evolution. 73 (1): 111–114. doi:10.1111/evo.13650. PMC 6496948. PMID 30460993.{{cite journal}}: CS1 maint: multiple names: authors list (link)
28. ^ Ohta, T. (December 2002). "Near-neutrality in evolution of genes and gene regulation". Proceedings of the National Academy of Sciences of the United States of America. 99 (25): 16134–7. Bibcode:2002PNAS...9916134O. doi:10.1073/pnas.252626899. PMC 138577. PMID 12461171.
29. ^ Covello, Patrick S.; Gray, MichaelW. (1993). "On the evolution of RNA editing". Trends in Genetics. 9 (8): 265–268. doi:10.1016/0168-9525(93)90011-6. PMID 8379005.
30. ^ Stoltzfus, Arlin (1999). "On the Possibility of Constructive Neutral Evolution". Journal of Molecular Evolution. 49 (2): 169–181. Bibcode:1999JMolE..49..169S. doi:10.1007/PL00006540. ISSN 0022-2844. PMID 10441669. S2CID 1743092.
31. ^ a b Muñoz-Gómez, Sergio A.; Bilolikar, Gaurav; Wideman, Jeremy G.; Geiler-Samerotte, Kerry (2021-04-01). "Constructive Neutral Evolution 20 Years Later". Journal of Molecular Evolution. 89 (3): 172–182. Bibcode:2021JMolE..89..172M. doi:10.1007/s00239-021-09996-y. ISSN 1432-1432. PMC 7982386. PMID 33604782.
32. ^ Speijer, Dave (2011). "Does constructive neutral evolution play an important role in the origin of cellular complexity?: Making sense of the origins and uses of biological complexity". BioEssays. 33 (5): 344–349. doi:10.1002/bies.201100010. PMID 21381061. S2CID 205470421.
33. ^ Stoltzfus, Arlin (2012-10-13). "Constructive neutral evolution: exploring evolutionary theory's curious disconnect". Biology Direct. 7 (1): 35. doi:10.1186/1745-6150-7-35. ISSN 1745-6150. PMC 3534586. PMID 23062217.
34. ^ Gray, Michael W.; Lukeš, Julius; Archibald, John M.; Keeling, Patrick J.; Doolittle, W. Ford (2010-11-12). "Irremediable Complexity?". Science. 330 (6006): 920–921. Bibcode:2010Sci...330..920G. doi:10.1126/science.1198594. ISSN 0036-8075. PMID 21071654. S2CID 206530279.
35. ^ Lukeš, Julius; Archibald, John M.; Keeling, Patrick J.; Doolittle, W. Ford; Gray, Michael W. (2011). "How a neutral evolutionary ratchet can build cellular complexity". IUBMB Life. 63 (7): 528–537. doi:10.1002/iub.489. PMID 21698757. S2CID 7306575.
36. ^ Lamech, Lilian T.; Mallam, Anna L.; Lambowitz, Alan M. (2014-12-23). Herschlag, Daniel (ed.). "Evolution of RNA-Protein Interactions: Non-Specific Binding Led to RNA Splicing Activity of Fungal Mitochondrial Tyrosyl-tRNA Synthetases". PLOS Biology. 12 (12): e1002028. doi:10.1371/journal.pbio.1002028. ISSN 1545-7885. PMC 4275181. PMID 25536042.
37. ^ Palazzo, Alexander F.; Koonin, Eugene V. (2020-11-25). "Functional Long Non-coding RNAs Evolve from Junk Transcripts". Cell. 183 (5): 1151–1161. doi:10.1016/j.cell.2020.09.047. PMID 33068526. S2CID 222815635.
38. ^ Finnigan, Gregory C.; Hanson-Smith, Victor; Stevens, Tom H.; Thornton, Joseph W. (2012-01-09). "Evolution of increased complexity in a molecular machine". Nature. 481 (7381): 360–364. Bibcode:2012Natur.481..360F. doi:10.1038/nature10724. ISSN 0028-0836. PMC 3979732. PMID 22230956.
39. ^ Gould, S. J.; Lewontin, R. C.; Maynard Smith, J.; Holliday, Robin (1979-09-21). "The spandrels of San Marco and the Panglossian paradigm: a critique of the adaptationist programme". Proceedings of the Royal Society of London. Series B. Biological Sciences. 205 (1161): 581–598. Bibcode:1979RSPSB.205..581G. doi:10.1098/rspb.1979.0086. PMID 42062. S2CID 2129408.
40. ^ Brunet, T. D. P.; Doolittle, W. Ford (2018-03-19). "The generality of Constructive Neutral Evolution". Biology & Philosophy. 33 (1): 2. doi:10.1007/s10539-018-9614-6. ISSN 1572-8404. S2CID 90290787.
41. ^ a b Bakewell, Margaret A.; Shi, Peng; Zhang, Jianzhi (May 2007). "More genes underwent positive selection in chimpanzee evolution than in human evolution". Proceedings of the National Academy of Sciences. 104 (18): 7489–7494. Bibcode:2007PNAS..104.7489B. doi:10.1073/pnas.0701705104. ISSN 0027-8424. PMC 1863478. PMID 17449636.
42. ^ Chen, Jianhai; Ni, Pan; Li, Xinyun; Han, Jianlin; Jakovlić, Ivan; Zhang, Chengjun; Zhao, Shuhong (2018-01-19). "Population size may shape the accumulation of functional mutations following domestication". BMC Evolutionary Biology. 18 (1): 4. doi:10.1186/s12862-018-1120-6. ISSN 1471-2148. PMC 5775542. PMID 29351740.