Neutral mutations are changes in DNA sequence that are neither beneficial nor detrimental to the ability of an organism to survive and reproduce. In population genetics, mutations in which natural selection does not affect the spread of the mutation in a species are termed neutral mutations. Neutral mutations that are inheritable and not linked to any genes under selection will either be lost or will replace all other alleles of the gene. This loss or fixation of the gene proceeds based on random sampling known as genetic drift. A neutral mutation that is in linkage disequilibrium with other alleles that are under selection may proceed to loss or fixation via genetic hitchhiking and/or background selection.
While many mutations in a genome may decrease an organism’s ability to survive and reproduce, also known as fitness, these mutations are selected against and not passed on to future generations. The most commonly observed mutations detectable as variation in the genetic makeup of organisms and populations appear to have no visible effect on the fitness of individuals and are therefore neutral. The identification and study of neutral mutations has led to the development of the neutral theory of molecular evolution. The neutral theory of molecular evolution is an important and often controversial theory proposing that most molecular variation within and among species is essentially neutral and not acted on by selection. Neutral mutations are also the basis for using molecular clocks to identify such evolutionary events as speciation and adaptive or evolutionary radiations.
Charles Darwin commented on the idea of neutral mutation in his work, hypothesizing that mutations that do not give an advantage or disadvantage may fluctuate or become fixed apart from natural selection. "Variations neither useful nor injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps we see in certain polymorphic species, or would ultimately become fixed, owing to the nature of the organism and the nature of the conditions." While Darwin is widely credited with introducing the idea of natural selection which was the focus of his studies, he also saw the possibility for changes that did not benefit or hurt an organism.
Darwin's view of change being mostly driven by traits that provide advantage was widely accepted until the 1960s. While researching mutations that produce nucleotide substitutions in 1968, Motoo Kimura found that the rate of substitution was so high that if each mutation improved fitness, the gap between the most fit and typical genotype would be implausibly large. However, Kimura explained this rapid rate of mutation by suggesting that the majority of mutations were neutral, i.e. had little or no effect on the fitness of the organism. Kimura developed mathematical models of the behavior of neutral mutations subject to random genetic drift in biological populations. This theory has become known as the neutral theory of molecular evolution.
As technology has allowed for better analysis of genomic data, research has continued in this area. While natural selection may encourage adaptation to a changing environment, neutral mutation may push divergence of species due to nearly random genetic drift.
Impact on evolutionary theory
Neutral mutation has become a part of the neutral theory of molecular evolution, proposed in the 1960s. This theory suggests that neutral mutations are responsible for a large portion of DNA sequence changes in a species. For example, bovine and human insulin, while differing in amino acid sequence are still able to perform the same function. The amino acid substitutions between species were seen therefore to be neutral or not impactful to the function of the protein. Neutral mutation and the neutral theory of molecular evolution are not separate from natural selection but add to Darwin's original thoughts. Mutations can give an advantage, create a disadvantage, or make no measurable difference to an organism's survival.
A number of observations associated with neutral mutation were predicted in neutral theory including: amino acids with similar biochemical properties should be substituted more often than biochemically different amino acids; synonymous base substitutions should be observed more often than nonsynonymous substitutions; introns should evolve at the same rate as synonymous mutations in coding exons; and pseudogenes should also evolve at a similar rate. These predictions have been confirmed with the introduction of additional genetic data since the theory’s introduction.
Synonymous mutation of bases
When an incorrect nucleotide is inserted during replication or transcription of a coding region, it can affect the eventual translation of the sequence into amino acids. Since multiple codons are used for the same amino acids, a change in a single base may still lead to translation of the same amino acid. This phenomenon is referred to as degeneracy and allows for a variety of codon combinations leading to the same amino acid being produced. For example, the codes TCT, TCC, TCA, TCG, AGT, and AGC all code for the amino acid serine. This can be explained by the wobble concept. Francis Crick proposed this theory to explain why specific tRNA molecules could recognize multiple codons. The area of the tRNA that recognizes the codon called the anticodon is able to bind multiple interchangeable bases at its 5' end due to its spatial freedom. A fifth base called inosine can also be substituted on a tRNA and is able to bind with A, U, or C. This flexibility allows for changes in bases in codons leading to translation of the same amino acid. The changing of a base in a codon without the changing of the translated amino acid is called a synonymous mutation. Since the amino acid translated remains the same a synonymous mutation has traditionally been considered a neutral mutation. Some research has suggested that there is bias in selection of base substitution in synonymous mutation. This could be due to selective pressure to improve translation efficiency associated with the most available tRNAs or simply mutational bias. If these mutations influence the rate of translation or an organism’s ability to manufacture protein they may actually influence the fitness of the affected organism.
|Amino acids biochemical properties||nonpolar||polar||basic||acidic||Termination: stop codon|
|T||TTT||(Phe/F) Phenylalanine||TCT||(Ser/S) Serine||TAT||(Tyr/Y) Tyrosine||TGT||(Cys/C) Cysteine||T|
|TTA||(Leu/L) Leucine||TCA||TAA||Stop (Ochre)[B]||TGA||Stop (Opal)[B]||A|
|TTG[A]||TCG||TAG||Stop (Amber)[B]||TGG||(Trp/W) Tryptophan||G|
|C||CTT||CCT||(Pro/P) Proline||CAT||(His/H) Histidine||CGT||(Arg/R) Arginine||T|
|A||ATT||(Ile/I) Isoleucine||ACT||(Thr/T) Threonine||AAT||(Asn/N) Asparagine||AGT||(Ser/S) Serine||T|
|ATA||ACA||AAA||(Lys/K) Lysine||AGA||(Arg/R) Arginine||A|
|G||GTT||(Val/V) Valine||GCT||(Ala/A) Alanine||GAT||(Asp/D) Aspartic acid||GGT||(Gly/G) Glycine||T|
|GTA||GCA||GAA||(Glu/E) Glutamic acid||GGA||A|
- A The codon ATG both codes for methionine and serves as an initiation site: the first ATG in an mRNA's coding region is where translation into protein begins. The other start codons listed by GenBank are rare in eukaryotes and generally codes for Met/fMet.
- B ^ ^ ^ The historical basis for designating the stop codons as amber, ochre and opal is described in an autobiography by Sydney Brenner and in a historical article by Bob Edgar.
Neutral amino acid substitution
While substitution of a base in a noncoding area of a genome may make little difference and be considered neutral, base substitutions in or around genes may impact the organism. Some base substitutions lead to synonymous mutation and no difference in the amino acid translated as noted above. However, a base substitution can also change the genetic code so that a different amino acid is translated. This sort of substitution usually has a negative effect on the protein being formed and will be eliminated from the population through purifying selection. However, if the change has a positive influence, the mutation may become more and more common in a population until it becomes a fixed genetic piece of that population. Organisms changing via these two options comprise the classic view of natural selection. A third possibility is that the amino acid substitution makes little or no positive or negative difference to the affected protein. Proteins demonstrate some tolerance to changes in amino acid structure. This is somewhat dependent on where in the protein the substitution takes place. If it occurs in an important structural area or in the active site, one amino acid substitution may inactivate or substantially change the functionality of the protein. Substitutions in other areas may be nearly neutral and drift randomly over time.
Identification and measurement of neutrality
Neutral mutations are measured in population and evolutionary genetics often by looking at variation in populations. These have been measured historically by gel electrophoresis to determine allozyme frequencies. Statistical analyses of this data is used to compare variation to predicted values based on population size, mutation rates and effective population size. Early observations that indicated higher than expected heterozygosity and overall variation within the protein isoforms studied, drove arguments as to the role of selection in maintaining this variation versus the existence of variation through the effects of neutral mutations arising and their random distribution due to genetic drift. The accumulation of data based on observed polymorphism led to the formation of the neutral theory of evolution. According to the neutral theory of evolution, the rate of fixation in a population of a neutral mutation will be directly related to the rate of formation of the neutral allele.
In Kimura’s original calculations, mutations with |2 Ns|<1 or |s|≤1/(2N) are defined as neutral. In this equation, N is the effective population size and is a quantitative measurement of the ideal population size that assumes such constants as equal sex ratios and no emigration, migration, mutation nor selection. Conservatively, it is often assumed that effective population size is approximately one fifth of the total population size. s is the selection coefficient and is a value between 0 and 1. It is a measurement of the contribution of a genotype to the next generation where a value of 1 would be completely selected against and make no contribution and 0 is not selected against at all. This definition of neutral mutation has been criticized due to the fact that very large effective population sizes can make mutations with small selection coefficients appear non neutral. Additionally, mutations with high selection coefficients can appear neutral in very small populations. The testable hypothesis of Kimura and others showed that polymorphism within species are approximately that which would be expected in a neutral evolutionary model.
For many molecular biology approaches, as opposed to mathematical genetics, neutral mutations are generally assumed to be those mutations which cause no appreciable effect on gene function. This simplification eliminates the effect of minor allelic differences in fitness and avoids problems when selection has only a minor effect.
Early convincing evidence of this definition of neutral mutation was shown through the lower mutational rates in functionally important parts of genes such as cytochrome c versus less important parts and the functionally interchangeable nature of mammalian cytochrome c in in vitro studies. Nonfunctional pseudogenes provide more evidence for the role of neutral mutations in evolution. The rates of mutation in mammalian globin pseudogenes has been shown to be much higher than rates in functional genes. According to neo-Darwinian evolution, such mutations should rarely exist as these sequences are functionless and positive selection would not be able to operate.
The McDonald-Kreitman test has been used to study selection over long periods of evolutionary time. This is a statistical test that compares polymorphism in neutral and functional sites and estimates what fraction of substitutions have been acted on by positive selection. The test often uses synonymous substitutions in protein coding genes as the neutral component; however, synonymous mutations have been shown to be under purifying selection in many instances.
Molecular clocks can be used to estimate the amount of time since divergence of two species and for placing evolutionary events in time. Pauling and Zuckerkandl, proposed the idea of the molecular clock in 1962 based on the observation that the random mutation process occurs at an approximate constant rate. Individual proteins were shown to have linear rates of amino acid changes over evolutionary time. Despite controversy from some biologists arguing that morphological evolution would not proceed at a constant rate, many amino acid changes were shown to accumulate in a constant fashion. Kimura and Ohta explained these rates as part of the framework of the neutral theory. These mutations were reasoned to be neutral as positive selection should be rare and deleterious mutations should be eliminated quickly from a population. By this reasoning, the accumulation of these neutral mutations should only be influenced by the mutation rate. Therefore, the neutral mutation rate in individual organisms should match the molecular evolution rate in species over evolutionary time. The neutral mutation rate is affected by the amount of neutral sites in a protein or DNA sequence versus the amount of mutation in sites that are functionally constrained. By quantifying these neutral mutations in protein and/or DNA and comparing them between species or other groups of interest, rates of divergence can be determined.
Molecular clocks have caused controversy due to the dates they derive for events such as explosive radiations seen after extinction events like the Cambrian explosion and the radiations of mammals and birds. Two-fold differences exist in dates derived from molecular clocks and the fossil record. While some paleontologists argue that molecular clocks are systemically inaccurate, others attribute the discrepancies to lack of robust fossil data and bias in sampling. While not without constancy and discrepancies with the fossil record, the data from molecular clocks have shown how evolution is dominated by the mechanisms of a neutral model and is less influenced by the action of natural selection.
- Darwin, C. (1987; 1859). On the origin of species by means of natural selection : Or the preservation of favoured races in the struggle for life (Special ed.). Birmingham, Ala.: Gryphon Editions.
- Duret, L. (2008). "Neutral theory: The null hypothesis of molecular evolution". Nature Education. 1 (1): 803–6. 218.
- Kimura, Motoo (1983). The Neutral Theory of Molecular Evolution. Cambridge University Press. ISBN 978-1-139-93567-8.
- Nei, M; Suzuki, Y; Nozawa, M (2010). "The neutral theory of molecular evolution in the genomic era". Annual Review of Genomics and Human Genetics. 11: 265–89. doi:10.1146/annurev-genom-082908-150129. PMID 20565254.
- Watson, James D.; Baker, Tania A.; Bell, Stephen P.; Gann, Alexander; Levine, Michael; Losik, Richard; Harrison, Stephen C. (2013). Molecular biology of the gene (7th ed.). Benjamin-Cummings. pp. 573–6. ISBN 978-0321762436.
- Venetianer, Pál (1 January 2012). "Are synonymous codons indeed synonymous?". Biomolecular Concepts. 3 (1): 21–8. doi:10.1515/bmc.2011.050. PMID 25436522.
- Duret, L (December 2002). "Evolution of synonymous codon usage in metazoans". Current Opinion in Genetics & Development. 12 (6): 640–9. doi:10.1016/s0959-437x(02)00353-2. PMID 12433576.
- Nakamoto T (March 2009). "Evolution and the universality of the mechanism of initiation of protein synthesis". Gene. 432 (1–2): 1–6. doi:10.1016/j.gene.2008.11.001. PMID 19056476.
- Blattner, F. R.; Plunkett g, G.; Bloch, C. A.; Perna, N. T.; Burland, V.; Riley, M.; Collado-Vides, J.; Glasner, J. D.; Rode, C. K.; Mayhew, G. F.; Gregor, J.; Davis, N. W.; Kirkpatrick, H. A.; Goeden, M. A.; Rose, D. J.; Mau, B.; Shao, Y. (1997). "The Complete Genome Sequence of Escherichia coli K-12". Science. 277 (5331): 1453–1462. doi:10.1126/science.277.5331.1453. PMID 9278503.
- Brenner S. A Life in Science (2001) Published by Biomed Central Limited ISBN 0-9540278-0-9 see pages 101-104
- Edgar B (2004). "The genome of bacteriophage T4: an archeological dig". Genetics. 168 (2): 575–82. PMC 1448817. PMID 15514035. see pages 580-581
- Ng, PC; Henikoff, S (2006). "Predicting the effects of amino acid substitutions on protein function". Annual Review of Genomics and Human Genetics. 7: 61–80. doi:10.1146/annurev.genom.7.080505.115630. PMID 16824020.
- Guo, HH; Choe, J; Loeb, LA (22 June 2004). "Protein tolerance to random amino acid change". Proc. Natl. Acad. Sci. U.S.A. 101 (25): 9205–10. doi:10.1073/pnas.0403255101. PMC 438954. PMID 15197260.
- Lewontin, RC (August 1991). "Twenty-five years ago in Genetics: electrophoresis in the development of evolutionary genetics: milestone or millstone?". Genetics. 128 (4): 657–62. PMC 1204540. PMID 1916239.
- Kimura, Motoo (17 February 1968). "Evolutionary Rate at the Molecular Level". Nature. 217 (5129): 624–6. doi:10.1038/217624a0. PMID 5637732.
- Lewontin, RC; Hubby, JL (August 1966). "A molecular approach to the study of genic heterozygosity in natural populations. II. Amount of variation and degree of heterozygosity in natural populations of Drosophila pseudoobscura". Genetics. 54 (2): 595–609. PMC 1211186. PMID 5968643.
- Nei, M (December 2005). "Selectionism and neutralism in molecular evolution". Molecular Biology and Evolution. 22 (12): 2318–42. doi:10.1093/molbev/msi242. PMC 1513187. PMID 16120807.
- Tomizawa, J (20 June 2000). "Derivation of the relationship between neutral mutation and fixation solely from the definition of selective neutrality". Proc. Natl. Acad. Sci. U.S.A. 97 (13): 7372–5. doi:10.1073/pnas.97.13.7372. PMC 16552. PMID 10861006.
- Harmon, Luke J.; Braude, Stanton (2009). "12: Conservation of Small Populations: Effective Population Sizes, Inbreeding, and the 50/500 Rule". In Braude, Stanton; Low, Bobbi S. (eds.). An introduction to methods and models in ecology, evolution, and conservation biology. Princeton University Press. pp. 125–8. ISBN 9780691127248.
- Mace, Georgina M.; Lande, Russell (June 1991). "Assessing Extinction Threats: Toward a Reevaluation of IUCN Threatened Species Categories". Conservation Biology. 5 (2): 148–157. doi:10.1111/j.1523-1739.1991.tb00119.x. JSTOR 2386188.
- Ridley, Mark (2004). Evolution (3rd ed.). Blackwell. ISBN 978-1-4051-0345-9.
- Yamazaki, T.; Maruyama, T. (6 October 1972). "Evidence for the Neutral Hypothesis of Protein Polymorphism". Science. 178 (4056): 56–58. doi:10.1126/science.178.4056.56. PMID 5070515.
- Nei, M; Graur, D (1984). Extent of protein polymorphism and the neutral mutation theory. Evolutionary Biology. 17. pp. 73–118. doi:10.1007/978-1-4615-6974-9_3. ISBN 978-1-4615-6976-3.
- Dickerson, RE (1971). "The structures of cytochrome c and the rates of molecular evolution". Journal of Molecular Evolution. 1 (1): 26–45. doi:10.1007/bf01659392. PMID 4377446.
- Jacobs, EE; Sanadi, DR (February 1960). "The reversible removal of cytochrome c from mitochondria". The Journal of Biological Chemistry. 235 (2): 531–4. PMID 14406362.
- Li, Wen-Hsiung; Gojobori, Takashi; Nei, Masatoshi (16 July 1981). "Pseudogenes as a paradigm of neutral evolution". Nature. 292 (5820): 237–9. doi:10.1038/292237a0. PMID 7254315.
- Miyata, T; Yasunaga, T (September 1980). "Molecular evolution of mRNA: a method for estimating evolutionary rates of synonymous and amino acid substitutions from homologous nucleotide sequences and its application". Journal of Molecular Evolution. 16 (1): 23–36. doi:10.1007/bf01732067. PMID 6449605.
- McDonald, JH; Kreitman, M (20 June 1991). "Adaptive protein evolution at the Adh locus in Drosophila". Nature. 351 (6328): 652–4. doi:10.1038/351652a0. PMID 1904993.
- Egea, R; Casillas, S; Barbadilla, A (1 July 2008). "Standard and generalized McDonald-Kreitman test: a website to detect selection by comparing different classes of DNA sites". Nucleic Acids Research. 36 (Web Server issue): W157–62. doi:10.1093/nar/gkn337. PMC 2447769. PMID 18515345.
- Hellmann, I; Zollner, S; Enard, W; Ebersberger, I; Nickel, B; Paabo, S (May 2003). "Selection on human genes as revealed by comparisons to chimpanzee cDNA". Genome Research. 13 (5): 831–7. doi:10.1101/gr.944903. PMC 430916. PMID 12727903.
- Zhou, T; Gu, W; Wilke, CO (August 2010). "Detecting positive and purifying selection at synonymous sites in yeast and worm". Molecular Biology and Evolution. 27 (8): 1912–22. doi:10.1093/molbev/msq077. PMC 2915641. PMID 20231333.
- Bromham, L; Penny, D (March 2003). "The modern molecular clock". Nature Reviews Genetics. 4 (3): 216–24. doi:10.1038/nrg1020. PMID 12610526.
- Zuckerkandl, E.; Pauling, L. (1962). "Molecular Disease, Evolution and Genetic Heterogeneity". In Kasha, M.; Pullman, B. (eds.). Horizons in Biochemistry: Albert Szent-Györgyi dedicatory volume. New York: Academic Press. pp. 189–225. OCLC 174774459.
- Kimura, Motoo; Ohta, Tomoko (March 1971). "On the rate of molecular evolution". Journal of Molecular Evolution. 1 (1): 1–17. doi:10.1007/BF01659390. PMID 5173649.
- Kumar, S (August 2005). "Molecular clocks: four decades of evolution". Nature Reviews Genetics. 6 (8): 654–62. doi:10.1038/nrg1659. PMID 16136655.
- Smith, Andrew B.; Peterson, Kevin J. (May 2002). "DATING THE TIME OF ORIGIN OF MAJOR CLADES: Molecular Clocks and the Fossil Record". Annual Review of Earth and Planetary Sciences. 30 (1): 65–88. doi:10.1146/annurev.earth.30.091201.140057.