Nucleic acid thermodynamics
Nucleic acid thermodynamics is the study of how temperature affects the nucleic acid structure of double-stranded DNA (dsDNA). The melting temperature (Tm) is defined as the temperature at which half of the DNA strands are in the random coil or single-stranded (ssDNA) state. Tm depends on the length of the DNA molecule and its specific nucleotide sequence. DNA, when in a state where its two strands are dissociated (i.e., the dsDNA molecule exists as two independent strands), is referred to as having been denatured by the high temperature.
Hybridization is the process of establishing a non-covalent, sequence-specific interaction between two or more complementary strands of nucleic acids into a single complex, which in the case of two strands is referred to as a duplex. Oligonucleotides, DNA, or RNA will bind to their complement under normal conditions, so two perfectly complementary strands will bind to each other readily. In order to reduce the diversity and obtain the most energetically preferred complexes, a technique called annealing is used in laboratory practice. However, due to the different molecular geometries of the nucleotides, a single inconsistency between the two strands will make binding between them less energetically favorable. Measuring the effects of base incompatibility by quantifying the temperature at which two strands anneal can provide information as to the similarity in base sequence between the two strands being annealed. The complexes may be dissociated by thermal denaturation, also referred to as melting. In the absence of external negative factors, the processes of hybridization and melting may be repeated in succession indefinitely, which lays the ground for polymerase chain reaction. Most commonly, the pairs of nucleic bases A=T and G≡C are formed, of which the latter is more stable.
DNA denaturation, also called DNA melting, is the process by which double-stranded deoxyribonucleic acid unwinds and separates into single-stranded strands through the breaking of hydrophobic stacking attractions between the bases. See Hydrophobic effect. Both terms are used to refer to the process as it occurs when a mixture is heated, although "denaturation" can also refer to the separation of DNA strands induced by chemicals like formamide or urea.
The process of DNA denaturation can be used to analyze some aspects of DNA. Because cytosine / guanine base-pairing is generally stronger than adenosine / thymine base-pairing, the amount of cytosine and guanine in a genome (called the "GC content") can be estimated by measuring the temperature at which the genomic DNA melts. Higher temperatures are associated with high GC content.
DNA denaturation can also be used to detect sequence differences between two different DNA sequences. DNA is heated and denatured into single-stranded state, and the mixture is cooled to allow strands to rehybridize. Hybrid molecules are formed between similar sequences and any differences between those sequences will result in a disruption of the base-pairing. On a genomic scale, the method has been used by researchers to estimate the genetic distance between two species, a process known as DNA-DNA hybridization. In the context of a single isolated region of DNA, denaturing gradient gels and temperature gradient gels can be used to detect the presence of small mismatches between two sequences, a process known as temperature gradient gel electrophoresis.
Methods of DNA analysis based on melting temperature have the disadvantage of being proxies for studying the underlying sequence; DNA sequencing is generally considered a more accurate method.
The process of DNA melting is also used in molecular biology techniques, notably in the polymerase chain reaction. Although the temperature of DNA melting is not diagnostic in the technique, methods for estimating Tm are important for determining the appropriate temperatures to use in a protocol. DNA melting temperatures can also be used as a proxy for equalizing the hybridization strengths of a set of molecules, e.g. the oligonucleotide probes of DNA microarrays.
Annealing, in genetics, means for complementary sequences of single-stranded DNA or RNA to pair by hydrogen bonds to form a double-stranded polynucleotide. The term is often used to describe the binding of a DNA probe, or the binding of a primer to a DNA strand during a polymerase chain reaction. The term is also often used to describe the reformation (renaturation) of reverse-complementary strands that were separated by heat (thermally denatured). Proteins such as RAD52 can help DNA anneal.
Thermodynamics of the two-state model
Several formulas are used to calculate Tm values. Some formulas are more accurate in predicting melting temperatures of DNA duplexes. For DNA oligonucleotides, i.e. short sequences of DNA, the thermodynamics of hybridization can be accurately described as a two-state process. In this approximation one neglects the possibility of intermediate partial binding states in the formation of a double strand state from two single stranded oligonucleotides. Under this assumption one can elegantly describe the thermodynamic parameters for forming double-stranded nucleic acid AB from single-stranded nucleic acids A and B.
- AB ↔ A + B
The equilibrium constant for this reaction is . According to the Van´t Hoff equation, the relation between free energy, ΔG, and K is ΔG° = -RTln K, where R is the ideal gas law constant, and T is the kelvin temperature of the reaction. This gives, for the nucleic acid system,
The melting temperature, Tm, occurs when half of the double-stranded nucleic acid has dissociated. If no additional nucleic acids are present, then [A], [B], and [AB] will be equal, and equal to half the initial concentration of double-stranded nucleic acid, [AB]initial. This gives an expression for the melting point of a nucleic acid duplex of
Because ΔG° = ΔH° -TΔS°, Tm is also given by
The terms ΔH° and ΔS° are usually given for the association and not the dissociation reaction (see the nearest-neighbor method for example). This formula then turns into:
, where [B]total < [A]total.
As mentioned, this equation is based on the assumption that only two states are involved in melting: the double stranded state and the random-coil state. However, nucleic acids may melt via several intermediate states. To account for such complicated behavior, the methods of statistical mechanics must be used, which is especially relevant for long sequences.
Estimating thermodynamic properties from nucleic acid sequence
The previous paragraph shows how melting temperature and thermodynamic parameters (ΔG° or ΔH° & ΔS°) are related to each other. From the observation of melting temperatures one can experimentally determine the thermodynamic parameters. Vice versa, and important for applications, when the thermodynamic parameters of a given nucleic acid sequence are known, the melting temperature can be predicted. It turns out that for oligonucleotides, these parameters can be well approximated by the nearest-neighbor model.
The interaction between bases on different strands depends somewhat on the neighboring bases. Instead of treating a DNA helix as a string of interactions between base pairs, the nearest-neighbor model treats a DNA helix as a string of interactions between 'neighboring' base pairs. So, for example, the DNA shown below has nearest-neighbor interactions indicated by the arrows.
- ↓ ↓ ↓ ↓ ↓
- 5' C-G-T-T-G-A 3'
- 3' G-C-A-A-C-T 5'
The free energy of forming this DNA from the individual strands, ΔG°, is represented (at 37 °C) as
ΔG°37(predicted) = ΔG°37(CG initiation) + ΔG°37(CG/GC) + ΔG°37(GT/CA) + ΔG°37(TT/AA) + ΔG°37(TG/AC) + ΔG°37(GA/CT) + ΔG°37(AT initiation)
The first term represents the free energy of the first base pair, CG, in the absence of a nearest neighbor. The second term includes both the free energy of formation of the second base pair, GC, and stacking interaction between this base pair and the previous base pair. The remaining terms are similarly defined. In general, the free energy of forming a nucleic acid duplex is
Each ΔG° term has enthalpic, ΔH°, and entropic, ΔS°, parameters, so the change in free energy is also given by
Values of ΔH° and ΔS° have been determined for the ten possible pairs of interactions. These are given in Table 1, along with the value of ΔG° calculated at 37 °C. Using these values, the value of ΔG37° for the DNA helix shown above is calculated to be −22.4 kJ/mol. The experimental value is −21.8 kJ/mol.
|Terminal A-T base pair||9.6||17.2||4.31|
|Terminal G-C base pair||0.4||−11.7||4.05|
The parameters associated with the ten groups of neighbors shown in table 1 are determined from melting points of short oligonucleotide duplexes. Curiously, it works out that only eight of the ten groups are independent.
The nearest-neighbor model can be extended beyond the Watson-Crick pairs to include parameters for interactions between mismatches and neighboring base pairs. This allows the estimation of the thermodynamic parameters of sequences containing isolated mismatches, like e.g. (arrows indicating mismatch)
- 5' G-G-A-C-T-G-A-C-G 3'
- 3' C-C-T-G-G-C-T-G-C 5'
These parameters have been fitted from melting experiments and an extension of Table 1 which includes mismatches can be found in literature.
A more realistic way of modeling the behavior of nucleic acids would seem to be to have parameters that depend on the neighboring groups on both sides of a nucleotide, giving a table with entries like "TCG/AGC". However, this would involve around 32 groups for Watson-Crick pairing and even more for sequences containing mismatches; the number of DNA melting experiments needed to get reliable data for so many groups would be inconveniently high. However, other means exist to access thermodynamic parameters of nucleic acids: microarray technology allows hybridization monitoring of tens of thousands sequences in parallel. This data, in combination with molecular adsorption theory allows the determination of many thermodynamic parameters in a single experiment and to go beyond the nearest neighbor model. In general the predictions from the nearest neighbor method agree reasonably well with experimental results, but some unexpected outlying sequences, calling for further insights, do exist.
- Melting point
- Primer (molecular biology) for calculations of Tm
- Base pair
- Complementary DNA
- Western blot
- M. Mandel and J. Marmur (1968). "Use of Ultraviolet Absorbance-Temperature Profile for Determining the Guanine plus Cytosine Content of DNA". Methods in Enzymology. Methods in Enzymology 12 (2): 198–206. doi:10.1016/0076-6879(67)12133-2. ISBN 978-0-12-181856-2.
- C.G. Sibley and J.E. Ahlquist (1984). "The Phylogeny of the Hominoid Primates, as Indicated by DNA-DNA Hybridization". Journal of Molecular Evolution 20 (1): 2–15. doi:10.1007/BF02101980. PMID 6429338.
- R.M. Myers, T. Maniatis, and L.S. Lerman (1987). "Detection and Localization of Single Base Changes by Denaturing Gradient Gel Electrophoresis". Methods in Enzymology. Methods in Enzymology 155: 501–527. doi:10.1016/0076-6879(87)55033-9. ISBN 978-0-12-182056-5. PMID 3431470.
- T. Po, G. Steger, V. Rosenbaum, J. Kaper, and D. Riesner (1987). "Double-stranded cucumovirus associated RNA 5: experimental analysis of necrogenic and non-necrogenic variants by temperature-gradient gel electrophoresis". Nucleic Acids Research 15 (13): 5069–5083. doi:10.1093/nar/15.13.5069. PMC 305948. PMID 3601667.
- Breslauer, K.J.; Frank, R; Blöcker, H; Marky, LA; et al. (1986). "Predicting DNA Duplex Stability from the Base Sequence". Proc. Natl. Acad. Sci. USA. 83 (11): 3746–3750. doi:10.1073/pnas.83.11.3746. PMC 323600. PMID 3459152. (pdf)
- Rychlik, W.; Spencer, W. J.; Rhoads, R. E. (1990). "Optimization of the annealing temperature for DNA amplification in vitro". Nucleic Acids Res. 18 (21): 6409–6412. doi:10.1093/nar/18.21.6409. PMC 332522. PMID 2243783.
- Owczarzy R., Vallone P.M., Gallo F.J., Paner T.M., Lane M.J. and Benight A.S (1997). "Predicting sequence-dependent melting stability of short duplex DNA oligomers". Biopolymers 44 (3): 217–239. doi:10.1002/(SICI)1097-0282(1997)44:3<217::AID-BIP3>3.0.CO;2-Y. PMID 9591477. (pdf)
- John SantaLucia Jr. (1998). "A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics". Proc. Natl. Acad. Sci. USA 95 (4): 1460–5. doi:10.1073/pnas.95.4.1460. PMC 19045. PMID 9465037.
- John SantaLucia Jr., John; Donald Hicks (June 2004). "The thermodynamics of DNA structural motifs". Annual Review of Biophysics and Biomolecular Structure 33: 415–440. doi:10.1146/annurev.biophys.32.110601.141800. PMID 15139820. Retrieved 27 March 2013.
- Hooyberghs, J.; Van Hummelen, P.; Carlon, E. (2009). "The effects of mismatches on hybridization in DNA microarrays: Determination of nearest neighbor parameters". Nucleic Acids Research 37. doi:10.1093/nar/gkp109. PMID 19270064.
- Hadiwikarta, W. W.; Walter, J. C.; Hooyberghs, J.; Carlon, E. (2012). "Probing hybridization parameters from microarray experiments: Nearest-neighbor model and beyond". Nucleic Acids Research 40: e138. doi:10.1093/nar/gks475. PMID 22661582.
|Library resources about
Nucleic acid hybridization
- Tm calculations in OligoAnalyzer – Integrated DNA Technologies
- DNA thermodynamics calculations – Tm, melting profile, mismatches, free energy calculations
- Tm calculation – by bioPHP.org.
- Invitrogen Tm calculation
- AnnHyb Open Source software for Tm calculation using the Nearest-neighbour method
- Sigma-aldrich technical notes
- Primer3 calculation
- "Discovery of the Hybrid Helix and the First DNA-RNA Hybridization" by Alexander Rich
- uMelt: Melting Curve Prediction
- Sequence Tm Utility v1.5
- Nearest Neighbor Database: Provides a description of RNA-RNA interaction nearest neighbor parameters and examples of their use.