# Nucleic acid structure

(Redirected from DNA structure)

Nucleic acid structure refers to the structure of nucleic acids such as DNA and RNA. Chemically speaking, DNA and RNA are very similar. Nucleic acid structure is often divided into four different levels: primary, secondary, tertiary and quaternary.

## Primary structure

Main article: Nucleic acid sequence
Chemical structure of DNA

Primary structure consists of a linear sequence of nucleotides that are linked together by phosphodiester bonds. It is this linear sequence of nucleotides that make up the primary structure of DNA or RNA. Nucleotides consist of 3 components:

1. Nitrogenous base
2. Guanine
3. Cytosine
4. Thymine(present in DNA only)
5. Uracil (present in RNA only)
2. 5-carbon sugar which is called deoxyribose (found in DNA) and ribose (found in RNA).
3. One or more phosphate groups.[1]

The nitrogen bases adenine and guanine are purine in structure and form a glycosidic bond between their 9' nitrogen and the 1' -OH group of the deoxyribose. Cytosine, thymine and uracil are pyrimidines, hence the glycosidic bonds forms between their 1' nitrogen and the 1' -OH of the deoxyribose. For both the purine and pyrimidine bases, the phosphate group forms a bond with the deoxyribose sugar through an ester bond between one of its negatively charged oxygen groups and the 5' -OH of the sugar.[2] The polarity in DNA and RNA is derived from the oxygen and nitrogen atoms in the backbone. Nucleic acids are formed when nucleotides come together through phosphodiester linkages between the 5' and 3' carbon atoms.[3] A Nucleic acid sequence is the order of nucleotides within a DNA (GACT) or RNA (GACU) molecule that is determined by a series of letters. Sequences are presented from the 5' to 3' end and determine the covalent structure of the entire molecule. Sequences can be complementary to another sequence in that the base on each position is complementary as well as in the reverse order. An example of a complementary sequence to AGCT is TCGA. DNA is double-stranded containing both a sense strand and an antisense strand. Therefore, the complementary sequence will be to the sense strand.[4]

Nucleic acid design can be used to create nucleic acid complexes with complicated secondary structures such as this four-arm junction. These four strands associate into this structure because it maximizes the number of correct base pairs, with A's matched to T's and C's matched to G's. Image from Mao, 2004.[5]

## Secondary structure

Secondary structure is the set of interactions between bases, i.e., parts of which is strands are bound to each other. In DNA double helix, the two strands of DNA are held together by hydrogen bonds. The nucleotides on one strand base pairs with the nucleotide on the other strand. The secondary structure is responsible for the shape that the nucleic acid assumes. The bases in the DNA are classified as Purines and Pyrimidines. The purines are Adenine and Guanine. Purines consist of a double ring structure, a six membered and a five membered ring containing nitrogen. The pyrimidine are Cytosine and Thymine. It has a single ringed structure, a six membered ring containing nitrogen. A purine base always pairs with a pyrimidine base (Guanosine (G) pairs with Cytosine(C)and Adenine(A) pairs with Thymine (T) or Uracil (U). DNA's secondary structure is predominantly determined by base-pairing of the two polynucleotide strands wrapped around each other to form a double helix. There is also a major groove and a minor groove on the double helix.

The secondary structure of RNA consists of a single polynucleotide. Base pairing in RNA occurs when RNA folds between complementarity regions. Both single- and double-stranded regions are often found in RNA molecules. The antiparallel strands form a helical shape.[3] The four basic elements in the secondary structure of RNA are helices, loops, bulges, and junctions. Stem-loop or hairpin loop is the most common element of RNA secondary structure.[6] Stem-loop is formed when the RNA chains fold back on themselves to form a double helical tract called the stem, the unpaired nucleotides forms single stranded region called the loop.[7] Secondary structure of RNA can be predicted by experimental data on the secondary structure elements, helices, loops and bulges. Bulges and internal loops are formed by separation of the double helical tract on either one strand (bulge) or on both strands (internal loops) by unpaired nucleotides. A Tetraloop is a four-base pairs hairpin RNA structure. There are three common families of tetraloop in ribosomal RNA: UNCG, GNRA, and CUUG (N is one of the four nucleotides and R is a purine).UNCG is the most stable tetraloop.[8] Pseudoknot is a RNA secondary structure first identified in turnip yellow mosaic virus.[9] Pseudoknots are formed when nucleotides from the hairpin loop pairs with a single stranded region outside of the hairpin to form a helical segment. H-type fold pseudoknots are best characterized. In H-type fold, nucleotides in the hairpin loop pairs with the bases outside the hairpin stem forming second stem and loop. This causes formation of pseudoknots with two stems and two loops.[10] Pseudoknots are functional elements in RNA structure having diverse function and found in most classes of RNA. DotKnot-PW method is used for comparative pseudoknots prediction .The main points in the DotKnot-PW method is scoring the similarities found in stems, secondary elements and H-type pseudoknots.[11]

## Tertiary structure

DNA structure and bases
A-B-Z-DNA Side View

Tertiary structure is the locations of the atoms in three-dimensional space, taking into consideration geometrical and steric constraints. A higher order than the secondary structure in which large-scale folding in a linear polymer occurs and the entire chain is folded into a specific 3-dimensional shape. There are 4 areas in which the structural forms of DNA can differ.

1. Handedness - right or left
2. Length of the helix turn
3. Number of base pairs per turn
4. Difference in size between the major and minor grooves[3]

The tertiary arrangement of DNA's double helix in space includes B-DNA, A-DNA and Z-DNA.

B-DNA is the most common form of DNA in vivo and is more narrow, elongated helix than A-DNA. Its wide major groove makes it more accessible to proteins. On the other hand, it has a narrow minor groove. B-DNAs favored conformations occurs at high water concentrations and the hydration of the minor groove appears to favor B-DNA. B-DNA base pairs nearly perpendicular to helix axis. The sugar pucker which determines the shape of the a-helix, whether the helix will exist in the A-form or in the B-form occurs at the C2'-endo.[12]

A-DNA is shorter and wider than helix B. Most RNA and RNA-DNA duplex in this form. A-DNA has a deep, narrow major groove which does not make it easily accessible to proteins. On the other hand, its wide, shallow minor groove makes it accessible to proteins but with lower information content than the major groove. Its favored conformation is at low water concentrations. A-DNAs base pairs tilt to helix axis and are displaced from axis. The sugar pucker occurs at the C3'-endo and in RNA 2'-OH inhibits C2'-endo conformation.[12]

Z-DNA is a relatively rare left-handed double-helix. Given the proper sequence and superhelical tension, it can be formed in vivo but its function is unclear. It has a more narrow, more elongated helix than A or B. Z-DNA's major groove is not really groove and it has a narrow minor groove. The most favored conformation occurs when there are high salt concentrations. There are some base substitutions but requires an alternating purine-pyrimidine sequence. The N2-amino of G H-bonds to 5' PO which explains the slow exchange of protons and the need for the G purine. Z-DNA base pairs nearly perpendicular to the helix axis. Z-DNA does not contain single base-pairs but rather a GpC repeat with P-P distances varying for GpC and CpG. On the GpC stack there is good base overlap whereas on the CpG stack there is less overlap. Z-DNA's zigzag backbone is due to the C sugar conformation compensating for G glycosidic bond conformation. The conformation of G is syn, C2'-endo and for C it is anti, C3'-endo.[12]

Linear DNA molecule having free ends can rotate to adjust to changes of various dynamic processes in the cell by changing the number of times two chains of the double helix twist around each other. Some DNA molecules are circular and are topologically constrained. A covalently closed, circular DNA also known as cccDNA is topologically constrained as the number of times the chains coiled around one other cannot change. This cccDNA can be supercoiled which is the tertiary structure of DNA. Supercoiling is characterized by the linking number, twist and writhe. The Linking number Lk for circular DNA is defined as the number of times one strand would have to pass through the other strand to completely separate the two strands. The linking number for circular DNA can only be changed by breaking of a covalent bond in one of the two strands. Linking number is always an integer. The Linking number of a cccDNA is sum of two components twists (Tw) and writhes (Wr).[13]

$Lk = Tw + Wr$

Twists are the number of times the two strands of DNA are twisted around each other. Writhes are number of times the DNA helix crosses over itself. DNA in cell is negatively supercoiled and has the tendency to unwind. Hence the separation of strand is easier in negatively supercoiled DNA than the relaxed DNA. The two component of supercoiled DNA, are solenoid and plectonemic. The plectonemic supercoil is found in prokaryotes and the solenoidal supercoiling is mostly seen in eukaryotes.

## Quaternary structure

DNA to Chromatin

The quaternary structure of nucleic acids is similar to that of protein quaternary structure. Although some of the concepts are not exactly the same, the quaternary structure refers to a higher-level of organization of nucleic acids. Moreover, it refers to interactions of the nucleic acids with other molecules. The most commonly seen form of higher-level organization of nucleic acids is seen in the form of chromatin which leads to its interactions with the small proteins histones. Also, the quaternary structure refers to the interactions between separate RNA units in the ribosome or spliceosome.[14]

## Notes and references

1. ^ Krieger M, Scott MP, Matsudaira PT, Lodish HF, Darnell JE, Lawrence Z, Kaiser C, Berk A (2004). "Section 4.1: Structure of Nucleic Acids". Molecular cell biology. New York: W.H. Freeman and CO. ISBN 0-7167-4366-3.
2. ^ "Structure of Nucleic Acids". SparkNotes.
3. ^ a b c Anthony-Cahill SJ; Mathews CK, van Holde KE, Appling DR (2012). Biochemistry (4th Edition). Englewood Cliffs, N.J: Prentice Hall. ISBN 0-13-800464-1.
4. ^ Alberts B, Johnson A, Lewis J, Raff M, Roberts K & Wlater P (2002). Molecular Biology of the Cell (4th ed.). New York NY: Garland Science. ISBN 0-8153-3218-1.
5. ^ Mao, Chengde (December 2004). "The Emergence of Complexity: Lessons from DNA". PLOS Biology 2 (12): 2036–2038. doi:10.1371/journal.pbio.0020431. ISSN 1544-9173. PMC 535573. PMID 15597116.
6. ^ Tinoco I, Jr; Bustamante, C (Oct 22, 1999). "How RNA folds.". Journal of Molecular Biology 293 (2): 271–81. doi:10.1006/jmbi.1999.3001. PMID 10550208.
7. ^
8. ^ Hollyfield, JG; Besharse, JC; Rayborn, ME (December 1976). "The effect of light on the quantity of phagosomes in the pigment epithelium.". Experimental eye research 23 (6): 623–35. doi:10.1016/0014-4835(76)90221-9. PMID 1087245.
9. ^ Rietveld, K; Van Poelgeest, R; Pleij, CW; Van Boom, JH; Bosch, L (Mar 25, 1982). "The tRNA-like structure at the 3' terminus of turnip yellow mosaic virus RNA. Differences and similarities with canonical tRNA". Nucleic Acids Research 10 (6): 1929–46. doi:10.1093/nar/10.6.1929. PMC 320581. PMID 7079175.
10. ^ Staple, DW; Butcher, SE (June 2005). "Pseudoknots: RNA structures with diverse functions". PLoS Biology 3 (6): e213. doi:10.1371/journal.pbio.0030213. PMC 1149493. PMID 15941360.
11. ^ Sperschneider, J; Datta, A; Wise, MJ (Dec 1, 2012). "Predicting pseudoknotted structures across two RNA sequences". Bioinformatics (Oxford, England) 28 (23): 3058–65. doi:10.1093/bioinformatics/bts575. PMC 3516145. PMID 23044552.
12. ^ a b c Dickerson RE, Drew HR, Conner BN, Wing RM, Fratini AV, Kopka ML (April 1982). "The anatomy of A-, B-, and Z-DNA". Science 216 (4545): 475–85. doi:10.1126/science.7071593. PMID 7071593.
13. ^ Mirkin SM (2001). "DNA Topology: Fundamentals". Encyclopedia of Life Sciences. doi:10.1038/npg.els.0001038. ISBN 0470016175.
14. ^ "Strucual Biochemistry/Nucleic Acid/DNA/DNA Structure". Retrieved 11 December 2012.