DNA-binding domain
From Wikipedia, the free encyclopedia
A DNA-binding domain (DBD) is an independently folded protein domain which contains at least one motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence (a recognition sequence) or have a general affinity to DNA.[1] Some DNA-binding domains may also include nucleic acids in their folded structure.
Contents |
[edit] Function
One or more DNA-binding domains are often part of a larger protein consisting of additional domains with differing function. The additional domains often regulate the activity of the DNA-binding domain. The function of DNA binding is either structural or involving transcription regulation, with the two roles sometimes overlapping.
DNA-binding domains with functions involving DNA structure have biological roles in the replication, repair, storage, and modification of DNA, such as methylation.
Many proteins involved in the regulation of gene expression contain DNA-binding domains. For example, proteins that regulate transcription by binding DNA are called transcription factors. The final output of most cellular signaling cascades is gene regulation.
The DBD interacts with the nucleotides of DNA in a DNA sequence-specific or non-sequence-specific manner, but even non-sequence-specific recognition involves some sort of molecular complementarity between protein and DNA. DNA recognition by the DBD can occur at the major or minor groove of DNA, or at the sugar-phosphate DNA backbone (see the structure of DNA). Each specific type of DNA recognition is tailored to the protein's function. For example, the DNA-cutting enzyme DNAse I cuts DNA almost randomly and so must bind to DNA in a non-sequence-specific manner. But even so, DNAse I recognizes a certain 3-D DNA structure, yielding a somewhat specific DNA cleavage pattern that can be useful for studying DNA recognition by a technique called DNA footprinting.
Many DNA-binding domains must recognize specific DNA sequences, such as DBD's of transcription factors that activate specific genes, or those of enzymes that modify DNA at specific sites, like restriction enzymes and telomerase. The hydrogen bonding pattern in the DNA major groove is less degenerate than that of the DNA minor groove, providing a more attractive site for sequence-specific DNA recognition.
The specificity of DNA-binding proteins can be studied using many biochemical and biophysical techniques, such as gel electrophoresis, analytical ultracentrifugation, calorimetry, DNA mutation, protein structure mutation or modification, nuclear magnetic resonance, x-ray crystallography, surface plasmon resonance, electron paramagnetic resonance, cross-linking.
[edit] Types of DNA-binding domains
[edit] Helix-turn-helix
Originally discovered in bacteria, this motif is commonly found in repressor proteins and is about 20 amino acids long. In eukaryotes, the homeodomain comprises 3 helices, of which the third recognizes the DNA (aka recognition helix). They are common in proteins that regulate developmental processes (PROSITE HTH).
[edit] Zinc finger
This domain is 30 amino acids long and consists of a recognition helix and a 2-strand beta-sheet. The domain also contains four regularly spaced ligands for Zinc (either histidines or cysteines). The Zn ion stabilizes the 3D structure of the domain. Each finger contains one Zn ion and recognizes a specific triplet of DNA basepairs.
[edit] Leucine zipper
The basic leucine zipper (bZIP) domain contains an alpha helix with a leucine at every 7th amino acid. If two such helices find one another, the leucines can interact as the teeth in a zipper, allowing dimerization of two proteins. When binding to the DNA, basic amino acid residues bind to the sugar-phosphate backbone while the helices sit in the major grooves.It regulates gene expression.
[edit] Winged helix
Consisting of about 110 amino acids, the winged helix (WH) domain has four helices and a two-strand beta-sheet.
[edit] Winged helix turn helix
The winged helix turn helix domain (wHTH) SCOP 46785 is typically 85-90 amino acids long. It is formed by a 3-helical bundle and a 4-strand beta-sheet (wing).
[edit] Helix-loop-helix
This domain is found in some transcription factors and is characterized by two α helices connected by a loop. One helix is typically smaller and due to the flexibility of the loop, allows dimerization by folding and packing against another helix. The larger helix typically contains the DNA binding regions.
[edit] Unusual DNA binding domains
[edit] Immunoglobulin fold
The domain (IPR013783) consists of a beta-sheet structure with large connecting loops, which serve to recognize either DNA major grooves or antigens. Usually found in immunoglobulin proteins, they are also present in Stat proteins of the cytokine pathway. This is likely because the cytokine pathway evolved relatively recently and has made use of systems that were already functional, rather than creating its own.
[edit] B3 domain
The B3 DBD (IPR003340, SCOP 117343) is found exclusively in transcription factors from higher plants and consists of 100-120 residues. It includes seven beta sheets and two alpha helices which form a DNA-binding pseudobarrel protein fold.
[edit] See also
[edit] References
- ^ Lilley, David M. J. (1995). DNA-protein: structural interactions. Oxford: IRL Press at Oxford University Press. ISBN 0-19-963453-X.
[edit] External links
- DBD database of predicted transcription factors Kummerfeld SK, Teichmann SA. (2006). "DBD: a transcription factor prediction database". Nucleic Acids Res. 34 (Database issue): D74-81. doi:. PMID 16381825. Uses a curated set of DNA-binding domains to predict transcription factors in all completely sequenced genomes
- Table of DNA-binding motifs
- MeSH DNA+Footprinting
- MeSH DNA-Binding+Proteins
- DNA-binding domains in PROSITE