Coding region

The coding region of a gene, also known as the coding sequence or CDS (from coding DNA sequence), is the portion of a gene's DNA or RNA, composed of exons, that codes for protein. The region is bounded at its 5' end by a start codon and at its 3' end by a stop codon. In mRNA, the coding region is flanked by the five prime untranslated region (5'-UTR) and the three prime untranslated region (3'-UTR), which are also parts of the exons.[1] The CDS is the portion of an mRNA transcript that is translated by a ribosome.

The coding region of an organism is the portion of the organism's genome that consists of the coding regions of its genes, taken together.[2]
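As a rough illustration of this genome-wide sense, the fraction of a genome occupied by coding regions can be computed from a list of CDS coordinates. The sketch below is only an example under stated assumptions: the function name `coding_fraction`, the 0-based half-open coordinate convention, and the sample numbers are all hypothetical and do not come from any annotation format or dataset. Overlapping intervals are counted once.

```python
def coding_fraction(cds_intervals, genome_length):
    """Return the fraction of the genome covered by CDS intervals.

    Assumption: intervals are (start, end) pairs, 0-based and half-open,
    on a single linear sequence of length genome_length.
    """
    covered = 0
    current_end = 0  # right edge of the region already counted
    for start, end in sorted(cds_intervals):
        start = max(start, current_end)  # skip any part already counted
        if end > start:
            covered += end - start
            current_end = end
    return covered / genome_length


# Hypothetical example: three annotated coding regions, two of which overlap.
example = [(0, 100), (50, 150), (300, 400)]
print(coding_fraction(example, 1000))
```

Sorting the intervals first allows overlaps to be handled in a single pass, so bases shared by two annotated coding regions are not counted twice.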

Difference from cDNA

A cDNA sequence is derived from the transcript by reverse transcription, so it also contains the 5' and 3' UTRs, which are not part of the CDS (they are transcribed, but not translated). Because the CDS excludes the UTRs, it almost always starts with an AUG codon and ends at one of the three stop codons (UAA, UGA, UAG). These stop codons are called the stop signal.
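The relationship between a full transcript and its CDS can be sketched in a few lines of Python. This is a simplified illustration, not a description of how real annotation works: it assumes that translation begins at the first AUG in the sequence and ends at the first in-frame stop codon, which does not hold for every transcript. The function name `extract_cds` and the example sequence are hypothetical.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}


def extract_cds(transcript, start_codon="AUG"):
    """Split a transcript into (5'-UTR, CDS, 3'-UTR).

    Assumption: the CDS starts at the first AUG and ends at the first
    in-frame stop codon. Returns None if either signal is missing.
    """
    # Accept cDNA-style input by converting T to U.
    seq = transcript.upper().replace("T", "U")
    start = seq.find(start_codon)
    if start == -1:
        return None
    # Read codon by codon in the frame set by the start codon.
    for i in range(start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            end = i + 3  # the stop codon is included in the CDS
            return seq[:start], seq[start:end], seq[end:]
    return None


# Hypothetical transcript: 5'-UTR + CDS + 3'-UTR
print(extract_cds("GGCACCAUGGCUUGGUAACCGUUU"))
# → ('GGCACC', 'AUGGCUUGGUAA', 'CCGUUU')
```

The same call on the cDNA form of the sequence (with T in place of U) gives the same split, which reflects the point above: the cDNA carries the UTRs, while the CDS is only the middle segment.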

Coding sequence annotation

While identification of open reading frames within a DNA sequence is straightforward, identifying coding sequences is not, because the cell translates only a subset of all open reading frames into proteins.[3] Current CDS prediction relies on sampling and sequencing mRNA from cells, although it remains difficult to determine which parts of a given mRNA are actually translated into protein. CDS prediction is a subset of gene prediction, which also covers DNA sequences that code for other functional elements, such as RNA genes and regulatory sequences.
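To show why finding open reading frames is the easy part, the following minimal sketch scans all six reading frames of a DNA sequence for runs that begin with ATG and end at the first in-frame stop codon. It assumes the start-to-stop definition of an ORF (some tools use stop-to-stop instead) and the standard stop codons; the names `find_orfs`, `reverse_complement`, and the `min_codons` parameter are hypothetical. Nothing in it indicates which ORFs the cell actually translates, which is the hard problem described above.

```python
STOP_CODONS = {"TAA", "TGA", "TAG"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def find_orfs(dna, min_codons=2):
    """List (strand, frame, orf) for ATG...stop runs in all six frames.

    Assumption: an ORF runs from an ATG to the first in-frame stop codon;
    min_codons counts codons including the start and the stop.
    """
    dna = dna.upper()
    orfs = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            start = None  # index of the currently open ATG, if any
            for i in range(frame, len(seq) - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    orf = seq[start:i + 3]
                    if len(orf) // 3 >= min_codons:
                        orfs.append((strand, frame, orf))
                    start = None  # look for the next ORF in this frame
    return orfs


print(find_orfs("ATGAAATAGCC"))
# → [('+', 0, 'ATGAAATAG')]
```

On real genomic sequence a scan like this typically reports far more candidates than there are known protein-coding genes, which is why the evidence from sequenced mRNA mentioned above is needed.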

References

  1. Twyman, Richard (1 August 2003). "Gene Structure". The Wellcome Trust. Archived from the original on 28 March 2007. Retrieved 6 April 2003.
  2. Goto, Mami; et al. (8 April 2000). "Analysis of CpG Dinucleotide Frequency in Bacterial Genomes with Respect to Genomic Regions and Codon" (PDF). The Fourth Annual International Conference on Computational Molecular Biology, Tokyo, Japan. Retrieved 6 April 2009.
  3. Furuno, Masaaki; Kasukawa, Takeya; Saito, Rintaro; Adachi, Jun; Suzuki, Harukazu; Baldarelli, Richard; Hayashizaki, Yoshihide; Okazaki, Yasushi (September 2011). "CDS Annotation in Full-Length cDNA Sequence". Genome Research. Cold Spring Harbor Laboratory Press. 21 (9): 1478–1487. doi:10.1101/gr.1060303. Retrieved 18 September 2011.