Synthetic genomics is a nascent field of synthetic biology that uses aspects of genetic modification on pre-existing life forms with the intent of producing some product or desired behavior on the part of the life form so created.
Synthetic genomics is unlike genetic modification in the sense that it does not use naturally occurring genes in its life forms. It may make use of custom designed base pair series, though in a more expanded and presently unrealized sense synthetic genomics could utilize genetic codes that are not composed of the two base pairs of DNA that are currently used by life.
The development of synthetic genomics is related to certain recent technical abilities and technologies in the field of genetics. The ability to construct long base pair chains cheaply and accurately on a large scale has allowed researchers to perform experiments on genomes that do not exist in nature. Coupled with the developments in protein folding models and decreasing computational costs the field synthetic genomics is beginning to enter a productive stage of vitality.
Recombinant DNA Technology
Soon after the discovery of restriction endonucleases and ligases, the field of genetics began using these molecular tools to assemble artificial sequences from smaller fragments of synthetic or naturally-occurring DNA. The advantage in using the recombinatory approach as opposed to continual DNA synthesis stems from the inverse relationship that exists between synthetic DNA length and percent purity of that synthetic length. In other words, as you synthesize longer sequences, the number of error-containing clones increases due to the inherent error rates of current technologies. Although recombinant DNA technology is more commonly used in the construction of fusion proteins and plasmids, several techniques with larger capacities have emerged, allowing for the construction of entire genomes.
Polymerase Cycling Assembly
Polymerase cycling assembly (PCA) uses a series of oligonucleotides (or oligos), approximately 40 to 60 nucleotides long, that altogether constitute both strands of the DNA being synthesized. These oligos are designed such that a single oligo from one strand contains a length of approximately 20 nucleotides at each end that is complementary to sequences of two different oligos on the opposite strand, thereby creating regions of overlap. The entire set is processed through cycles of: (a) hybridization at 60 °C; (b) elongation via Taq polymerase and a standard ligase; and (c) denaturation at 95 °C, forming progressively longer contiguous strands and ultimately resulting in the final genome. PCA was used to generate the first synthetic genome in history, that of the Phi X 174 virus.
Gibson Assembly Method
The Gibson Assembly Method, designed by Daniel Gibson during his time at the J. Craig Venter Institute, requires a set of double-stranded DNA cassettes that constitute the entire genome being synthesized. Note that cassettes differ from contigs by definition, in that these sequences contain regions of homology to other cassettes for the purposes of recombination. In contrast to Polymerase Cycling Assembly, Gibson Assembly is a single-step, isothermal reaction with larger sequence-length capacity; ergo, it is used in place of Polymerase Cycling Assembly for genomes larger than 6 kb.
A T5 exonuclease performs a chew-back reaction at the terminal segments, working in the 5' to 3' direction, thereby producing complimentary overhangs. The overhangs hybridize to each other, a Phusion DNA polymerase fills in any missing nucleotides and the nicks are sealed with a ligase. However, the genomes capable of being synthesized using this method alone is limited because as DNA cassettes increase in length, they require propagation in vitro in order to continue hybridizing; accordingly, Gibson assembly is often used in conjunction with Transformation-Associated Recombination (see below) to synthesize genomes several hundred kilobases in size.
The goal of Transformation-Associated Recombination (TAR) technology in synthetic genomics is to combine DNA contigs by means of Homologous recombination performed by the Yeast Artificial Chromosome (YAC). Of importance is the CEN element within the YAC vector, which corresponds to the yeast centromere. This sequence gives the vector the ability to behave in a chromosomal manner, thereby allowing it to perform homologous recombination.
First, Gap Repair Cloning is performed to generate regions of homology flanking the DNA contigs. Gap Repair Cloning is a particular form of the Polymerase Chain Reaction in which specialized primers with extensions beyond the sequence of the DNA target are utilized. Then, the DNA cassettes are exposed to the YAC vector, which drives the process of homologous recombination, thereby connecting the DNA cassettes. Polymerase Cycling Assembly and TAR technology were used together to construct the 600 kb Mycoplasma genitalium genome in 2008, the first synthetic organism ever created. Similar steps were taken in synthesizing the larger Mycoplasma mycoides genome a few years later.
Unnatural base pair (UBP)
An unnatural base pair (UBP) is a designed subunit (or nucleobase) of DNA which is created in a laboratory and does not occur in nature. In 2012, a group of American scientists led by Floyd Romesberg, a chemical biologist at the Scripps Research Institute in San Diego, California, published that his team designed an unnatural base pair (UBP). The two new artificial nucleotides or Unnatural Base Pair (UBP) were named d5SICS and dNaM. More technically, these artificial nucleotides bearing hydrophobic nucleobases, feature two fused aromatic rings that form a (d5SICS–dNaM) complex or base pair in DNA. In 2014 the same team from the Scripps Research Institute reported that they synthesized a stretch of circular DNA known as a plasmid containing natural T-A and C-G base pairs along with the best-performing UBP Romesberg's laboratory had designed, and inserted it into cells of the common bacterium E. coli that successfully replicated the unnatural base pairs through multiple generations. This is the first known example of a living organism passing along an expanded genetic code to subsequent generations. This was in part achieved by the addition of a supportive algal gene that expresses a nucleotide triphosphate transporter which efficiently imports the triphosphates of both d5SICSTP and dNaMTP into E. coli bacteria. Then, the natural bacterial replication pathways use them to accurately replicate the plasmid containing d5SICS–dNaM.
The successful incorporation of a third base pair is a significant breakthrough toward the goal of greatly expanding the number of amino acids which can be encoded by DNA, from the existing 20 amino acids to a theoretically possible 172, thereby expanding the potential for living organisms to produce novel proteins. The artificial strings of DNA do not encode for anything yet, but scientists speculate they could be designed to manufacture new proteins which could have industrial or pharmaceutical uses.
- Montague, Michael G; Lartigue, Carole; Vashee, Sanjay. "Synthetic genomics: potential and limitations". Current Opinion in Biotechnology 23 (5): 659–665. doi:10.1016/j.copbio.2012.01.014.
- Gibson, Daniel (2011). Synthetic Biology, Part B: Computer Aided Design and DNA Assembly; Chapter Fifteen - Enzymatic Assembly of Overlapping DNA Fragments. Academic Press. pp. 349–361. ISBN 978-0-12-385120-8.
- Stemmer, Willem P. C.; Crameri, Andreas; Ha, Kim D.; Brennan, Thomas M.; Heyneker, Herbert L. (1995-10-16). "Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides". Gene 164 (1): 49–53. doi:10.1016/0378-1119(95)00511-4. PMID 7590320.
- Smith, Hamilton O.; Hutchison, Clyde A.; Pfannkoch, Cynthia; Venter, J. Craig (2003-12-23). "Generating a synthetic genome by whole genome assembly: φX174 bacteriophage from synthetic oligonucleotides". Proceedings of the National Academy of Sciences 100 (26): 15440–15445. doi:10.1073/pnas.2237126100. ISSN 0027-8424. PMC 307586. PMID 14657399.
- Gibson, Daniel G; Young, Lei; Chuang, Ray-Yuan; Venter, J Craig; Hutchison, Clyde A; Smith, Hamilton O (2009-04-12). "Enzymatic assembly of DNA molecules up to several hundred kilobases". Nature Methods 6 (5): 343–345. doi:10.1038/nmeth.1318. PMID 19363495.
- Kouprina, Natalay; Larionov, Vladimir (2003-12-01). "Exploiting the yeast Saccharomyces cerevisiae for the study of the organization and evolution of complex genomes". FEMS Microbiology Reviews 27 (5): 629–649. doi:10.1016/S0168-6445(03)00070-6. ISSN 1574-6976. PMID 14638416.
- Marsischky, Gerald; LaBaer, Joshua (2004-10-15). "Many Paths to Many Clones: A Comparative Look at High-Throughput Cloning Methods". Genome Research 14 (10b): 2020–2028. doi:10.1101/gr.2528804. ISSN 1088-9051. PMID 15489321.
- Gibson, Daniel G.; Benders, Gwynedd A.; Andrews-Pfannkoch, Cynthia; Denisova, Evgeniya A.; Baden-Tillson, Holly; Zaveri, Jayshree; Stockwell, Timothy B.; Brownley, Anushka; Thomas, David W. (2008-02-29). "Complete Chemical Synthesis, Assembly, and Cloning of a Mycoplasma genitalium Genome". Science 319 (5867): 1215–1220. doi:10.1126/science.1151721. ISSN 0036-8075. PMID 18218864.
- Gibson, Daniel G.; Glass, John I.; Lartigue, Carole; Noskov, Vladimir N.; Chuang, Ray-Yuan; Algire, Mikkel A.; Benders, Gwynedd A.; Montague, Michael G.; Ma, Li (2010-07-02). "Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome". Science 329 (5987): 52–56. doi:10.1126/science.1190719. ISSN 0036-8075. PMID 20488990.
- Malyshev, Denis A.; Dhami, Kirandeep; Quach, Henry T.; Lavergne, Thomas; Ordoukhanian, Phillip (24 July 2012). "Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet". Proceedings of the National Academy of Sciences of the United States of America (PNAS) 109 (30): 12005–12010. Bibcode:2012PNAS..10912005M. doi:10.1073/pnas.1205176109. Retrieved 2014-05-11.
- Malyshev, Denis A.; Dhami, Kirandeep; Lavergne, Thomas; Chen, Tingjian; Dai, Nan; Foster, Jeremy M.; Corrêa, Ivan R.; Romesberg, Floyd E. (May 7, 2014). "A semi-synthetic organism with an expanded genetic alphabet". Nature 509: 385–8. doi:10.1038/nature13314. PMC 4058825. PMID 24805238. Retrieved May 7, 2014.
- Callaway, Ewan (May 7, 2014). "Scientists Create First Living Organism With 'Artificial' DNA". Nature News (Huffington Post). Retrieved 8 May 2014.
- Fikes, Bradley J. (May 8, 2014). "Life engineered with expanded genetic code". San Diego Union Tribune. Retrieved 8 May 2014.
- Sample, Ian (May 7, 2014). "First life forms to pass on artificial DNA engineered by US scientists". The Guardian. Retrieved 8 May 2014.
- Pollack, Andrew (May 7, 2014). "Scientists Add Letters to DNA’s Alphabet, Raising Hope and Fear". New York Times. Retrieved 8 May 2014.
- Synthetic Genomes: Technologies and Impact - A 2004 study completed for the DOE on the subject.
- Effects of Developments in Synthetic Genomics: Hearing before the Committee on Energy and Commerce, House of Representatives, One Hundred Eleventh Congress, Second Session, May 27, 2010