Whole genome sequencing
Whole genome sequencing (also known as full genome sequencing, complete genome sequencing, or entire genome sequencing) is a laboratory process that determines the complete DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.
Whole genome sequencing should not be confused with DNA profiling, which only determines the likelihood that genetic material came from a particular individual or group, and does not contain additional information on genetic relationships, origin or susceptibility to specific diseases. Also unlike full genome sequencing, SNP genotyping covers less than 0.1% of the genome. Almost all truly complete genomes are of microbes; the term "full genome" is thus sometimes used loosely to mean "greater than 95%". The remainder of this article focuses on nearly complete human genomes.
High-throughput genome sequencing technologies have largely been used as a research tool and are currently being introduced in the clinics. In the future of personalized medicine, whole genome sequence data will be an important tool to guide therapeutic intervention. The tool of gene sequencing at SNP level is also used to pinpoint functional variants from association studies and improve the knowledge available to researchers interested in evolutionary biology, and hence may lay the foundation for predicting disease susceptibility and drug response.
Also, by aligning the sequenced genomes, can be obtained somatic mutations produced as base substitutions.
- 1 A brief history of whole genome sequencing
- 2 Cells used for sequencing
- 3 Mutation frequencies in cancers
- 4 Early techniques
- 5 Current techniques
- 6 Commercialization
- 7 Disruption to DNA array market
- 8 Sequencing versus analysis
- 9 Diagnostic use and societal impact
- 10 Ethical concerns
- 11 People with public genome sequences
- 12 See also
- 13 References
- 14 External links
A brief history of whole genome sequencing
The shift from manual DNA sequencing methods such as Maxam-Gilbert sequencing and Sanger sequencing in the 1970s and 1980s to more rapid, automated sequencing methods in the 1990s played a crucial role in giving scientists the ability to sequence whole genomes. Haemophilus influenzae, a commensal bacterium which resides in the human respiratory tract was the first organism to have its entire genome sequenced (Figure 2.1). The entire genome of this bacterium was published in 1995. The genomes of H. influenzae, other Bacteria, and some Archaea were the first to be sequenced - largely due to their small genome size. H. influenzae has a genome of just over 1.8 nucleotide pairs in size. In contrast, eukaryotes, both unicellular and multicellular such as Amoeba dubia and humans (Homo sapiens) respectively, have much larger genomes (see C-value paradox). Amoeba dubia has a genome of 700 billion nucleotide pairs spread across thousands of chromosomes. Humans contain fewer nucleotide pairs (about 3.2 billion in each germ cell - note the exact size of the human genome is still being revised) than A. dubia however their genome size far outweighs the genome size of individual bacteria.
The first bacterial and archaeal genomes, including that of H. influenzae, were sequenced by Shotgun sequencing. In 1996, the first eukaryotic genome - that of the yeast Saccharomyces cerevisiae was sequenced. S. cerevisiae, a model organism in biology has a genome of only around 12 million nucleotide pairs. S. cerevisiae was the first unicellular eukaryote to have its whole genome sequenced. The first multicellular eukaryote, and animal, to have its whole genome sequenced was the nematode worm: Caenorhabditis elegans in 1998 (Figure 2.2). Eukaryotic genomes are sequenced by several methods including Shotgun sequencing of short DNA fragments and sequencing of larger DNA clones from DNA libraries (see library (biology)) such as Bacterial artificial chromosomes (BACs) and Yeast artificial chromosomes (YACs).
In 1999, the entire DNA sequence of human chromosome 22, the shortest human autosome, was published. By the year 2000, the second animal and second invertebrate (yet first insect) genome was sequenced - that of the fruit fly Drosophila melanogaster (Figure 2.3) - a popular choice of model organism in experimental research. The first plant genome - that of the model organism Arabidopsis thaliana - was also fully sequenced by 2000 (Figure 2.4). By 2001, a draft of the entire human genome sequence was published. The genome of the laboratory mouse Mus musculus was completed in 2002 (Figure 2.5).
Currently, thousands of genomes have been sequenced.
Cells used for sequencing
Almost any biological sample containing a full copy of the DNA—even a very small amount of DNA or ancient DNA—can provide the genetic material necessary for full genome sequencing. Such samples may include saliva, epithelial cells, bone marrow, hair (as long as the hair contains a hair follicle), seeds, plant leaves, or anything else that has DNA-containing cells.
The genome sequence of a single cell selected from a mixed population of cells can be determined using techniques of single cell genome sequencing. This has important advantages in environmental microbiology in cases where a single cell of a particular microorganism species can be isolated from a mixed population by microscopy on the basis of its morphological or other distinguishing characteristics. In such cases the normally necessary steps of isolation and growth of the organism in culture may be omitted, thus allowing the sequencing of a much greater spectrum of organism genomes.
Single cell genome sequencing is being tested as a method of preimplantation genetic diagnosis, wherein a cell from the embryo created by in vitro fertilization is taken and analyzed before embryo transfer into the uterus. After implantation, cell-free fetal DNA can be taken by simple venipuncture from the mother and used for whole genome sequencing of the fetus.
Mutation frequencies in cancers
Whole genome sequencing has established the mutation frequency for whole human genomes. The mutation frequency in the whole genome between generations for humans (parent to child) is about 70 new mutations per generation. An even lower level of variation was found comparing whole genome sequencing in blood cells for a pair of monozygotic (identical twins) 100 year old centenarians. Only 8 somatic differences were found, though somatic variation occurring in less than 20% of blood cells would be undetected.
In the specifically protein coding regions of the human genome, it is estimated that there are about 0.35 mutations that would change the protein sequence between parent/child generations (less than one mutated protein per generation).
Cancers, however, have much higher mutation frequencies. The particular frequency depends on tissue type, whether there is a mis-match DNA repair deficiency, and exposure to DNA damaging agents such as UV-irradiation or components of tobacco smoke. Tuna and Amos have summarized the mutation frequencies per megabase (Mb), as shown in the table (along with the indicated frequencies of mutations per genome).
The high mutation frequencies in cancers reflect the genome instability characteristic of cancers.
|Cell type||Mutation frequency/Mb||Mutation frequency per diploid genome|
|Acute lymphocytic leukemia||0.3||1,800|
|Chronic lymphocytic leukemia||<1||<6,000|
|Microsatellite stable (MSS) colon cancer||2.8||16,800|
|Microsatellite instable (MSI) colon cancer (mismatch repair deficient)||47||282,000|
|Small cell lung cancer||7.4||44,400|
|Non-small cell lung cancer (smokers)||10.5||63,000|
|Non-small cell lung cancer (never-smokers)||0.6||3,600|
|Lung adenocarcinoma (smokers)||9.8||58,500|
|Lung adenocarcinoma (never-smokers)||1.7||10,200|
|Chronic UV-irradiation induced melanoma||111||666,000|
|Non-UV-induced melanoma of hairless skin of extremities||3-14||18,000-84,000|
|Non-UV-induced melanoma of hair-bearing skin||5-55||30,000-330,000|
Sequencing of nearly an entire human genome was first accomplished in 2000 partly through the use of shotgun sequencing technology. While full genome shotgun sequencing for small (4000–7000 base pair) genomes was already in use in 1979, broader application benefited from pairwise end sequencing, known colloquially as double-barrel shotgun sequencing. As sequencing projects began to take on longer and more complicated genomes, multiple groups began to realize that useful information could be obtained by sequencing both ends of a fragment of DNA. Although sequencing both ends of the same fragment and keeping track of the paired data was more cumbersome than sequencing a single end of two distinct fragments, the knowledge that the two sequences were oriented in opposite directions and were about the length of a fragment apart from each other was valuable in reconstructing the sequence of the original target fragment.
The first published description of the use of paired ends was in 1990 as part of the sequencing of the human HPRT locus, although the use of paired ends was limited to closing gaps after the application of a traditional shotgun sequencing approach. The first theoretical description of a pure pairwise end sequencing strategy, assuming fragments of constant length, was in 1991. In 1995 Roach et al. introduced the innovation of using fragments of varying sizes, and demonstrated that a pure pairwise end-sequencing strategy would be possible on large targets. The strategy was subsequently adopted by The Institute for Genomic Research (TIGR) to sequence the entire genome of the bacterium Haemophilus influenzae in 1995, and then by Celera Genomics to sequence the entire fruit fly genome in 2000, and subsequently the entire human genome. Applied Biosystems, now called Life Technologies, manufactured the automated capillary sequencers utilized by both Celera Genomics and The Human Genome Project.
While capillary sequencing was the first approach to successfully sequence a nearly full human genome, it is still too expensive and takes too long for commercial purposes. Because of this, since 2005 capillary sequencing has been progressively displaced by newer technologies such as pyrosequencing, SMRT sequencing, and nanopore technology; all of these new technologies nevertheless continue to employ the basic shotgun strategy, namely, parallelization and template generation via genome fragmentation.
Because the sequence data that is produced can be quite large (for example, there are approximately six billion base pairs in each human diploid genome), genomic data is stored electronically and requires a large amount of computing power and storage capacity. Full genome sequencing would have been nearly impossible before the advent of the microprocessor, computers, and the Information Age.
One possible way to accomplish the cost-effective high-throughput sequencing necessary to accomplish full genome sequencing is by using nanopore technology, which is a patented technology held by Harvard University and Oxford Nanopore Technologies and licensed to biotechnology companies. To facilitate their full genome sequencing initiatives, Illumina licensed nanopore sequencing technology from Oxford Nanopore Technologies and Sequenom licensed the technology from Harvard University.
Another possible way to accomplish cost-effective high-throughput sequencing is by utilizing fluorophore technology. Pacific Biosciences is currently using this approach in their SMRT (single molecule real time) DNA sequencing technology.
Pyrosequencing is a method of DNA sequencing based on the sequencing by synthesis principle. The technique was developed by Pål Nyrén and his student Mostafa Ronaghi at the Royal Institute of Technology in Stockholm in 1996, and is currently being used by 454 Life Sciences as a basis for a full genome sequencing platform.
A number of public and private companies are competing to develop a full genome sequencing platform that is commercially robust for both research and clinical use, including Illumina, Knome, Sequenom, 454 Life Sciences, Pacific Biosciences, Complete Genomics, Helicos Biosciences, GE Global Research (General Electric), Affymetrix, IBM, Intelligent Bio-Systems, Life Technologies and Oxford Nanopore Technologies. These companies are heavily financed and backed by venture capitalists, hedge funds, and investment banks.
In October 2006, the X Prize Foundation, working in collaboration with the J. Craig Venter Science Foundation, established the Archon X Prize for Genomics, intending to award US$10 million to "the first Team that can build a device and use it to sequence 100 human genomes within 10 days or less, with an accuracy of no more than one error in every 1,000,000 bases sequenced, with sequences accurately covering at least 98% of the genome, and at a recurring cost of no more than $1,000 per genome". An error rate of 1 in 1,000,000 bases, out of a total of approximately six billion bases in the human diploid genome, would mean about 6,000 errors per genome. The error rates required for widespread clinical use, such as predictive medicine is currently set by over 1,400 clinical single gene sequencing tests (for example, errors in BRCA1 gene for breast cancer risk analysis). As of August 2013[update], the Archon X Prize for Genomics has been cancelled.
In March 2009, it was announced that Complete Genomics has signed a deal with the Broad Institute to sequence cancer patients' genomes and will be sequencing five full genomes to start. In April 2009, Complete Genomics announced that it plans to sequence 1,000 full genomes between June 2009 and the end of the year and that they plan to be able to sequence one million full genomes per year by 2013.
In June 2009, Illumina announced that they were launching their own Personal Full Genome Sequencing Service at a depth of 30× for $48,000 per genome. Jay Flatley, Illumina's President and CEO, stated that "during the next five years, perhaps markedly sooner," the price point for full genome sequencing will fall from $48,000 to under $1,000.
In August 2009, the founder of Helicos Biosciences, Stephen Quake, stated that using the company's Single Molecule Sequencer he sequenced his own full genome for less than $50,000. He stated that he expects the cost to decrease to the $1,000 range within the next two to three years.
In August 2009, Pacific Biosciences secured an additional $68 million in new financing, bringing their total capitalization to $188 million. Pacific Biosciences said they are going to use this additional investment in order to prepare for the upcoming launch of their full genome sequencing service in 2010. Complete Genomics followed by securing another $45 million in a fourth round venture funding during the same month. Complete Genomics has also made the claim that it will sequence 10,000 full genomes by the end of 2010.
In October 2009, IBM announced that they were also in the heated race to provide full genome sequencing for under $1,000, with their ultimate goal being able to provide their service for US$100 per genome. IBM's full genome sequencing technology, which uses nanopores, is known as the "DNA Transistor".
In November 2009, Complete Genomics published a peer-reviewed paper in Science demonstrating its ability to sequence a complete human genome for $1,700. If true, this would mean the cost of full genome sequencing has come down exponentially within just a single year from around $100,000 to $50,000 and now to $1,700. This consumables cost was clearly detailed in the Science paper. However, Complete Genomics has previously released statements that it was unable to follow through on. For example, the company stated it would officially launch and release its service during the "summer of 2009", provide a "$5,000" full genome sequencing service by the "summer of 2009", and "sequence 1,000 genomes between June 2009 and the end of 2009" – all of which, as of November 2009, have not yet occurred. Complete Genomics launched its R&D human genome sequencing service in October 2008 and its commercial service in May 2010. The company sequenced 50 genomes in 2009. Since then, it has significantly increased the throughput of its genome sequencing factory and was able to sequence and analyze 300 genomes in Q3 2010.
Also in November 2009, Complete Genomics announced that it was beginning a large-scale human genome sequencing study of Huntington's disease (up to 100 genomes) with the Institute for Systems Biology.
In March 2010, Researchers from the Medical College of Wisconsin announced the first successful use of Genome Wide sequencing to change the treatment of a patient. This story was later retold in a Pulitzer prize winning article  and touted as a significant accomplishment in the journal Nature and by the director of the NIH in presentations at congress.
In June 2010, Illumina lowered the cost of its individual sequencing service to $19,500 from $48,000.
In May 2011, Illumina lowered its Full Genome Sequencing service to $5,000 per human genome, or $4,000 if ordering 50 or more. Helicos Biosciences, Pacific Biosciences, Complete Genomics, Illumina, Sequenom, ION Torrent Systems, Halcyon Molecular, NABsys, IBM, and GE Global appear to all be going head to head in the race to commercialize full genome sequencing.
In January 2012, Life Technologies introduced a sequencer claimed to decode a human genome in one day for $1,000 although these claims have yet to be validated by customers on commercial devices. A UK firm spun out from Oxford University has come up with a DNA sequencing machine (the MinION) the size of a USB memory stick which costs $900 and can sequence smaller genomes (but not full human genomes in the first version). (While Oxford Nanopore stated in February that they would target having a sequencer in commercial early access by the end of 2012, this did not occur.)
In November 2012, Gene by Gene, Ltd started offering whole genome sequencing at an introductory price of $5,495 (with a minimum requirement of 3 samples per order). Currently the price is $6,995 and the minimum requirement has been removed.
A series of publications in 2012 showed the utility of SMRT sequencing from Pacific Biosciences in generating full genome sequences with de novo assembly. Some of these papers reported automated pipelines that could be used for generating these whole-genome assemblies. Other papers demonstrated how PacBio sequence data could be used to upgrade draft genomes to complete genomes.
Disruption to DNA array market
Full genome sequencing provides information on a genome that is orders of magnitude larger than that provided by the previous leader in genotyping technology, DNA arrays. For humans, DNA arrays currently provide genotypic information on up to one million genetic variants, while full genome sequencing will provide information on all six billion bases in the human genome, or 3,000 times more data. Because of this, full genome sequencing is considered a disruptive innovation to the DNA array markets as the accuracy of both range from 99.98% to 99.999% (in non-repetitive DNA regions) and their consumables cost of $5000 per 6 billion base pairs is competitive (for some applications) with DNA arrays ($500 per 1 million basepairs). Agilent, another established DNA array manufacturer, is working on targeted (selective region) genome sequencing technologies. It is thought that Affymetrix, the pioneer of array technology in the 1990s, has fallen behind due to significant corporate and stock turbulence and is currently not working on any known full genome sequencing approach. It is unknown what will happen to the DNA array market once full genome sequencing becomes commercially widespread, especially as companies and laboratories providing this disruptive technology start to realize economies of scale. It is postulated, however, that this new technology may significantly diminish the total market size for arrays and any other sequencing technology once it becomes commonplace for individuals and newborns to have their full genomes sequenced.
Sequencing versus analysis
In principle, full genome sequencing can provide raw data on all six billion nucleotides in an individual's DNA. However, it does not provide an analysis of what that information means or how it might be utilized in various clinical applications, such as in medicine to help prevent disease. As of 2010 the companies that are working on providing full genome sequencing provide clinical CLIA certified data (Illumina) and analytical services for the interpretation of the full genome data (Knome), with only one institution offering sequencing and analysis in a clinical setting. Nevertheless there is plenty of room for researchers or companies to improve such analyses and make it useful to physicians and patients.
Diagnostic use and societal impact
Inexpensive, time-efficient full genome sequencing will be a major accomplishment not only for the field of genomics, but for the entire human civilization because, for the first time, individuals will be able to have their entire genome sequenced. Utilizing this information, it is speculated that health care professionals, such as physicians and genetic counselors, will eventually be able to use genomic information to predict what diseases a person may get in the future and attempt to either minimize the impact of that disease or avoid it altogether through the implementation of personalized, preventive medicine. Full genome sequencing will allow health care professionals to analyze the entire human genome of an individual and therefore detect all disease-related genetic variants, regardless of the genetic variant's prevalence or frequency. This will enable the rapidly emerging medical fields of predictive medicine and personalized medicine and will mark a significant leap forward for the clinical genetic revolution. Full genome sequencing is clearly of great importance for research into the basis of genetic disease and has shown significant benefit to a subset of individuals with rare disease in the clinical setting. Illumina's CEO, Jay Flatley, stated in February 2009 that "A complete DNA read-out for every newborn will be technically feasible and affordable in less than five years, promising a revolution in healthcare" and that "by 2019 it will have become routine to map infants' genes when they are born". This potential use of genome sequencing is highly controversial, as it runs counter to established ethical norms for predictive genetic testing of asymptomatic minors that have been well established in the fields of medical genetics and genetic counseling. The traditional guidelines for genetic testing have been developed over the course of several decades since it first became possible to test for genetic markers associated with disease, prior to the advent of cost-effective, comprehensive genetic screening. It is established that norms, such as in the sciences and the field of genetics, are subject to change and evolve over time. It is unknown whether traditional norms practiced in medical genetics today will be altered by new technological advancements such as full genome sequencing.
Currently available newborn screening for childhood diseases allows detection of rare disorders that can be prevented or better treated by early detection and intervention. Specific genetic tests are also available to determine an etiology when a child's symptoms appear to have a genetic basis. Full genome sequencing, in addition has the potential to reveal a large amount of information (such as carrier status for autosomal recessive disorders, genetic risk factors for complex adult-onset diseases, and other predictive medical and non-medical information) that is currently not completely understood, may not be clinically useful to the child during childhood, and may not necessarily be wanted by the individual upon reaching adulthood. In addition to predicting disease risk in childhood, genetic testing may have other benefits (such as discovery of non-paternity) but may also have potential downsides (genetic discrimination, loss of anonymity, and psychological impacts). Many publications regarding ethical guidelines for predictive genetic testing of asymptomatic minors may therefore have more to do with protecting minors and preserving the individual's privacy and autonomy to know or not to know their genetic information, than with the technology that makes the tests themselves possible.
Due to recent cost reductions (see above) whole genome sequencing has become a realistic application in DNA diagnostics. In 2013, the 3Gb-TEST consortium obtained funding from the European Union to prepare the health care system for these innovations in DNA diagnostics. Quality assessment schemes, Health technology assessment and guidelines have to be in place. The 3Gb-TEST consortium has identified the analysis and interpretation of sequence data as the most complicated step in the diagnostic process. At the Consortium meeting in Athens in September 2014, the Consortium coined the word genotranslation for this crucial step. This step leads to a so-called genoreport. Guidelines are needed to determine the required content of these reports.
The majority of ethicists insist that the privacy of individuals undergoing genetic testing must be protected under all circumstances. Data obtained from whole genome sequencing can not only reveal much information about the individual who is the source of DNA, but it can also reveal much probabilistic information about the DNA sequence of close genetic relatives. Furthermore, the data obtained from whole genome sequencing can also reveal much useful predictive information about the relatives present and future health risks. This raises important questions about what obligations, if any, are owed to the family members of the individuals who are undergoing genetic testing. In our Western/European society, tested individuals are usually encouraged to share important information on the genetic diagnosis with their close relatives since the importance of the genetic diagnosis for offspring and other close relatives is usually one of the reasons for seeking a genetic testing in the first place. Nevertheless, Sijmons et al. (2011) also mention that a major ethical dilemma can develop when the patients refuse to share information on a diagnosis that is made for serious genetic disorder that is highly preventable and where there is a high risk to relatives carrying the same disease mutation. Under such circumstances, the clinician may suspect that the relatives would rather know of the diagnosis and hence the clinician can face a conflict of interest with respect to patient-doctor confidentiality.
Another major privacy concern is the scientific need to put information on patient's genotypes and phenotypes into the public scientific databases such as the locus specific databases. Although only anonymous patient data are submitted to the locus specific databases, patients might still be identifiable by their relatives in the case of finding a rare disease or a rare missense mutation.
People with public genome sequences
The first nearly complete human genomes sequenced were J. Craig Venter's (American at 7.5-fold average coverage) in 2007, followed by James Watson's (American at 7.4-fold), a Han Chinese (YH at 36-fold), a Yoruban from Nigeria (at 30-fold), a female leukemia patient (at 33 and 14-fold coverage for tumor and normal tissues), and Seong-Jin Kim (Korean at 29-fold). The first two persons with their full genome sequenced, James Watson and Craig Venter, two American scientists of European ancestry, were found to be genetically more closely related to and having more alleles in common with Korean scientist, Seong-Jin Kim (1,824,482 and 1,736,340, respectively) than with each other (1,715,851). Steve Jobs was among the first 20 people to have their whole genome sequenced, reportedly for the cost of $100,000. As of June 2012[update], there are 69 nearly complete human genomes publicly available.(reference - page not found) Commercialization of full genome sequencing is in an early stage and growing rapidly.
- Alberts, Bruce; Johnson, Alexander; Lewis, Julian; Raff, Martin; Roberts, Keith; Walter, Peter (2008). "8". Molecular biology of the cell (5th ed.). New York: Garland Science. p. 550. ISBN 0-8153-4106-7.
- Kijk magazine, 01 January 2009
- Gilissen (Jul 2014). "Genome sequencing identifies major causes of severe intellectual disability". Nature 511 (7509): 344–7. doi:10.1038/nature13394. PMID 24896178.
- Nones, K; Waddell, N; Wayte, N; Patch, AM; Bailey, P; Newell, F; Holmes, O; Fink, JL; Quinn, MC; Tang, YH; Lampe, G; Quek, K; Loffler, KA; Manning, S; Idrisoglu, S; Miller, D; Xu, Q; Waddell, N; Wilson, PJ; Bruxner, TJ; Christ, AN; Harliwong, I; Nourse, C; Nourbakhsh, E; Anderson, M; Kazakoff, S; Leonard, C; Wood, S; Simpson, PT; Reid, LE; Krause, L; Hussey, DJ; Watson, DI; Lord, RV; Nancarrow, D; Phillips, WA; Gotley, D; Smithers, BM; Whiteman, DC; Hayward, NK; Campbell, PJ; Pearson, JV; Grimmond, SM; Barbour, AP (29 October 2014). "Genomic catastrophes frequently arise in esophageal adenocarcinoma and drive tumorigenesis". Nature communications 5: 5224. doi:10.1038/ncomms6224. PMID 25351503.
- van El, CG; Cornel, MC; Borry, P; Hastings, RJ; Fellmann, F; Hodgson, SV; Howard, HC; Cambon-Thomsen, A; Knoppers, BM; Meijers-Heijboer, H; Scheffer, H; Tranebjaerg, L; Dondorp, W; de Wert, GM (June 2013). "Whole-genome sequencing in health care. Recommendations of the European Society of Human Genetics". European journal of human genetics : EJHG. 21 Suppl 1: S1–5. PMID 23819146.
- Mooney, Sean (Sep 2014). "Progress towards the integration of pharmacogenomics in practice". Human Genetics. doi:10.1007/s00439-014-1484-7. PMID 25238897.
- Fareed M., Afzal M (2013). "Single nucleotide polymorphism in genome-wide association of human population: A tool for broad spectrum service". Egyptian Journal of Medical Human Genetics 14: 123–134. doi:10.1016/j.ejmhg.2012.08.001.
- Marx, Vivien (11 September 2013). "Next-generation sequencing: The genome jigsaw". Nature 501 (7466): 263–268. doi:10.1038/501261a.
- al.], Bruce Alberts ... [et (2008). Molecular biology of the cell (5th ed.). New York: Garland Science. p. 551. ISBN 0-8153-4106-7.
- Fleischmann, R.; Adams, M.; White, O; Clayton, R.; Kirkness, E.; Kerlavage, A.; Bult, C.; Tomb, J.; Dougherty, B.; Merrick, J.; al., e. (28 July 1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512. doi:10.1126/science.7542800.
- Eddy, Sean R. (November 2012). "The C-value paradox, junk DNA and ENCODE". Current Biology 22 (21): R898–R899. doi:10.1016/j.cub.2012.10.002.
- PELLICER, JAUME; FAY, MICHAEL F.; LEITCH, ILIA J. (15 September 2010). "The largest eukaryotic genome of them all?". Botanical Journal of the Linnean Society 164 (1): 10–15. doi:10.1111/j.1095-8339.2010.01072.x.
- Human Genome Sequencing Consortium, International (21 October 2004). "Finishing the euchromatic sequence of the human genome". Nature 431 (7011): 931–945. doi:10.1038/nature03001.
- Goffeau, A.; Barrell, B. G.; Bussey, H.; Davis, R. W.; Dujon, B.; Feldmann, H.; Galibert, F.; Hoheisel, J. D.; Jacq, C.; Johnston, M.; Louis, E. J.; Mewes, H. W.; Murakami, Y.; Philippsen, P.; Tettelin, H.; Oliver, S. G. (25 October 1996). "Life with 6000 Genes" (PDF). Science 274 (5287): 546–567. doi:10.1126/science.274.5287.546.
- The C. elegans Sequencing Consortium (11 December 1998). "Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology". Science 282 (5396): 2012–2018. doi:10.1126/science.282.5396.2012.
- al.], Bruce Alberts ... [et (2008). Molecular biology of the cell (5th ed.). New York: Garland Science. p. 552. ISBN 0-8153-4106-7.
- Dunham, I. "The DNA sequence of human chromosome 22". nature.com.
- "The Genome Sequence of Drosophila melanogaster". Science 287: 2185–2195. 2000-03-24. doi:10.1126/science.287.5461.2185.
- "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.". Nature: 796–815. 2000-12-14. PMID 11130711.
- "The Sequence of the Human Genome". Science 291: 1304–1351. 2001-02-16. doi:10.1126/science.1058040.
- "Initial sequencing and comparative analysis of the mouse genome". Nature 420. 2002-10-31. doi:10.1038/nature01262.
- "Finishing the euchromatic sequence of the human genome". Nature 431. 07.09.2004. doi:10.1038/nature03001. Check date values in:
- Braslavsky, Ido et al. (2003). "Sequence information can be obtained from single DNA molecules". Proc Natl Acad Sci USA 100 (7): 3960–3984. doi:10.1073/pnas.0230489100. PMC 153030. PMID 12651960.
- Single-cell Sequencing Makes Strides in the Clinic with Cancer and PGD First Applications from Clinical Sequencing News. By Monica Heger. October 02, 2013
- Yurkiewicz, I. R.; Korf, B. R.; Lehmann, L. S. (2014). "Prenatal whole-genome sequencing--is the quest to know a fetus's future ethical?". New England Journal of Medicine 370 (3): 195–7. doi:10.1056/NEJMp1215536. PMID 24428465.
- Roach JC; Glusman G; Smit AF et al. (April 2010). "Analysis of genetic inheritance in a family quartet by whole-genome sequencing". Science 328 (5978): 636–9. doi:10.1126/science.1186802. PMC 3037280. PMID 20220176.
- Campbell CD; Chong JX; Malig M et al. (November 2012). "Estimating the human mutation rate using autozygosity in a founder population". Nat. Genet. 44 (11): 1277–81. doi:10.1038/ng.2418. PMC 3483378. PMID 23001126.
- Ye K; Beekman M; Lameijer EW; Zhang Y; Moed MH; van den Akker EB; Deelen J; Houwing-Duistermaat JJ; Kremer D; Anvar SY; Laros JF; Jones D; Raine K; Blackburne B; Potluri S; Long Q; Guryev V; van der Breggen R; Westendorp RG; 't Hoen PA; den Dunnen J; van Ommen GJ; Willemsen G; Pitts SJ; Cox DR; Ning Z; Boomsma DI; Slagboom PE (December 2013). "Aging as accelerated accumulation of somatic variants: whole-genome sequencing of centenarian and middle-aged monozygotic twin pairs". Twin Res Hum Genet 16 (6): 1026–32. doi:10.1017/thg.2013.73. PMID 24182360.
- Keightley PD (February 2012). "Rates and fitness consequences of new mutations in humans". Genetics 190 (2): 295–304. doi:10.1534/genetics.111.134668. PMC 3276617. PMID 22345605.
- Tuna M; Amos CI (November 2013). "Genomic sequencing in cancer". Cancer Lett. 340 (2): 161–70. doi:10.1016/j.canlet.2012.11.004. PMID 23178448.
- Staden R (June 1979). "A strategy of DNA sequencing employing computer programs". Nucleic Acids Res. 6 (7): 2601–10. doi:10.1093/nar/6.7.2601. PMC 327874. PMID 461197.
- Edwards, A; Caskey, T (1991). "Closure strategies for random DNA sequencing". Methods: A Companion to Methods in Enzymology 3 (1): 41–47. doi:10.1016/S1046-2023(05)80162-8.
- Edwards A; Voss H; Rice P; Civitello A; Stegemann J; Schwager C; Zimmermann J; Erfle H; Caskey CT; Ansorge W (April 1990). "Automated DNA sequencing of the human HPRT locus". Genomics 6 (4): 593–608. doi:10.1016/0888-7543(90)90493-E. PMID 2341149.
- Roach JC; Boysen C; Wang K; Hood L (March 1995). "Pairwise end sequencing: a unified approach to genomic mapping and sequencing". Genomics 26 (2): 345–53. doi:10.1016/0888-7543(95)80219-C. PMID 7601461.
- Fleischmann RD; Adams MD; White O; Clayton RA; Kirkness EF; Kerlavage AR; Bult CJ; Tomb JF; Dougherty BA; Merrick JM; McKenney; Sutton; Fitzhugh; Fields; Gocyne; Scott; Shirley; Liu; Glodek; Kelley; Weidman; Phillips; Spriggs; Hedblom; Cotton; Utterback; Hanna; Nguyen; Saudek et al. (July 1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512. Bibcode:1995Sci...269..496F. doi:10.1126/science.7542800. PMID 7542800.
- Adams, MD et al. (2000). "The genome sequence of Drosophila melanogaster". Science 287 (5461): 2185–95. Bibcode:2000Sci...287.2185.. doi:10.1126/science.287.5461.2185. PMID 10731132.
- Mukhopadhyay R (February 2009). "DNA sequencers: the next generation". Anal. Chem. 81 (5): 1736–40. doi:10.1021/ac802712u. PMID 19193124.
- "Harvard University and Oxford Nanopore Technologies Announce Licence Agreement to Advance Nanopore DNA Sequencing and other Applications". Nanotechwire. August 5, 2008. Retrieved 2009-02-23.
- "Illumina and Oxford Nanopore Enter into Broad Commercialization Agreement". Reuters. January 12, 2009. Retrieved 2009-02-23.
- [dead link]
- "Single Molecule Real Time (SMRT) DNA Sequencing". Pacific Biosciences. Retrieved 2009-02-23.[dead link]
- "Complete Human Genome Sequencing Technology Overview" (PDF). Complete Genomics. 2009. Retrieved 2009-02-23.[dead link]
- "Definition of pyrosequencing from the Nature Reviews Genetics Glossary". Retrieved 2008-10-28.
- Ronaghi M; Uhlén M; Nyrén P (July 1998). "A sequencing method based on real-time pyrophosphate". Science 281 (5375): 363, 365. doi:10.1126/science.281.5375.363. PMID 9705713.
- Ronaghi M; Karamohamed S; Pettersson B; Uhlén M; Nyrén P (November 1996). "Real-time DNA sequencing using detection of pyrophosphate release". Anal. Biochem. 242 (1): 84–9. doi:10.1006/abio.1996.0432. PMID 8923969.
- Nyrén P (2007). "The history of pyrosequencing". Methods Mol. Biol. 373: 1–14. doi:10.1385/1-59745-377-3:1. ISBN 1-59745-377-3. PMID 17185753.
- "Article : Race to Cut Whole Genome Sequencing Costs Genetic Engineering & Biotechnology News — Biotechnology from Bench to Business". Genengnews.com. Retrieved 2009-02-23.
- "Whole Genome Sequencing Costs Continue to Drop". Eyeondna.com. Retrieved 2009-02-23.
- Harmon, Katherine (2010-06-28). "Genome Sequencing for the Rest of Us". Scientific American. Retrieved 2010-08-13.
- San Diego/Orange County Technology News. "Sequenom to Develop Third-Generation Nanopore-Based Single Molecule Sequencing Technology". Freshnews.com. Retrieved 2009-02-24.
- "Article : Whole Genome Sequencing in 24 Hours Genetic Engineering & Biotechnology News — Biotechnology from Bench to Business". Genengnews.com. Retrieved 2009-02-23.
- "Pacific Bio lifts the veil on its high-speed genome-sequencing effort". VentureBeat. Retrieved 2009-02-23.
- "Bio-IT World". Bio-IT World. 2008-10-06. Retrieved 2009-02-23.
- "With New Machine, Helicos Brings Personal Genome Sequencing A Step Closer". Xconomy. 2008-04-22. Retrieved 2011-01-28.
- "Whole genome sequencing costs continue to fall: $300 million in 2003, $1 million 2007, $60,000 now, $5000 by year end". Nextbigfuture.com. 2008-03-25. Retrieved 2011-01-28.
- "Han Cao's nanofluidic chip could cut DNA sequencing costs dramatically". Technology Review.
- John Carroll (2008-07-14). "Pacific Biosciences gains $100M for sequencing tech". FierceBiotech. Retrieved 2009-02-23.
- Sibley, Lisa (2009-02-08). "Complete Genomics brings radical reduction in cost". Silicon Valley / San Jose Business Journal (Sanjose.bizjournals.com). Retrieved 2009-02-23.
- Carlson, Rob (2007-01-02). "A Few Thoughts on Rapid Genome Sequencing and The Archon Prize — synthesis". Synthesis.cc. Retrieved 2009-02-23.
- "PRIZE Overview: Archon X PRIZE for Genomics".
- Bentley DR (December 2006). "Whole-genome re-sequencing". Curr. Opin. Genet. Dev. 16 (6): 545–552. doi:10.1016/j.gde.2006.10.009. PMID 17055251.
- Diamandis, Peter. "Outpaced by Innovation: Canceling an XPRIZE". Huffington Post.
- "SOLiD System — a next-gen DNA sequencing platform announced". Gizmag.com. 2007-10-27. Retrieved 2009-02-24.
- "The $1000 Genome: Coming Soon?". Dddmag.com. 2010-04-01. Retrieved 2011-01-28.
- "Complete Genomics, Broad Institute Forge Cancer Sequencing Collaboration". Bio-IT World. Retrieved 2011-01-28.
- Walsh, Fergus (2009-04-08). "Era of personalised medicine awaits". BBC News. Retrieved 2010-05-03.
- "Individual genome sequencing — Illumina, Inc.". Everygenome.com. Retrieved 2011-01-28.
- "Illumina launches personal genome sequencing service for $48,000 : Genetic Future". Scienceblogs.com. Retrieved 2011-01-28.
- "Illumina demos concept iPhone app for genetic data sharing". mobihealthnews. 2009-06-10. Retrieved 2011-01-28.
- Wade, Nicholas (2009-08-11). "Cost of Decoding a Genome Is Lowered". The New York Times. Retrieved 2010-05-03.
- Camille Ricketts (2009-08-13). "Pacific Biosciences takes $68M as genome sequencing becomes more competitive". VentureBeat. Retrieved 2011-01-28.
- "Pacific Biosciences Raises Additional $68 Million in Financing". FierceBiotech. 2009-08-12. Retrieved 2011-01-28.
- "Silicon Valley startup Complete Genomics promises low-cost DNA sequencing". San Jose Mercury News. Mercurynews.com. Retrieved 2011-01-28.
- "Silicon Valley Startup Complete Genomics Promises Low-Cost DNA Sequencing". Istockanalyst.com. 2009-08-24. Retrieved 2011-01-28.
- Jacquin Niles. "Explaining Sequencing | The Daily Scan". GenomeWeb. Retrieved 2011-01-28.
- "NHGRI Awards More than $50M for Low-Cost DNA Sequencing Tech Development". Genome Web. 2009.
- JOHN MARKOFF (October 5, 2009). "I.B.M. Joins Pursuit of $1,000 Personal Genome". The Newyork Times. Retrieved May 15, 2013.
- Shankland, Stephen (2009-10-06). "IBM Research jumps into genetic sequencing | Deep Tech". CNET News. News.cnet.com. Retrieved 2011-01-28.
- [dead link]
- Drmanac R, Sparks AB, Callow MJ et al.: Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays" Science 327(5961), 78-81 (2010)
- "Broad Institute to use Complete Genomics to sequence genomes of cancer patients : Genetic Future". Scienceblogs.com. Retrieved 2011-01-28.
- "Five Thousand Bucks for Your Genome". Technology Review. 2008-10-20. Retrieved 2009-02-23.
- "One In A Billion: A boy's life, a medical mystery".
- "US clinics quietly embrace whole-genome sequencing".
- Herper, Matthew (2010-06-03). "Your Genome is Coming". Forbes. Retrieved 2010-08-13.
- Lauerman, John (2009-02-05). "Complete Genomics Drives Down Cost of Genome Sequence to $5,000". Bloomberg.com. Retrieved 2011-01-28.
- "Illumina Announces $5,000 Genome Pricing".
- "Products". dnadtc.com. Retrieved 28 November 2012.
- "Gene By Gene Launches DNA DTC". The Wall Street Journal. 29 November 2012. Retrieved 29 November 2012.
- Vorhaus, Dan (29 November 2012). "DNA DTC: The Return of Direct to Consumer Whole Genome Sequencing". genomicslawreport.com. Retrieved 30 November 2012.
- "Finished bacterial genomes from shotgun sequence data" (PDF).
- Koren, Sergey (July 2012). "Hybrid error correction and de novo assembly of single-molecule sequencing reads". NatureBiotechnology 30 (7): 693–700. doi:10.1038/nbt.2280. PMID 22750884.
- "Mind the Gap:Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology". PLoS ONE 7: e47768. doi:10.1371/journal.pone.0047768.
- "Illumina Sequencer Enables $1,000 Genome". News: Genomics & Proteomics. Gen. Eng. Biotechnol. News (paper) 34 (4). 15 February 2014. p. 18.
- "Genomics Core". Gladstone.ucsf.edu. Retrieved 2009-02-23.
- Nishida N; Koike A; Tajima A; Ogasawara Y; Ishibashi Y; Uehara Y; Inoue I; Tokunaga K (2008). "Evaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individuals". BMC Genomics 9 (1): 431. doi:10.1186/1471-2164-9-431. PMC 2566316. PMID 18803882.
- Petrone, Justin. "Illumina, DeCode Build 1M SNP Chip; Q2 Launch to Coincide with Release of Affy's 6.0 SNP Array | BioArray News | Arrays". GenomeWeb. Retrieved 2009-02-23.
- "Agilent Technologies Announces Licensing Agreement with Broad Institute to Develop Genome-Partitioning Kits to Streamline Next-Generation Sequencing".[dead link]
- "Affymetrix stock slumps 30% on forecast". Sacramento Business Journal (Sacramento.bizjournals.com). 2008-07-25. Retrieved 2009-02-23.
- Bluis, John (2006-04-24). "Affymetrix Gets Chipped Again". Fool.com. Retrieved 2009-02-23.
- "The chips are down". Nature 444 (7117): 256–7. November 2006. Bibcode:2006Natur.444..256.. doi:10.1038/444256a. PMID 17108930.
- Coombs A (October 2008). "The sequencing shakeup". Nat. Biotechnol. 26 (10): 1109–12. doi:10.1038/nbt1008-1109. PMID 18846083.
- "Following Diagnostic Sequencing Success, MCW Creates Comprehensive Framework to Guide Future Cases".
- "The Wall Street Journal—Video". The Wall Street Journal.
- Ashley EA; Butte AJ; Wheeler MT; Chen R; Klein TE; Dewey FE; Dudley JT; Ormond KE; Pavlovic A; Morgan AA; Pushkarev D; Neff NF; Hudgins L; Gong L; Hodges LM; Berlin DS; Thorn CF; Sangkuhl K; Hebert JM; Woon M; Sagreiya H; Whaley R; Knowles JW; Chou MF; Thakuria JV; Rosenbaum AM; Zaranek AW; Church GM; Greely HT; Quake SR; Altman RB (May 2010). "Clinical assessment incorporating a personal genome". Lancet 375 (9725): 1525–35. doi:10.1016/S0140-6736(10)60452-7. PMC 2937184. PMID 20435227.
- "Genomes, Environments, Traits (GET) Evidence".
- Ng SB; Buckingham KJ; Lee C et al. (January 2010). "Exome sequencing identifies the cause of a mendelian disorder". Nat. Genet. 42 (1): 30–5. doi:10.1038/ng.499. PMC 2847889. PMID 19915526.
- Hannibal MC; Buckingham KJ; Ng SB et al. (July 2011). "Spectrum of MLL2 (ALR) mutations in 110 cases of Kabuki syndrome". Am. J. Med. Genet. A 155A (7): 1511–6. doi:10.1002/ajmg.a.34074. PMC 3121928. PMID 21671394.
- Worthey EA; Mayer AN; Syverson GD et al. (March 2011). "Making a definitive diagnosis: successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease". Genet. Med. 13 (3): 255–62. doi:10.1097/GIM.0b013e3182088158. PMID 21173700.
- Goh V; Helbling D; Biank V; Jarzembowski J; Dimmock D (June 2011). "Next Generation Sequencing Facilitates The Diagnosis In A Child With Twinkle Mutations Causing Cholestatic Liver Failure". J Pediatr Gastroenterol Nutr 54 (2): 291–4. doi:10.1097/MPG.0b013e318227e53c. PMID 21681116.
- Henderson, Mark (2009-02-09). "Genetic mapping of babies by 2019 will transform preventive medicine". London: Times Online. Retrieved 2009-02-23.
- McCabe LL; McCabe ER (June 2001). "Postgenomic medicine. Presymptomatic testing for prediction and prevention". Clin Perinatol 28 (2): 425–34. doi:10.1016/S0095-5108(05)70094-4. PMID 11499063.
- Nelson RM; Botkjin JR; Kodish ED et al. (June 2001). "Ethical issues with genetic testing in pediatrics". Pediatrics 107 (6): 1451–5. doi:10.1542/peds.107.6.1451. PMID 11389275.
- Borry P; Fryns JP; Schotsmans P; Dierickx K (February 2006). "Carrier testing in minors: a systematic review of guidelines and position papers". Eur. J. Hum. Genet. 14 (2): 133–8. doi:10.1038/sj.ejhg.5201509. PMID 16267502.
- Borry P; Stultiens L; Nys H; Cassiman JJ; Dierickx K (November 2006). "Presymptomatic and predictive genetic testing in minors: a systematic review of guidelines and position papers". Clin. Genet. 70 (5): 374–81. doi:10.1111/j.1399-0004.2006.00692.x. PMID 17026616.
- Mesoudi A; Danielson P (August 2008). "Ethics, evolution and culture". Theory Biosci. 127 (3): 229–40. doi:10.1007/s12064-008-0027-y. PMID 18357481.
- Ehrlich PR; Levin SA (June 2005). "The evolution of norms". PLoS Biol. 3 (6): e194. doi:10.1371/journal.pbio.0030194. PMC 1149491. PMID 15941355.
- Mayer AN; Dimmock DP; Arca MJ et al. (March 2011). "A timely arrival for genomic medicine". Genet. Med. 13 (3): 195–6. doi:10.1097/GIM.0b013e3182095089. PMID 21169843.
- Ayday E; De Cristofaro E; Hubaux JP; Tsudik G (2015). "The Chills and Thrills of Whole Genome Sequencing". ArXiv Repository. arXiv:1306.1264. Bibcode:2015arXiv1306.1264.
- Borry, P.; Evers-Kiebooms, G.; Cornel, MC; Clarke, A; Dierickx, K; Public Professional Policy Committee (PPPC) of the European Society of Human Genetics (ESHG) (2009). "Genetic testing in asymptomatic minors Background considerations towards ESHG Recommendations". Eur J Hum Genet 17 (6): 711–9. doi:10.1038/ejhg.2009.25. PMC 2947094. PMID 19277061.
- "Introducing diagnostic applications of ‘3Gb-testing’ in human genetics".
- "Beyond public health genomics: proposals from an international working group". Eur J Public Health 24: 877–879. Aug 2014. doi:10.1093/eurpub/cku142. PMID 25168910.
- "RD-Connect News: 18 July 2014, Issue 7".
- Sijmons, R.H; Van Langen, I.M (2011). "A clinical perspective on ethical issues in genetic issues". Accountability in Research: Policies and Quality Assurance 18 (3): 148–162. doi:10.1080/08989621.2011.575033.
- Sijmons, R.H.; Van Langen, I.M (2011). "A clinical perspective on ethical issues in genetic testing". Accountability in Research: Policies and Quality Assurance 18 (3): 148–162. doi:10.1080/08989621.2011.575033.
- McGuire, Amy, L; Caulfield, Timothy (2008). "Science and Society: Research ethics and the challenge of whole-genome sequencing". Nature Reviews: Genetics 9 (2): 152–156. doi:10.1038/nrg2302.
- Wade, Nicholas (September 4, 2007). "In the Genome Race, the Sequel Is Personal". New York Times. Retrieved February 22, 2009.
- Nature. "Access : All about Craig: the first 'full' genome sequence". Nature. Retrieved 2009-02-24.
- Levy S; Sutton G; Ng PC; Feuk L; Halpern AL; Walenz BP; Axelrod N; Huang J; Kirkness EF; Denisov G; Lin Y; MacDonald JR; Pang AW; Shago M; Stockwell TB; Tsiamouri A; Bafna V; Bansal V; Kravitz SA; Busam DA; Beeson KY; McIntosh TC; Remington KA; Abril JF; Gill J; Borman J; Rogers YH; Frazier ME; Scherer SW; Strausberg RL; Venter JC (September 2007). "The diploid genome sequence of an individual human". PLoS Biol. 5 (10): e254. doi:10.1371/journal.pbio.0050254. PMC 1964779. PMID 17803354.
- Wade, Wade (June 1, 2007). "DNA pioneer Watson gets own genome map". International Herald Tribune. Retrieved February 22, 2009.
- Wade, Nicholas (May 31, 2007). "Genome of DNA Pioneer Is Deciphered". New York Times. Retrieved February 21, 2009.
- Wheeler DA; Srinivasan M; Egholm M; Shen Y; Chen L; McGuire A; He W; Chen YJ; Makhijani V; Roth GT; Gomes X; Tartaro K; Niazi F; Turcotte CL; Irzyk GP; Lupski JR; Chinault C; Song XZ; Liu Y; Yuan Y; Nazareth L; Qin X; Muzny DM; Margulies M; Weinstock GM; Gibbs RA; Rothberg JM (2008). "The complete genome of an individual by massively parallel DNA sequencing". Nature 452 (7189): 872–6. Bibcode:2008Natur.452..872W. doi:10.1038/nature06884. PMID 18421352.
- Wang J; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin, Juanbin; Guo, Yiran, Yiran; Feng, Binxiao, Binxiao; Li, Heng, Heng; Lu, Yao, Yao; Fang, Xiaodong, Xiaodong; Liang, Huiqing, Huiqing; Du, Zhenglin, Zhenglin; Li, Dong, Dong; Zhao, Yiqing, Yiqing; Hu, Yujie, Yujie; Yang, Zhenzhen, Zhenzhen; Zheng, Hancheng, Hancheng; Hellmann, Ines, Ines; Inouye, Michael, Michael; Pool, John, John; Yi, Xin, Xin; Zhao, Jing, Jing; Duan, Jinjie, Jinjie; Zhou, Yan, Yan et al. (2008). "The diploid genome sequence of an Asian individual". Nature 456 (7218): 60–65. Bibcode:2008Natur.456...60W. doi:10.1038/nature07484. PMC 2716080. PMID 18987735.
- Bentley DR; Balasubramanian S et al. (2008). "Accurate whole human genome sequencing using reversible terminator chemistry". Nature 456 (7218): 53–9. Bibcode:2008Natur.456...53B. doi:10.1038/nature07517. PMC 2581791. PMID 18987734.
- Ley TJ; Mardis ER; Ding L; Fulton B; McLellan MD; Chen K; Dooling D; Dunford-Shore BH; McGrath S; Hickenbotham M; Cook L; Abbott R; Larson DE; Koboldt DC; Pohl C; Smith S; Hawkins A; Abbott S; Locke D; Hillier LW; Miner T; Fulton L; Magrini V; Wylie T; Glasscock J; Conyers J; Sander N; Shi X; Osborne JR et al. (2008). "DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome". Nature 456 (7218): 66–72. Bibcode:2008Natur.456...66L. doi:10.1038/nature07485. PMC 2603574. PMID 18987736.
- Ahn SM; Kim TH; Lee S; Kim D; Ghang H; Kim D; Kim BC; Kim SY; Kim WY; Kim C; Park D; Lee YS; Kim S; Reja R; Jho S; Kim CG; Cha JY; Kim KH; Lee B; Bhak J; Kim SJ (2009). "The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group". Genome Research 19 (9): 1622–9. doi:10.1101/gr.092197.109. PMC 2752128. PMID 19470904.
- Barbujani, Guido; Pigliucci, Massimo (2013). "Human races" (PDF). Current Biology 23 (5): R185–R187. doi:10.1016/j.cub.2013.01.024. ISSN 0960-9822. PMID 23473555. Retrieved 2 December 2013.
What does this imply for the existence of human races? Basically, that people with similar genetic features can be found in distant places, and that each local population contains a vast array of genotypes. Among the first genomes completely typed were those of James Watson and Craig Venter, two U.S. geneticists of European origin; they share more alleles with Seong-Jin Kim, a Korean scientist (1,824,482 and 1,736,340, respectively) than with each other (1,715,851).
- Lohr, Steve (2011-10-20). "New Book Details Jobs's Fight Against Cancer". The New York Times.
- "Complete Human Genome Sequencing Datasets to its Public Genomic Repository".
- Archon X Prize for Genomics
- James Watson's Personal Genome Sequence
- AAAS/Science: Genome Sequencing Poster
- Outsmart Your Genes: Book that discusses full genome sequencing and its impact upon health care and society
- Whole genome linkage analysis