List of sequenced eukaryotic genomes

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Saccharomyces cerevisiae was the first eukaryotic organism to have its complete genome sequence determined.

This list of "sequenced" eukaryotic genomes contains all the eukaryotes known to have publicly available complete nuclear and organelle genome sequences that have been sequenced, assembled, annotated and published; draft genomes are not included, nor are organelle-only sequences.

DNA was first sequenced in 1977. The first free-living organism to have its genome completely sequenced was the bacterium Haemophilus influenzae, in 1995. In 1996 Saccharomyces cerevisiae (baker's yeast) was the first eukaryote genome sequence to be released and in 1998 the first genome sequence for a multicellular eukaryote, Caenorhabditis elegans, was released.


Following are the six earliest sequenced genomes of protists. For a more complete list, see the List of sequenced protist genomes.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Guillardia theta Cryptomonad Model organism 0.551 Mb
(nucleomorph genome only)
465,[1] 513, 598 (UniProt) Canadian Institute of Advanced Research, Philipps-University Marburg and the University of British Columbia 2001[1]
Plasmodium falciparum
Apicomplexan Human pathogen (malaria) 22.9 Mb 5,268[2] Malaria Genome Project Consortium 2002[2]
Plasmodium yoelii yoelii
Apicomplexan Rodent pathogen (malaria) 23.1 Mb 5,878[3] TIGR and NMRC 2002[3]
Cryptosporidium hominis
Apicomplexan Human pathogen 10.4 Mb 3,994[4] Virginia Commonwealth University 2004[4]
Cryptosporidium parvum
C- or genotype 2 isolate
Apicomplexan Human pathogen 16.5 Mb 3,807[5] UCSF and University of Minnesota 2004[5]
Thalassiosira pseudonana
Strain:CCMP 1335
Diatom Model organism 34.5 Mb 11,242[6] Joint Genome Institute and the University of Washington 2004[6]


Following are the five earliest sequenced genomes of plants. For a more complete list, see the List of sequenced plant genomes.

Organism Type Relevance Genome size Number of chromosomes Number of genes predicted Organization Year of completion
Arabidopsis thaliana
Wild mustard Thale Cress Model plant 135 Mb[7] 5 25,498,[8] 27,400,[9] 31,670 (UniProt) Arabidopsis Genome Initiative[10] 2000[8]
Cyanidioschyzon merolae
Red algae Simple eukaryote 16.5 Mb 20 5,331[11] University of Tokyo, Rikkyo University, Saitama University and Kumamoto University 2004[11]
Oryza sativa
ssp indica
Rice Crop and model organism 420 Mb 12 32-50,000[12] Beijing Genomics Institute, Zhejiang University and the Chinese Academy of Sciences 2002[12]
Ostreococcus tauri Green algae Simple eukaryote, small genome 12.6 Mb 7,969 (UniProt) Laboratoire Arago 2006[13]
Populus trichocarpa Balsam poplar or Black Cottonwood Carbon sequestration, model tree, commercial use (timber), and comparison to A. thaliana 550 Mb 19 45,555[14] The International Poplar Genome Consortium 2006[14]


Following are the five earliest sequenced genomes of fungi. For a more complete list, see the List of sequenced fungi genomes.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Saccharomyces cerevisiae
Saccharomycetes Baker's Yeast; Model eukaryote 12.1 Mb 6,294[15] International Collaboration for the Yeast Genome Sequencing[16] 1996[15]
Encephalitozoon cuniculi Microsporidium Human pathogen 2.9 Mb 1,997[17] Genoscope and Université Blaise Pascal 2001[17]
Schizosaccharomyces pombe
Schizosaccharomycetes Model eukaryote 14 Mb 4,824[18] Sanger Institute and Cold Spring Harbor Laboratory 2002[18]
Neurospora crassa Sordariomycetes Model eukaryote 40 Mb 10,082[19] Broad Institute, Oregon Health and Science University, University of Kentucky, and the University of Kansas 2003[19]
Phanerochaete chrysosporium
Strain:RP78 Imagine!
Agaricomycetes Wood rotting fungus, use in mycoremediation 30 Mb 11,777[20] Joint Genome Institute 2004[20]


Following are the five earliest sequenced genomes of animals. For a more complete list, see the List of sequenced animal genomes.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Caenorhabditis elegans
Strain:Bristol N2
Nematode Model animal 100 Mb 19,000[21] Washington University and the Sanger Institute 1998[21]
Drosophila melanogaster Fruit fly Model animal 165 Mb 13,600[22] Celera, UC Berkeley, Baylor College of Medicine, European DGP 2000[22]
Anopheles gambiae
Strain: PEST
Mosquito Vector of malaria 278 Mb 13,683[23] Celera Genomics and Genoscope 2002[23]
Takifugu rubripes Puffer fish Vertebrate with small genome 390 Mb 22–29,000[24] International Fugu Genome Consortium[25] 2002[26]
Homo sapiens Human 3.2 Gb[27] 18,826 (CCDS consortium) Human Genome Project Consortium and Celera Genomics Draft 2001[28][29]
Complete 2006[30]

See also[edit]


  1. ^ a b Douglas S, Zauner S, Fraunholz M, et al. (April 2001). "The highly reduced genome of an enslaved algal nucleus". Nature. 410 (6832): 1091–6. Bibcode:2001Natur.410.1091D. doi:10.1038/35074092. PMID 11323671.
  2. ^ a b Gardner MJ, Hall N, Fung E, et al. (October 2002). "Genome sequence of the human malaria parasite Plasmodium falciparum". Nature. 419 (6906): 498–511. Bibcode:2002Natur.419..498G. doi:10.1038/nature01097. PMC 3836256. PMID 12368864.
  3. ^ a b Carlton JM, Angiuoli SV, Suh BB, et al. (October 2002). "Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii". Nature. 419 (6906): 512–9. Bibcode:2002Natur.419..512C. doi:10.1038/nature01099. PMID 12368865.
  4. ^ a b Xu P, Widmer G, Wang Y, et al. (October 2004). "The genome of Cryptosporidium hominis". Nature. 431 (7012): 1107–12. Bibcode:2004Natur.431.1107X. doi:10.1038/nature02977. PMID 15510150.
  5. ^ a b Abrahamsen MS, Templeton TJ, Enomoto S, et al. (April 2004). "Complete genome sequence of the apicomplexan, Cryptosporidium parvum". Science. 304 (5669): 441–5. Bibcode:2004Sci...304..441A. doi:10.1126/science.1094786. PMID 15044751. S2CID 26434820.
  6. ^ a b Armbrust EV, Berges JA, Bowler C, et al. (October 2004). "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism". Science. 306 (5693): 79–86. Bibcode:2004Sci...306...79A. CiteSeerX doi:10.1126/science.1101156. PMID 15459382. S2CID 8593895.
  7. ^
  8. ^ a b The Arabidopsis Genome Initiative (December 2000). "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana". Nature. 408 (6814): 796–815. Bibcode:2000Natur.408..796T. doi:10.1038/35048692. PMID 11130711.
  9. ^ Ensembl entry
  10. ^ Arabidopsis Genome Initiative Archived 2006-02-07 at the Wayback Machine
  11. ^ a b Matsuzaki M, Misumi O, Shin-I T, et al. (April 2004). "Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D". Nature. 428 (6983): 653–7. Bibcode:2004Natur.428..653M. doi:10.1038/nature02398. PMID 15071595.
  12. ^ a b Goff SA, Ricke D, Lan TH, et al. (April 2002). "A draft sequence of the rice genome (Oryza sativa L. ssp. japonica)". Science. 296 (5565): 92–100. Bibcode:2002Sci...296...92G. doi:10.1126/science.1068275. PMID 11935018. S2CID 2960202.
  13. ^ Derelle E, Ferraz C, Rombauts S, et al. (August 2006). "Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features". PNAS. 103 (31): 11647–52. Bibcode:2006PNAS..10311647D. doi:10.1073/pnas.0604795103. PMC 1544224. PMID 16868079.
  14. ^ a b Tuskan GA, Difazio S, Jansson S, et al. (September 2006). "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)". Science. 313 (5793): 1596–604. Bibcode:2006Sci...313.1596T. doi:10.1126/science.1128691. PMID 16973872. S2CID 7717980.
  15. ^ a b Goffeau A, Barrell BG, Bussey H, et al. (October 1996). "Life with 6000 genes". Science. 274 (5287): 546, 563–7. Bibcode:1996Sci...274..546G. doi:10.1126/science.274.5287.546. PMID 8849441. S2CID 16763139.
  16. ^ International Collaboration for the Yeast Genome Sequencing Archived 2007-09-27 at the Wayback Machine
  17. ^ a b Katinka MD, Duprat S, Cornillot E, et al. (November 2001). "Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi". Nature. 414 (6862): 450–3. Bibcode:2001Natur.414..450K. doi:10.1038/35106579. PMID 11719806.
  18. ^ a b Wood V, Gwilliam R, Rajandream MA, et al. (February 2002). "The genome sequence of Schizosaccharomyces pombe". Nature. 415 (6874): 871–80. doi:10.1038/nature724. PMID 11859360.
  19. ^ a b Galagan JE, Calvo SE, Borkovich KA, et al. (April 2003). "The genome sequence of the filamentous fungus Neurospora crassa". Nature. 422 (6934): 859–68. Bibcode:2003Natur.422..859G. doi:10.1038/nature01554. PMID 12712197.
  20. ^ a b Martinez, Diego; Larrondo, Luis F; Putnam, Nik; Gelpke, Maarten D Sollewijn; Huang, Katherine; Chapman, Jarrod; Helfenbein, Kevin G; Ramaiya, Preethi; et al. (2004). "Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78". Nature Biotechnology. 22 (6): 695–700. doi:10.1038/nbt967. PMID 15122302.
  21. ^ a b C. elegans Sequencing Consortium (December 1998). "Genome sequence of the nematode C. elegans: a platform for investigating biology". Science. 282 (5396): 2012–8. Bibcode:1998Sci...282.2012.. doi:10.1126/science.282.5396.2012. PMID 9851916.
  22. ^ a b Adams MD, Celniker SE, Holt RA, et al. (March 2000). "The genome sequence of Drosophila melanogaster". Science. 287 (5461): 2185–95. Bibcode:2000Sci...287.2185.. CiteSeerX doi:10.1126/science.287.5461.2185. PMID 10731132.
  23. ^ a b Holt RA, Subramanian GM, Halpern A, et al. (October 2002). "The genome sequence of the malaria mosquito Anopheles gambiae". Science. 298 (5591): 129–49. Bibcode:2002Sci...298..129H. CiteSeerX doi:10.1126/science.1076181. PMID 12364791. S2CID 4512225.H
  24. ^ International Fugu Genome Consortium. Forth Genome Assembly Archived 2012-02-05 at the Wayback Machine
  25. ^ International Fugu Genome Consortium Archived 2012-02-05 at the Wayback Machine
  26. ^ Aparicio S, Chapman J, Stupka E, et al. (August 2002). "Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes". Science. 297 (5585): 1301–10. Bibcode:2002Sci...297.1301A. doi:10.1126/science.1072104. PMID 12142439. S2CID 10310355.
  27. ^ Human Genome Sequencing Consortium, International (October 2004). "Finishing the euchromatic sequence of the human genome". Nature. 431 (7011): 931–45. Bibcode:2004Natur.431..931H. doi:10.1038/nature03001. PMID 15496913.
  28. ^ McPherson JD, Marra M, Hillier L, et al. (February 2001). "A physical map of the human genome". Nature. 409 (6822): 934–41. Bibcode:2001Natur.409..934M. doi:10.1038/35057157. PMID 11237014.
  29. ^ Venter JC, Adams MD, Myers EW, et al. (February 2001). "The sequence of the human genome". Science. 291 (5507): 1304–51. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID 11181995.
  30. ^ Gregory SG, Barlow KF, McLay KE, et al. (May 2006). "The DNA sequence and biological annotation of human chromosome 1". Nature. 441 (7091): 315–21. Bibcode:2006Natur.441..315G. doi:10.1038/nature04727. PMID 16710414.

External links[edit]