Jump to content

HomoloGene: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m Added {{unreferenced}} tag to article. using Friendly
Lithoderm (talk | contribs)
Blanked the page
Line 1: Line 1:
{{unreferenced|date=November 2008}}
'''HomoloGene''', a tool of the [[National Center for Biotechnology Information]] (NCBI), is a system for automated detection of [[homology (biology)|homologs]] (similarity attributable to descent from a common ancestor) among the annotated genes of several completely sequenced eukaryotic genomes.

The HomoloGene processing consists of the protein analysis from the input organisms. Sequences are compared using blastp[http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi?CMD=Web&LAYOUT=TwoWindows&AUTO_FORMAT=Semiauto&ALIGNMENTS=250&ALIGNMENT_VIEW=Pairwise&CDD_SEARCH=on&CLIENT=web&DATABASE=nr&DESCRIPTIONS=500&ENTREZ_QUERY=%28none%29&EXPECT=10&FILTER=L&FORMAT_OBJECT=Alignment&FORMAT_TYPE=HTML&I_THRESH=0.005&MATRIX_NAME=BLOSUM62&NCBI_GI=on&PAGE=Proteins&PROGRAM=blastp&SERVICE=plain&SET_DEFAULTS.x=41&SET_DEFAULTS.y=5&SHOW_OVERVIEW=on&END_OF_HTTPGET=Yes&SHOW_LINKOUT=yes&GET_SEQUENCE=yes|blastp], then matched up and put into groups, using a taxonomic tree built from sequence similarity, where closer related organisms are matched up first, and then further organisms are added to the tree. The protein alignments are mapped back to their corresponding DNA sequences, and then distance metrics as molecular distances [[Substitution model|Jukes and Cantor (1969)]], [[Ka/Ks ratio]] can be calculated.

The sequences are matched up by using a [[Heuristic (computer science)|heuristic algorithm]] for maximizing the score globally, rather than locally, in a bipartite matching (see [[complete bipartite graph]]). And then it calculates the statistical significance of each match. Cutoffs are made per position and Ks values are set to prevent false [[Ortholog|"orthologs"]] from being grouped together. [[Homology (biology)|“Paralogs”]] are identified by finding sequences that are closer within species than other species.

==Input organisms==
''[[Homo sapiens]], [[Pan troglodytes]], [[Canis lupus familiaris]], [[Bos taurus]], [[Mus musculus]], [[Danio rerio]], [[Rattus norvegicus]], [[Arabidopsis thaliana]], [[Gallus gallus]], [[Oryza sativa]], [[Anopheles gambiae]], [[Drosophila melanogaster]], [[Magnaporthe grisea]], [[Neurospora crassa]], [[Caenorhabditis elegans]], [[Saccharomyces cerevisiae]], [[Kluyveromyces lactis]], [[Eremothecium gossypii]], [[Schizosaccharomyces pombe]] and [[Plasmodium falciparum]]''.

==Interface==
The HomoloGene is linked to all Entrez databases and based on homology and phenotype information of these links:
* Mouse Genome Informatics (MGI),
* Zebrafish Information Network (ZFIN),
* Saccharomyces Genome Database (SGD),
* Clusters of Orthologous Groups (COG),
* FlyBase,
* Online Mendelian Inheritance in Man (OMIM)

As a result HomoloGene displays information about Genes, Proteins, Phenotypes, and Conserved Domains.

==External links==
{{multicol}}
*[http://www.ncbi.nlm.nih.gov/sites/entrez?db=homologene HomoloGene] at the [[National Center for Biotechnology Information]]
*[http://harvester.embl.de/ Bioinformatic Harvester] - [[Bioinformatic Harvester]], a meta search engine that uses Homologene
*[http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM OMIM]
*[http://zfin.org/cgi-bin/webdriver?MIval=aa-ZDB_home.apg ZFIN]
*[http://www.yeastgenome.org/ SGD]
{{multicol-break}}
*[http://www.ncbi.nlm.nih.gov/COG/ COG]
*[http://flybase.bio.indiana.edu/ FlyBase]
*[http://www.informatics.jax.org/ MGI]
*[http://rgd.mcw.edu/ Rat Genome Database]
{{multicol-end}}

{{Harvesternavi}}

[[Category:Genetics]]
[[Category:Bioinformatics]]

[[de:HomoloGene]]

Revision as of 05:44, 2 March 2009