Jump to content

Biomedical text mining: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Added SBGN link to Texflame entry
m →Cite journal with Wikipedia template filling, reflist
Line 9: Line 9:


== Examples ==
== Examples ==
* [http://www.xtractor.in/ XTractor] - Discovering Newer Scientific Relations Across [[PubMed]] Abstracts. A tool to obtain manually annotated,expert curated relationships for [http://www.xtractor.in/insta_search.do?entity_type=SWISSPROT&search_id=P00533&entity_id=EGFR Proteins], [http://www.xtractor.in/insta_search.do?entity_type=MESH&search_id=D001943&entity_id=Breast%20Neoplasms Diseases], [http://www.xtractor.in/insta_search.do?entity_type=DRUG&search_id=APRD01148&entity_id=Oseltamivir%20Phosphate Drugs] and [http://www.xtractor.in/insta_search.do?entity_type=GO&search_id=GO:0006915&entity_id=apoptosis Biological Processes] as they get published in [[PubMed]].
* [http://www.xtractor.in/ XTractor] Discovering Newer Scientific Relations Across [[PubMed]] Abstracts. A tool to obtain manually annotated,expert curated relationships for [http://www.xtractor.in/insta_search.do?entity_type=SWISSPROT&search_id=P00533&entity_id=EGFR Proteins], [http://www.xtractor.in/insta_search.do?entity_type=MESH&search_id=D001943&entity_id=Breast%20Neoplasms Diseases], [http://www.xtractor.in/insta_search.do?entity_type=DRUG&search_id=APRD01148&entity_id=Oseltamivir%20Phosphate Drugs] and [http://www.xtractor.in/insta_search.do?entity_type=GO&search_id=GO:0006915&entity_id=apoptosis Biological Processes] as they get published in [[PubMed]].
* [http://www.chilibot.net/ Chilibot] - A tool for finding relationships between genes or gene products.
* [http://www.chilibot.net/ Chilibot] A tool for finding relationships between genes or gene products.
* [http://www.ihop-net.org/UniPub/iHOP/ Information Hyperlinked Over Proteins (iHOP)] (ref.: [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16204114 Bioinformatics, 2005 Sep 1;21 Suppl 2:ii252-ii258.]): "A network of concurring genes and proteins extends through the scientific literature touching on phenotypes, pathologies and gene function. iHOP provides this network as a natural way of accessing millions of PubMed abstracts. By using genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed can be converted into one navigable resource, bringing all advantages of the internet to scientific literature research."
* [http://www.ihop-net.org/UniPub/iHOP/ Information Hyperlinked Over Proteins (iHOP)]<ref>{{cite journal |author=Hoffmann R, Valencia A |title=Implementing the iHOP concept for navigation of biomedical literature |journal=Bioinformatics |volume=21 |issue=Suppl 2|pages=ii252–8 |year=2005 |month=September |pmid=16204114 |doi=10.1093/bioinformatics/bti1142 |url=http://bioinformatics.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=16204114}}</ref>: "A network of concurring genes and proteins extends through the scientific literature touching on phenotypes, pathologies and gene function. iHOP provides this network as a natural way of accessing millions of PubMed abstracts. By using genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed can be converted into one navigable resource, bringing all advantages of the internet to scientific literature research."
* [http://fable.chop.edu FABLE] - A gene-centric text-mining search engine for MEDLINE
* [http://fable.chop.edu FABLE] A gene-centric text-mining search engine for MEDLINE
* [http://www.gopubmed.org GoPubMed] - retrieves [[PubMed]] abstracts for your search query, then detects ontology terms from the [[Gene Ontology]] and [[Medical Subject Headings]] in the abstracts and allows the user to browse the search results by exploring the [[Ontology|ontologie]]s and displaying only papers mentioning selected terms, their synonyms or descendants.
* [http://www.gopubmed.org GoPubMed] retrieves [[PubMed]] abstracts for your search query, then detects ontology terms from the [[Gene Ontology]] and [[Medical Subject Headings]] in the abstracts and allows the user to browse the search results by exploring the [[Ontology|ontologie]]s and displaying only papers mentioning selected terms, their synonyms or descendants.
* [http://www.litinspector.org LitInspector] - Gene and signal transduction pathway data mining in [[PubMed]] abstracts.
* [http://www.litinspector.org LitInspector] Gene and signal transduction pathway data mining in [[PubMed]] abstracts.
* [http://www.pubgene.org PubGene] - [[Co-occurrence networks]] display of gene and protein symbols as well as [[Medical Subject Headings|MeSH]], [[Gene Ontology|GO]], [[PubChem]] and interaction terms (such as "binds" or "induces") as these appear in [[MEDLINE]] records (that is, [[PubMed]] titles and abstracts).
* [http://www.pubgene.org PubGene] [[Co-occurrence networks]] display of gene and protein symbols as well as [[Medical Subject Headings|MeSH]], [[Gene Ontology|GO]], [[PubChem]] and interaction terms (such as "binds" or "induces") as these appear in [[MEDLINE]] records (that is, [[PubMed]] titles and abstracts).
* [http://brainarray.mbni.med.umich.edu/Brainarray/prototype/PubAnatomy/ PubAnatomy ] - An interactive visual search engine that provides new ways to explore relationships among Medline literature, text mining results, anatomical structures, gene expression and other background information.
* [http://brainarray.mbni.med.umich.edu/Brainarray/prototype/PubAnatomy/ PubAnatomy ] An interactive visual search engine that provides new ways to explore relationships among Medline literature, text mining results, anatomical structures, gene expression and other background information.
* [http://www.nextbio.com NextBio]- Life sciences search engine with a text mining functionality that utilizes [[PubMed]] abstracts [http://www.nextbio.com/b/home/generalSearch.nb?q=breast+cancer (ex: literature search)]and clinical trials [http://www.nextbio.com/b/home/generalSearch.nb?q=breast+cancer#sitype=TRIALS (example)] to return concepts relevant to the query based on a number of heuristics including ontology relationships, journal impact, publication date, and authorship.
* [http://www.nextbio.com NextBio]- Life sciences search engine with a text mining functionality that utilizes [[PubMed]] abstracts [http://www.nextbio.com/b/home/generalSearch.nb?q=breast+cancer (ex: literature search)]and clinical trials [http://www.nextbio.com/b/home/generalSearch.nb?q=breast+cancer#sitype=TRIALS (example)] to return concepts relevant to the query based on a number of heuristics including ontology relationships, journal impact, publication date, and authorship.
* [http://xldb.fc.ul.pt/biotools/rebil/goa/ GOAnnotator], an online tool that uses [[Semantic similarity]] for verification of electronic protein annotations using GO terms automatically extracted from literature.
* [http://xldb.fc.ul.pt/biotools/rebil/goa/ GOAnnotator], an online tool that uses [[Semantic similarity]] for verification of electronic protein annotations using GO terms automatically extracted from literature.
Line 22: Line 22:


==References==
==References==
{{reflist}}
*[http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15998455 Text-mining and information-retrieval services for molecular biology]
*{{cite journal |author=Krallinger M, Valencia A |title=Text-mining and information-retrieval services for molecular biology |journal=Genome Biol. |volume=6 |issue=7 |pages=224 |year=2005 |pmid=15998455 |pmc=1175978 |doi=10.1186/gb-2005-6-7-224 |url=http://genomebiology.com/1465-6906/6/224}}
*[http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15886388 Text mining for metabolic pathways, signaling cascades, and protein networks]
*{{cite journal |author=Hoffmann R, Krallinger M, Andres E, Tamames J, Blaschke C, Valencia A |title=Text mining for metabolic pathways, signaling cascades, and protein networks |journal=Sci. STKE |volume=2005 |issue=283 |pages=pe21 |year=2005 |month=May |pmid=15886388 |doi=10.1126/stke.2832005pe21 |url=http://stke.sciencemag.org/cgi/pmidlookup?view=long&pmid=15886388}}
*[http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15808823 Text-mining approaches in molecular biology and biomedicine]
*{{cite journal |author=Krallinger M, Erhardt RA, Valencia A |title=Text-mining approaches in molecular biology and biomedicine |journal=Drug Discov. Today |volume=10 |issue=6 |pages=439–45 |year=2005 |month=March |pmid=15808823 |doi=10.1016/S1359-6446(05)03376-3 |url=http://linkinghub.elsevier.com/retrieve/pii/S1359644605033763}}
*[http://blimp.cs.queensu.ca/ Biomedical Literature Mining Publications (BLIMP)]: A comprehensive and regularly updated index of publications on (bio)medical text mining
*[http://blimp.cs.queensu.ca/ Biomedical Literature Mining Publications (BLIMP)]: A comprehensive and regularly updated index of publications on (bio)medical text mining



Revision as of 08:04, 5 December 2009

Biomedical text mining (also known as BioNLP) refers to text mining applied to texts and literature of the biomedical and molecular biology domain. It is a rather recent research field on the edge of natural language processing, bioinformatics, medical informatics and computational linguistics.

There is an increasing interest in text mining and information extraction strategies applied to the biomedical and molecular biology literature due to the increasing number of electronically available publications stored in databases such as PubMed.


Main applications

The main developments in this area have been related to the identification of biological entities (named entity recognition), such as protein and gene names in free text, the association of gene clusters obtained by microarray experiments with the biological context provided by the corresponding literature, automatic extraction of protein interactions and associations of proteins to functional concepts (e.g. gene ontology terms). Even the extraction of kinetic parameters from text or the subcellular location of proteins have been addressed by information extraction and text mining technology.

Examples

  • XTractor — Discovering Newer Scientific Relations Across PubMed Abstracts. A tool to obtain manually annotated,expert curated relationships for Proteins, Diseases, Drugs and Biological Processes as they get published in PubMed.
  • Chilibot — A tool for finding relationships between genes or gene products.
  • Information Hyperlinked Over Proteins (iHOP)[1]: "A network of concurring genes and proteins extends through the scientific literature touching on phenotypes, pathologies and gene function. iHOP provides this network as a natural way of accessing millions of PubMed abstracts. By using genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed can be converted into one navigable resource, bringing all advantages of the internet to scientific literature research."
  • FABLE — A gene-centric text-mining search engine for MEDLINE
  • GoPubMed — retrieves PubMed abstracts for your search query, then detects ontology terms from the Gene Ontology and Medical Subject Headings in the abstracts and allows the user to browse the search results by exploring the ontologies and displaying only papers mentioning selected terms, their synonyms or descendants.
  • LitInspector — Gene and signal transduction pathway data mining in PubMed abstracts.
  • PubGeneCo-occurrence networks display of gene and protein symbols as well as MeSH, GO, PubChem and interaction terms (such as "binds" or "induces") as these appear in MEDLINE records (that is, PubMed titles and abstracts).
  • PubAnatomy — An interactive visual search engine that provides new ways to explore relationships among Medline literature, text mining results, anatomical structures, gene expression and other background information.
  • NextBio- Life sciences search engine with a text mining functionality that utilizes PubMed abstracts (ex: literature search)and clinical trials (example) to return concepts relevant to the query based on a number of heuristics including ontology relationships, journal impact, publication date, and authorship.
  • GOAnnotator, an online tool that uses Semantic similarity for verification of electronic protein annotations using GO terms automatically extracted from literature.
  • TexFlame, an online tool that renders a single PubMed abstract as a Systems Biology Graphical Notation (SBGN)-like graph. The graph is a complete syntactic-semantic representation of the abstract.

References

  1. ^ Hoffmann R, Valencia A (2005). "Implementing the iHOP concept for navigation of biomedical literature". Bioinformatics. 21 (Suppl 2): ii252–8. doi:10.1093/bioinformatics/bti1142. PMID 16204114. {{cite journal}}: Unknown parameter |month= ignored (help)

Conferences at which BioNLP research is presented

BioNLP is presented at a variety of meetings:

See also

External links