|Industry||Internet, Information science|
|Headquarters||Raleigh, North Carolina February, 2007|
|Key people||Antony J. Williams, VP of Strategic Development|
|Parent||Royal Society of Chemistry|
The database contains more than 26 million unique molecules from over 400 data sources including those listed below.
- A-L: EPA DSSTox, U.S. Food and Drug Administration (FDA), Human Metabolome Database, Journal of Heterocyclic Chemistry, KEGG, KUMGM, LeadScope, LipidMAPS
- M-N: Marinlit, MDPI, MICAD, MLSMR, MMDB, MOLI, MTDP, Nanogen, Nature Chemical Biology, NCGC, NIAID, National Institutes of Health (NIH), NINDS Approved Drug Screening Program, NIST, NIST Chemistry WebBook, NMMLSC, NMRShiftDB
- P-S: PANACHE, PCMD, PDSP, Peptides, Prous Science Drugs of the Future, QSAR, R&D Chemicals, San Diego Center for Chemical Genomics, SGCOxCompounds, SGCStoCompounds, SMID, Specs, Structural Genomics Consortium, SureChem, Synthon-Lab
- T-Z: Thomson Pharma, Total TOSLab Building-Blocks, UM-BBD, UPCMLD, UsefulChem, Web of Science, xPharm, ZINC
The ChemSpider database can be updated with user contributions including chemical structure deposition, spectra deposition and user curation. This is a crowdsourcing approach to develop an online chemistry database. Crowdsourced based curation of the data has produced a dictionary of chemical names associated with chemical structures that has been used in text-mining applications of the biomedical and chemical literature.
A number of available search modules are provided:
- The standard search allows querying for systematic names, trade names and synonyms and registry numbers
- The advanced search allows interactive searching by chemical structure, chemical substructure, using also molecular formula and molecular weight range, CAS numbers, suppliers, etc. The search can be used to widen or restrict already found results.
- Structure searching on mobile devices can be done using free apps for iPhone/iPod/iPad and Android.
Chemistry document mark-up
The ChemSpider database has been used in combination with text mining as the basis of chemistry document markup. ChemMantis, the Chemistry Markup And Nomenclature Transformation Integrated System uses algorithms to identify and extract chemical names from documents and web pages and converts the chemical names to chemical structures using name-to-structure conversion algorithms and dictionary look-ups in the ChemSpider database. The result is an integrated system between chemistry documents and information look-up via ChemSpider into over 150 data sources.
ChemSpider was acquired by the Royal Society of Chemistry in May, 2009. Prior to the acquisition by RSC, ChemSpider was controlled by a private corporation, ChemZoo Inc. The system was first launched in March 2007 in a beta release form and transitioned to release in March 2008. ChemSpider has expanded the generic support of a chemistry database to include support of the Wikipedia chemical structure collection via their WiChempedia implementation.
A number of services are made available online. These include the conversion of chemical names to chemical structures, the generation of SMILES and InChI strings as well as the prediction of many physicochemical parameters and integration to a web service allowing NMR prediction. The organization is working with RSC to develop a hash table resolver for InChIKeys, shorter hashed forms of InChIs.
SyntheticPages is a free interactive database of synthetic chemistry procedures operated by the Royal Society of Chemistry. Users submit synthetic procedures which they have conducted themselves for publication on the site. These procedures may be original works, but they are more often based on literature reactions. Citations to the original published procedure are made where appropriate. They are checked by a scientific editor before posting. The pages do not undergo formal peer-review like a scientific journal article but comments can be made by logged-in users. The comments are also moderated by scientific editors. The intention is to collect practical experience of how to conduct useful chemical synthesis in the lab. While experimental methods published in an ordinary academic journal are listed formally and concisely, the procedures in ChemSpider SyntheticPages are given with more practical detail. Informality is encouraged. Comments by submitters are included as well. Other publications with comparable amounts of detail include Organic Syntheses and Inorganic Syntheses. The SyntheticPages site was originally set up by Professors Kevin Booker-Milburn (University of Bristol), Stephen Caddick (University College London), Peter Scott (University of Warwick) and Dr Max Hammond. In February 2010 a merger was announced with the Royal Society of Chemistry's chemical structure search engine ChemSpider and the formation of ChemSpider|SyntheticPages (CS|SP).
ChemSpider is serving as the chemical compound repository as part of the Open PHACTS project, an Innovative Medicines Initiative. Open PHACTS will deploy a highly innovative open standards, open access, semantic web approach to address key bottlenecks in small molecule drug discovery - disparate information sources, lack of standards and information overload.
Notes and references
- Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining, Kristina M Hettne, Antony J Williams, Erik M van Mulligen, Jos Kleinjans, Valery Tkachenko and Jan A Kors, Journal of Cheminformatics, Volume 2, Number 1, 3 doi:10.1186/1758-2946-2-3
- Welcome ChemMantis to ChemZoo and a Call for Contributions from the Community,2008-10-23, A. Williams,blog post
- "RSC acquires ChemSpider". Royal Society of Chemistry. 11 May 2009. Retrieved 2009-05-11.
- "ChemSpider SyntheticPages". ChemSpider SyntheticPages. Royal Society of Chemistry. Retrieved 26 June 2012.
- "ChemSpider and SyntheticPages support synthetic chemistry". RSC Publishing (in English). Royal Society of Chemistry. 5. Retrieved 26 June 2012.
- Williams, A. J. et al. Open PHACTS: semantic interoperability for drug discovery. Drug Discovery Today (2012). doi:10.1016/j.drudis.2012.05.016
- Chemical & Engineering News 85 (24). June 11, 2007.
- Antony John Williams (Jan-Feb 2008). "ChemSpider and Its Expanding Web: Building a Structure-Centric Community for Chemists". Chemistry International 30 (1).
- Antony John Williams (Apr - May 2008). "Public Chemical Compound Databases". Current Opinion in Drug Discovery & Development 11 (3).
- Sean Ekins, Manisha Iyer, Matthew D. Krasowski and Evan D. Kharasch (2008). "Molecular Characterization of CYP2B6 Substrates". Current Drug Metabolism 9 (5): 363–73. PMC 2426921. PMID 18537573.
- Geoff Brumfiel (2008-05-07). "Chemists Spin a Web of Data". Nature 453 (7192): 139. Bibcode:2008Natur.453..139B. doi:10.1038/453139a. PMID 18464701.
- E. Curry, A. Freitas, and S. O’Riáin, “The Role of Community-Driven Data Curation for Enterprises,” in Linking Enterprise Data, D. Wood, Ed. Boston, MA: Springer US, 2010, pp. 25–47.