|Description||DisProt contains information and disorder annotation in intrinsically unstructured proteins.|
|Annotation of intrinsically unstructured proteins|
|Research center||Center for Computational Biology and Bioinformatics at Indiana University School of Medicine, Center for Information Science and Technology at Temple University, Department of Biomedical Sciences at University of Padua|
|Primary citation||PMID 27899601|
|Release date||September 2016|
|Data format||CVS and JSON|
|Web service URL||REST API see info here|
|Curation policy||Yes - manual curation from the literature|
In molecular biology, DisProt is a curated biological database collection of intrinsically unstructured proteins. It is a community resource annotating protein sequences for intrinsically disorder regions from the literature. DisProt classifies intrinsic disorder based on experimental methods and three ontologies for molecular function, transition and binding partners.
Historically, the study of disordered proteins has been hampered by the lack of an organised resource collecting them and their properties together. Release 7 of DisProt contains information on more than 800 proteins. Each protein entry in DisProt is characterised by a DisProt identifier which takes the form of the prefix DP followed by a 5 digit protein identifier. For example, DP00016 refers to the Cyclin-dependent kinase inhibitor 1 protein. DisProt has been widely used to train software methods to predict disordered regions in proteins. In addition, DisProt has been used to understand the properties of intrinsically unstructured proteins.
The DisProt website provides users with an interface to search by keyword, freetext, or by sequence similarity using BLAST. Users can also browse through the entries (proteins or regions) by their identifier, detection method or PubMed ID. The entire data set can be downloaded from the website in either CVS or JSON format.
DisProt web-server exposes some RESTful endpoints allowing programmatic access to DisProt and retrieval of different data types. Available GET routes provide access to all available data given a DisProt ID, a list of entries of a given type or a list of functional terms used for DisProt annotation.
- Vucetic, Slobodan; Obradovic, Zoran; Vacic, Vladimir; Radivojac, Predrag; Peng, Kang; Iakoucheva, Lilia M.; Cortese, Marc S.; Lawson, J. David; Brown, Celeste J. (2005-01-01). "DisProt: a database of protein disorder". Bioinformatics. 21 (1): 137–140. doi:10.1093/bioinformatics/bth476. ISSN 1367-4803. PMID 15310560.
- Sickmeier, Megan; Hamilton, Justin A.; LeGall, Tanguy; Vacic, Vladimir; Cortese, Marc S.; Tantos, Agnes; Szabo, Beata; Tompa, Peter; Chen, Jake (2007-01-01). "DisProt: the Database of Disordered Proteins". Nucleic Acids Research. 35 (Database issue): D786–793. doi:10.1093/nar/gkl893. ISSN 1362-4962. PMC . PMID 17145717.
- Piovesan, Damiano; Tabaro, Francesco; Mičetić, Ivan; Necci, Marco; Quaglia, Federica; Oldfield, Christopher J.; Aspromonte, Maria Cristina; Davey, Norman E.; Davidović, Radoslav (2016-11-28). "DisProt 7.0: a major update of the database of disordered proteins". Nucleic Acids Research. doi:10.1093/nar/gkw1056. ISSN 1362-4962. PMID 27899601.
- Kovačević JJ (June 2012). "Computational analysis of position-dependent disorder content in DisProt database". Genomics Proteomics Bioinformatics. 10 (3): 158–65. doi:10.1016/j.gpb.2012.01.002. PMID 22917189.