From Wikipedia, the free encyclopedia
Jump to: navigation, search
Description DisProt contains information about intrinsically unstructured proteins.
Data types captured intrinsically unstructured proteins
Organisms all
Research center Center for Computational Biology and Bioinformatics at Indiana University School of Medicine and Center for Information Science and Technology at Temple University
Primary citation 15310560
Data format Fasta format and XML
Website [1]
Download URL DisProt download page
Version 6.01
Curation policy yes

In molecular biology, DisProt is a curated biological database collection of intrinsically unstructured proteins.[1][2] Historically, the study of disordered proteins has been hampered by the lack of an organised resource collecting them and their properties together.[2] Release 6.01 of DisProt contains information on 684 proteins. Each protein entry in DisProt is characterised by a DisProt identifier which takes the form of the prefix DP followed by a 5 digit protein identifier. For example, DP000016 refers to the Cyclin-dependent kinase inhibitor 1 protein. DisProt has been widely used to train software methods to predict disordered regions in proteins. In addition, DisProt has been used to understand the properties of intrinsically unstructured proteins.[3]


The DisProt website provides users with an interface to search by keyword, or by sequence similarity using PSI-BLAST or Smith-Waterman alignment. Users can also browse through the entries by their identifier or by functional class. For example, a user can list all the entries which are annotated with the function acetylation. The entire data set can be downloaded from the website in either Fasta format or XML format.

External links[edit]


  1. ^ Vucetic S, Obradovic Z, Vacic V; et al. (January 2005). "DisProt: a database of protein disorder". Bioinformatics 21 (1): 137–40. doi:10.1093/bioinformatics/bth476. PMID 15310560. 
  2. ^ a b Sickmeier M, Hamilton JA, LeGall T; et al. (January 2007). "DisProt: the Database of Disordered Proteins". Nucleic Acids Res. 35 (Database issue): D786–93. doi:10.1093/nar/gkl893. PMC 1751543. PMID 17145717. 
  3. ^ Kovačević JJ (June 2012). "Computational analysis of position-dependent disorder content in DisProt database". Genomics Proteomics Bioinformatics 10 (3): 158–65. doi:10.1016/j.gpb.2012.01.002. PMID 22917189.