Draft:List of proteins in the human body
There are many speculations to the number of proteins which could potentially be made,[1][2][3][4] however various sources agree on an estimated 10,000 proteins which are relevant to the human body.[5][6][7][8] These estimates are thanks to recent advances in technology leading to the birth of the field of Proteomics. Another consensus which can also be found is that it takes 20000-25000 Proteins to make a human work and that our cells have the potential to make around a 100 000 different proteins depending on what is most convenient in the environment we live in.[9][10] Even though this seems like a fair estimate, it is hard to find an empirical database which claims to list more than 10 000 proteins. Another way to approach it is by looking from the bottom up there are ~20,000 protein coding genes in the human genome, (of which it is estimated that 12,733 already have Wikipedia articles (the Gene Wiki) about them). If wone where to Including splice variants, some argue, that there could be as many as 500,000 unique human proteins.[11] All of these speculations are based on how the average human at the start of the 21' century looks.
This is a list for only those proteins which have been found in the human body so far. It contains about 1100 unique proteins (all of which have their own wiki article), some of which are listed multiple times waiting to be merged into one row. After the merging work there is still about 9200 proteins yet to be described and listed.
Proteins
[edit]This section needs expansion. You can help by adding to it. (January 2015) |
Naming Convention for the list
[edit]Essential cell biology 5 edition lists: Enzymes(which typically have EC number), Transport proteins(which typically have TC number), Motor Proteins, Storage Proteins, Signal proteins, Receptor Proteins, Transcription Proteins as the most common types of proteins. Finally the make a very broad category of Special-purpose proteins. So far this seems to be the most simple yet sufficient way to split proteins into types
Column name | Meaning and convention |
---|---|
name | The name of the protein in accordance with Gene nomenclature |
Protein type | Either: Enzymes(EC number should be added here*), Transport proteins(TC number Should be added here), Motor Proteins, Storage Proteins, Signal proteins, Receptor Proteins, Trranscription Proteins or Special-purpose proteins |
Location | The Tissue, Organs or place and thing in the body where this protein is used |
Produced by | The cell which typically produce the protein, in case the protein is not produced in the human body please start this column with (Outside the human body). |
Folding Variations | This refers to the proteins structural domain or Protein domain |
Produced by the human body | This column indicates weather a protein can be made my the human body. There is a place for proteins in this list which are not made in the human body but which still have a messurable effect on the human body, such as certain types of medicine. |
*An Enzyme Commission number refers to the reaction catalyzed by an enzyme and not the enzyme protein itself. This means that there is often more than one human gene that corresponds to a given EC number.
List of databases containing Human Proteins
[edit]This section needs expansion. You can help by adding to it. (January 2015) |
DB name | DB website | Provider | Data sources | Revenue/Sponsors sources | Integrates | Wiki article | Desc. | Size | DB type |
---|---|---|---|---|---|---|---|---|---|
Database of Macromolecular Movements | http://www.molmovdb.org/ | describes the motions that occur in proteins and other macromolecules, particularly using movies | protein structure databases | ||||||
Dynameomics | https://ngdc.cncb.ac.cn/databasecommons/database/id/3509 | a data warehouse of molecular dynamics simulations and analyses of proteins representing all known protein fold families | protein structure databases | ||||||
JenaLib | https://jenalib.leibniz-fli.de/ | the Jena Library of Biological Macromolecules is aimed at a better dissemination of information on three-dimensional biopolymer structures with an emphasis on visualization and analysis. | protein structure databases | ||||||
OCA | http://oca.weizmann.ac.il/oca-bin/ocamain | KEGG, OMIM, PDBselect, Pfam, PubMed, SCOP, SwissProt, and others | a browser-database for protein structure/function | protein structure databases | |||||
PDB Lite/SUM | https://www.ebi.ac.uk/pdbe/, http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/, | EMBL | PDB Lite/SUM | lite: derived from OCA, PDB Lite was provided to make it as easy as possible to find and view a macromolecule within the PDB sum: provides an overview macromolecular structures in the PDB, giving schematic diagrams of the molecules in each structure and of the interactions between them | protein structure databases | ||||
PDBTM | https://web.archive.org/web/20131225065028/http://pdbtm.enzim.hu/ | the Protein Data Bank of Transmembrane Proteins – — a selection of the PDB. | protein structure databases | ||||||
PDBWiki | a community annotated knowledge base of biological molecular structures | protein structure databases | |||||||
InterPro | http://www.ebi.ac.uk/interpro/ | ELIXIR infrastructure | European Bioinformatics Institute | EMBL, The Welcome trust, BBSRC | CATH-Gene3D, CDD, HAMAP, MobiDB, PANTHER, Pfam, SMART, SUPERFAMILY, SFLD, TIGRFAMs, | InterPro | classifies proteins into families and predicts the presence of domains and sites | Protein sequence databases | |
NextProt | https://www.nextprot.org/ | CALIPHO (is a group at the SIB) | Swiss Institute of Bioinformatics | https://www.sib.swiss/about/funding-sources | UniProt, Cellosaurus, Gnomad, IntAct, SRAA Atlas, Uniprot – GOA, BGEE, COSMIC, MassIVE, Peptide atlas | neXtProt | a human protein-centric knowledge resource | Protein sequence databases | |
Wiki-pi | http://severus.dbmi.pitt.edu/wiki-pi/ | Madhavi K. Ganapathiraju | At present Wiki-Pi contains 48,419 unique interactions among 10,492 proteins. However it is not clear if this is unique proteins[13] | Protein interactoin Database | |||||
Human Protein Reference Database | Institute of Bioinformatics (IOB), Bangalore, India | Human Protein Reference Database | One source claims 15000 [13] proteins. But it is unclear how many of these are unique | ||||||
Sanger Institute | Pfam | protein families database of alignments and HMMs | Protein sequence databases | ||||||
Human Proteinpedia | Institute of Bioinformatics (IOB), Bangalore and Johns Hopkins University, | Human Proteinpedia | The human Proteinpedia is based on HPRD (Human protein reference database)which is a repository hosting over 30,000 human proteins. However it is unclear how many of these are unique proteins | ||||||
Human Protein Atlas | The Swedish Government | Human Protein Atlas | It contains roughly 10 million IHC images of a bit less than 25,000 antibodies. But once again it is unclear how many of these are unique | ||||||
Manchester University | PRINTS | a compendium of protein fingerprints | Protein sequence databases | ||||||
PROSITE | database of protein families and domains | Protein sequence databases | |||||||
Georgetown University Medical Center [GUMC] | Protein Information Resource | Protein sequence databases | |||||||
SUPERFAMILY | library of HMMs representing superfamilies and database of (superfamily and family) annotations for all completely sequenced organisms | Protein sequence databases | |||||||
Swiss Institute of Bioinformatics | Swiss-Prot | protein knowledgebase | Protein sequence databases | ||||||
NCBI | protein sequence and knowledgebase (National Center for Biotechnology Information) | Protein sequence databases | |||||||
Protein DataBank in Europe (PDBe),[14] ProteinDatabank in Japan (PDBj),[15] Research Collaboratory for Structural Bioinformatics (RCSB)[16] | Protein Data Bank | (PDB) | Protein structure databases | ||||||
Structural Classification of Proteins (SCOP) | Protein structure databases | ||||||||
Protein Structure Classification database | CATH : | Protein structure databases | |||||||
Sali Lab, UCSF | ModBase | database of comparative protein structure models | Protein model databases | ||||||
Similarity Matrix of Proteins | SIMAP | database of protein similarities computed using FASTA | Protein model databases | ||||||
Swiss-model | server and repository for protein structure models | Protein model databases | |||||||
AAindex | : database of amino acid indices, amino acid mutation matrices, and pair-wise contact potentials | Protein model databases | |||||||
Samuel Lunenfeld Research Institute | BioGRID | general repository for interaction datasets | Protein-protein and other molecular interactions | ||||||
RNA-binding protein databas | Protein-protein and other molecular interactions | ||||||||
Univ. of California | Database of Interacting Proteins | Protein-protein and other molecular interactions | |||||||
(EMBL-EBI) | IntAct:[17] | open-source database for molecular interactions | Protein-protein and other molecular interactions | ||||||
String | an open source molecular interaction database to study interactions between proteins | Protein-protein and other molecular interactions | |||||||
Human Protein Atlas | aims at mapping all the human proteins in cells, tissues and organs | Protein expression databases | |||||||
ProteinModelPortal | Protein Model Portal of the PSI-Nature Structural Biology Knowledgebase | ?? | ?? | 3D structure protein databases | |||||
SMR | Database of annotated 3D protein structure models | University of Basel | The Swiss government | 3D structure protein databases | |||||
DisProt | Database of Protein Disorder | ELIXIR infrastructure | Indiana University School of Medicine, Temple University, University of Padua | funding from the European Union's Horizon 2020 | Swiss Prot/Uni Prot, CATH, Pfam, Europe PMC, BITEM, ECO, Geneontology | DisProt | database of experimental evidences of disorder in proteins | 3D structure protein databases, Protein sequence databases | |
MobiDB | Database of intrinsically disordered and mobile proteins | John Moult, Christine Orengo, Predrag Radivojac | University of Padua | Italian Government | MobiDB | database of intrinsic protein disorder annotation | 3D structure protein databases, Protein sequence databases | ||
ModBase | Database of Comparative Protein Structure Models | Ursula Pieper, Ben Webb, Narayanan Eswar, Andrej Sali Roberto Sanchez | UCSF, Sali Lab | 3D structure protein databases | |||||
PDBsum | Pictorial database of 3D structures in the Protein Data Bank | European Bioinformatics Institute 2013 | Wellcome Trust | 3D structure protein databases | |||||
CCDS | The Consensus CDS protein set database | NCBI | ?? | Sequence databases | |||||
DDBJ | DNA Data Bank of Japan | ?? | ?? | Sequence databases | |||||
ENA | European Nucleotide Archive | ?? | ?? | Sequence databases | |||||
GenBank | GenBank nucleotide sequence database | ?? | ?? | Sequence databases | |||||
Refseq | NCBI Reference Sequence Database | ?? | ?? | Sequence databases | |||||
UniGene | Database of computationally identifies transcripts from the same locus | ?? | ?? | Sequence databases | |||||
UniProtKB | Universal Protein Resource (UniProt) | ?? | ?? | Sequence databases | |||||
Swiss Prot/Uni Prot | https://www.sib.swiss/swiss-prot,https://www.uniprot.org/ | SIB Swiss Institute of Bioinformatics | European Bioinformatics Institute (EMBL-EBI) | Swiss-Prot has collected over 81 000 variants in roughly 13,000 human protein sequence records from peer-reviewed literature. It is unclear how many unique proteins types are present in the database. |
See also
[edit]- List of skeletal muscles of the human body
- List of organs of the human body
- List of distinct cell types in the adult human body
- List of human microbiota
References
[edit]- ^ "How Many Proteins Exist? (with pictures)". All the Science. 2023-05-26. Retrieved 2023-07-12.
- ^ "Is there are theoretical limit to the number of proteins possible and their respective structure?". Biology Stack Exchange. Retrieved 2023-07-12.
- ^ "How many different possible proteins, 300 amino acids in length, could exist?". Quora. Retrieved 2023-07-12.
- ^ "How can 20 different amino acids permit 75000 different proteins?". homework.study.com. Retrieved 2023-07-12.
- ^ Rosenberger G, Koh CC, Guo T, Röst HL, Kouvonen P, Collins BC, et al. (2014). "A repository of assays to quantify 10,000 human proteins by SWATH-MS". Scientific Data. 1: 140031. doi:10.1038/sdata.2014.31. PMC 4322573. PMID 25977788.
- ^ Jiang L, Wang M, Lin S, Jian R, Li X, Chan J, et al. (October 2020). "A Quantitative Proteome Map of the Human Body". Cell. 183 (1): 269–283.e19. doi:10.1016/j.cell.2020.08.036. PMC 7575058. PMID 32916130.
- ^ Avenue, 677 Huntington; Boston; Ma 02115 (2012-09-18). "Protein". The Nutrition Source. Retrieved 2023-07-12.
{{cite web}}
: CS1 maint: numeric names: authors list (link) - ^ "What is a protein? | Precision Medicine – SIB". www.precisionmed.ch. 2019-07-19. Retrieved 2023-07-12.
- ^ "3.7: Proteins, Genes and Evolution- How Many Proteins are We?". Biology LibreTexts. 2019-01-04. Retrieved 2023-07-19.
- ^ Aebersold R, Agar JN, Amster IJ, Baker MS, Bertozzi CR, Boja ES, et al. (February 2018). "How many human proteoforms are there?". Nature Chemical Biology. 14 (3): 206–214. doi:10.1038/nchembio.2576. PMC 5837046. PMID 29443976.
- ^ Pray L (2008). "Eukaryotic genome complexity". Nature Education. 1 (1): 96.
- ^ Murín R, Schaer A, Kowtharapu BS, Verleysdonk S, Hamprecht B (May 2008). "Expression of 3-hydroxyisobutyrate dehydrogenase in cultured neural cells". Journal of Neurochemistry. 105 (4): 1176–1186. doi:10.1111/j.1471-4159.2008.05298.x. PMID 18284611. S2CID 205618976.
- ^ Keshava Prasad, T. S.; Goel, R.; Kandasamy, K.; Keerthikumar, S.; Kumar, S.; Mathivanan, S.; Telikicherla, D.; Raju, R.; Shafreen, B.; Venugopal, A.; Balakrishnan, L.; Marimuthu, A.; Banerjee, S.; Somanathan, D. S.; Sebastian, A.; Rani, S.; Ray, S.; Harrys Kishore, C. J.; Kanth, S.; Ahmed, M.; Kashyap, M. K.; Mohmood, R.; Ramachandra, Y. L.; Krishna, V.; Rahiman, B. A.; Mohan, S.; Ranganathan, P.; Ramabadran, S.; Chaerkady, R.; Pandey, A. (2009). "Human Protein Reference Database--2009 update". Nucleic Acids Research. 37 (Database): D767–D772. doi:10.1093/nar/gkn892. PMC 2686490. PMID 18988627.
- ^ Mir S, Alhroub Y, Anyango S, Armstrong DR, Berrisford JM, Clark AR, et al. (January 2018). "PDBe: towards reusable data delivery infrastructure at protein data bank in Europe". Nucleic Acids Research. 46 (D1): D486–D492. doi:10.1093/nar/gkx1070. PMC 5753225. PMID 29126160.
- ^ Kinjo AR, Bekker GJ, Suzuki H, Tsuchiya Y, Kawabata T, Ikegawa Y, Nakamura H (January 2017). "Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures". Nucleic Acids Research. 45 (D1): D282–D288. doi:10.1093/nar/gkw962. PMC 5210648. PMID 27789697.
- ^ Rose PW, Prlić A, Altunkaya A, Bi C, Bradley AR, Christie CH, et al. (January 2017). "The RCSB protein data bank: integrative view of protein, gene and 3D structural information". Nucleic Acids Research. 45 (D1): D271–D281. doi:10.1093/nar/gkw1000. PMC 5210513. PMID 27794042.
- ^ Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, et al. (January 2004). "IntAct: an open source molecular interaction database". Nucleic Acids Research. 32 (Database issue): D452-5. doi:10.1093/nar/gkh052. PMC 308786. PMID 14681455.
Sources
[edit][[Category:Human proteins| ]]