Proteomics Standards Initiative

From Wikipedia, the free encyclopedia

The Proteomics Standards Initiative (PSI) is a working group of the Human Proteome Organization. It aims to define data standards for proteomics to facilitate data comparison, exchange and verification.[1][2]

The Proteomics Standards Initiative focuses on the following subjects: minimum information about a proteomics experiment defines the metadata that should be provided along with a proteomics experiment.[3] a data markup language for encoding the data, and metadata ontologies for consistent annotation and representation.

Minimum information about a proteomics experiment[edit]

Minimum information about a proteomics experiment (MIAPE) is a minimum information standard, created by the Proteomics Standards Initiative of the Human Proteome Organization, for reporting proteomics experiments.[4] You can't just introduce the results of an analysis, it is intended to specify all the information necessary to interpret the experiment results unambiguously and to potentially reproduce the experiment.[citation needed] While the MIAPE guidelines define the content required for compliant reports, it does not specify the format in which this data should be presented (which is left to the corresponding *ML format, also defined by PSI[5]), nor does it define how to perform experiments.[6]

Working groups[edit]

Several working groups work on several documents covering the different areas of proteomics:[7]

The gel electrophoresis working group defined reporting requirements for gel electrophoresis experiments. The document is at the stage of a recommendation and has been published.[8] The corresponding data exchange format is called GelML, and a stable version was released in late 2007.[9]

The gel electrophoresis working group also focuses on image analysis with the gel image informatics recommendation that is currently in the public review phase while the corresponding exchange format is only a draft (as of April 2009).[9]

The sample processing working group defines requirements concerning all the sample pre-processing steps that are carried out before gel electrophoresis or mass spectrometry is applied. Two documents concerning column chromatography and capillary electrophoresis are in the early draft stages and the Sample preparation and handling is still a project (as of April 2009). The data exchange format (spML) is also under development.[10]

Mass spectrometry[11] and mass spectrometry informatics[12] documents have been published as recommendations by the mass spectrometry working group.

The working group has released several data exchange format: the mzML, for the capture of data generated by a mass spectrometer, which is a merge of the previous mzData (developed by PSI) and mzXML (developed at the Seattle Proteome Center at the Institute for Systems Biology); mzIdentML, for Mass spectra informatics analysis that capture the results of the identification of proteins and peptides from mass spectrometry data; and TraML, for selected reaction monitoring input file. Finally, they develop MS CV, a controlled vocabulary to use with the previous file formats.[13]

The molecular interactions working group of PSI only works on PSI MI XML, a data exchange format, and on its corresponding ontologies. They have published the MIMIx guidelines (minimum information about a molecular interaction experiment)

Study design and sample generation and statistical analysis of data MIAPE recommendations are also being planned or drafted.[7]

Standard-compliant proteomics repositories[edit]

Several standard-compliant proteomics repositories exist, allowing researchers to publish their data while enforcing MIAPE guidelines. For example: MIAPEGelDB[14] (for gel electrophoresis data), PRIDE[15] (for mass spectrometry data), and ProteoRed MIAPE Generator tool[16] (for gel electrophoresis and mass spectrometry data)

It is expected that journal editors will eventually request authors to publish all their data to such repositories before publication[citation needed].

Similar initiatives[edit]

There are similar initiatives that try to define minimal requirements. For microarrays the MGED Society defined the minimum information about a microarray experiment (MIAME).[17] The standards for reporting of diagnostic accuracy (STARD) is available for studies reporting medical diagnosis accuracies.[18]

References[edit]

  1. ^ Taylor, C. F.; Hermjakob, H.; Julian, R. K.; Garavelli, J. S.; Aebersold, R.; Apweiler, R. (2006). "The Work of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO PSI)". OMICS: A Journal of Integrative Biology. 10 (2): 145–151. doi:10.1089/omi.2006.10.145. PMID 16901219.
  2. ^ "HUPO Proteomics Standards Initiative home page". HUPO Proteomics Standards Initiative. Retrieved 2008-12-06.
  3. ^ Taylor, C. F.; Paton, N. W.; Lilley, K. S.; Binz, P. A.; Julian Jr, R. K.; Jones, A. R.; Zhu, W.; Apweiler, R.; Aebersold, R.; Deutsch, E. W.; Dunn, M. J.; Heck, A. J. R.; Leitner, A.; Macht, M.; Mann, M.; Martens, L.; Neubert, T. A.; Patterson, S. D.; Ping, P.; Seymour, S. L.; Souda, P.; Tsugita, A.; Vandekerckhove, J.; Vondriska, T. M.; Whitelegge, J. P.; Wilkins, M. R.; Xenarios, I.; Yates Jr, J. R.; Hermjakob, H. (2007). "The minimum information about a proteomics experiment (MIAPE)". Nature Biotechnology. 25 (8): 887–893. doi:10.1038/nbt1329. PMID 17687369.
  4. ^ "MIAPE home page". HUPO Proteomics Standards Initiative. Retrieved 2013-05-13.
  5. ^ Hermjakob, H (2006). "The HUPO Proteomics Standards Initiative - Overcoming the fragmentation of Proteomics Data". Practical Proteomics. 6 (S2): 34–38. doi:10.1002/pmic.200600537. PMID 17031794. S2CID 20005411.
  6. ^ Taylor, Chris (2006). "Minimum Reporting Requirements for Proteomics: A MIAPE Primer". Practical Proteomics. 6 (S2): 39–44. doi:10.1002/pmic.200600549. PMID 17031795. S2CID 8175511.
  7. ^ a b "HUPO PSI home page". HUPO Proteomics Standards Initiative. Retrieved 2013-05-13.
  8. ^ Gibson, Frank; Leigh Anderson; Gyorgy Babnigg; Mark Baker; Matthias Berth; Pierre-Alain Binz; Andy Borthwick; Phil Cash; Billy W Day; David B Friedman; Donita Garland; Howard B Gutstein; Christine Hoogland; Neil A Jones; Alamgir Khan; Joachim Klose; Angus I Lamond; Peter F Lemkin; Kathryn S Lilley; Jonathan Minden; Nicholas J Morris; Norman W Paton; Michael R Pisano; John E Prime; Thierry Rabilloud; David A Stead; Chris F Taylor; Hans Voshol; Anil Wipat; Andrew R Jones (2008). "Guidelines for reporting the use of gel electrophoresis in proteomics". Nat. Biotechnol. 26 (8): 863–864. arXiv:0904.0694. doi:10.1038/nbt0808-863. ISSN 1087-0156. PMID 18688234. S2CID 1231720.
  9. ^ a b "MIAPE Gel Electrophoresis working group page". HUPO Proteomics Standards Initiative. Retrieved 2009-04-23.
  10. ^ "MIAPE Sample Processing working group page". HUPO Proteomics Standards Initiative. Retrieved 2009-04-23.
  11. ^ Taylor, Chris F; Pierre-Alain Binz; Ruedi Aebersold; Michel Affolter; Robert Barkovich; Eric W Deutsch; David M Horn; Andreas Huhmer; Martin Kussmann; Kathryn Lilley; Marcus Macht; Matthias Mann; Dieter Muller; Thomas A Neubert; Janice Nickson; Scott D Patterson; Roberto Raso; Kathryn Resing; Sean L Seymour; Akira Tsugita; Ioannis Xenarios; Rong Zeng; Randall K Julian (2008). "Guidelines for reporting the use of mass spectrometry in proteomics". Nat. Biotechnol. 26 (8): 860–861. doi:10.1038/nbt0808-860. ISSN 1087-0156. PMID 18688232. S2CID 205270031.
  12. ^ Binz, Pierre-Alain; Robert Barkovich; Ronald C Beavis; David Creasy; David M Horn; Randall K Julian; Sean L Seymour; Chris F Taylor; Yves Vandenbrouck (2008). "Guidelines for reporting the use of mass spectrometry informatics in proteomics". Nat. Biotechnol. 26 (8): 862. doi:10.1038/nbt0808-862. ISSN 1087-0156. PMID 18688233. S2CID 205270035.
  13. ^ "MIAPE mass spectrometry working group page". HUPO Proteomics Standards Initiative. Retrieved 2009-04-23.
  14. ^ "MIAPEGelDB home page". ExPASy. Retrieved 2008-12-07.
  15. ^ "PRIDE PRoteomics IDEntifications database home page". European Bioinformatics Institute. Retrieved 2008-12-07.
  16. ^ "MIAPE Generator tool". ProteoRed. Archived from the original on 2013-04-15. Retrieved 2009-03-06.
  17. ^ Brazma, Alvis; Pascal Hingamp; John Quackenbush; Gavin Sherlock; Paul Spellman; Chris Stoeckert; John Aach; Wilhelm Ansorge; Catherine A. Ball; Helen C. Causton; Terry Gaasterland; Patrick Glenisson; Frank C.P. Holstege; Irene F. Kim; Victor Markowitz; John C. Matese; Helen Parkinson; Alan Robinson; Ugis Sarkans; Steffen Schulze-Kremer; Jason Stewart; Ronald Taylor; Jaak Vilo; Martin Vingron (December 2001). "Minimum information about a microarray experiment (MIAME)—toward standards for microarray data". Nat Genet. 29 (4): 365–371. doi:10.1038/ng1201-365. ISSN 1061-4036. PMID 11726920.
  18. ^ Bossuyt, Patrick M.; Johannes B. Reitsma; David E. Bruns; Constantine A. Gatsonis; Paul P. Glasziou; Les M. Irwig; David Moher; Drummond Rennie; Henrica C.W. de Vet; Jeroen G. Lijmer (2003-01-01). "The STARD Statement for Reporting Studies of Diagnostic Accuracy: Explanation and Elaboration". Clin Chem. 49 (1): 7–18. doi:10.1373/49.1.7. PMID 12507954.

External links[edit]