List of mass spectrometry software

From Wikipedia, the free encyclopedia
  (Redirected from Mass spectrometry software)
Jump to: navigation, search

Mass spectrometry software is software used for data acquisition, analysis, or representation in mass spectrometry.

MS/MS peptide identification[edit]

Within the field of protein mass spectrometry, tandem mass spectrometry (also known as MS/MS or MS2) experiments are used for protein/peptide identification.

In these experiments, sample proteins are broken up into short peptides using an enzyme like trypsin and separated in time using liquid chromatography. They are then sent through one mass spectrometer to separate them by mass. Peptide having a specific mass are then typically fragmented using collision-induced dissociation and sent through a second mass spectrometer, which will generate a set of fragment peaks from which the amino acid sequence of the peptide may often be inferred. Peptide identification software is used to try to reliably make these inferences.[1]

A typical experiment involves several hours of mass spectrometer time, and recent instruments may produce hundreds of thousands of MS/MS spectra, which must then be interpreted.

Peptide identification algorithms fall into two broad classes: database search and de novo search. The former search takes place against a database containing all amino acid sequences assumed to be present in the analyzed sample, whereas the latter infers peptide sequences without knowledge of genomic data. At present, database search is more popular and considered to produce higher quality results for most uses. With increasing instrument precision, however, de novo search may become increasingly attractive.

Database search algorithms[edit]


SEQUEST is a proprietary tandem mass spectrometry data analysis program developed by John Yates and Jimmy Eng in 1994.[2] The algorithm used by this program is covered by several US and European software patents. SEQUEST identifies collections of tandem mass spectra to peptide sequences that have been generated from databases of protein sequences. It was one of the first, if not the first, database search program.

SEQUEST, like many engines, identifies each tandem mass spectrum individually. The software evaluates protein sequences from a database to compute the list of peptides that could result from each. The peptide's intact mass is known from the mass spectrum, and SEQUEST uses this information to determine the set of candidate peptides sequences that could meaningfully be compared to the spectrum by including only those which are near the mass of the observed peptide ion. For each candidate peptide, SEQUEST projects a theoretical tandem mass spectrum, and SEQUEST compares these theoretical spectra to the observed tandem mass spectrum by the use of cross correlation. The candidate sequence with the best matching theoretical tandem mass spectrum is reported as the best identification for this spectrum.


Mascot[3] is a proprietary identification program available from Matrix Science. It performs mass spectrometry data analysis through a statistical evaluation of matches between observed and projected peptide fragments rather than cross correlation. As of version 2.2, support for peptide quantitation methods is provided in addition to the identification features.

PEAKS DB[edit]

PEAKS DB is a proprietary database search engine, run in parallel with de novo sequencing to automatically validate search results, allowing for a higher number of found sequences for a given false discovery rate. In addition to providing an independent database search, results can be incorporated as part of the software’s multi-engine (Sequest, Mascot, X!Tandem, OMSSA, PEAKS DB) consensus reporting tool, inChorus.[4] The tool also provides a list of sequences identified exclusively by de novo sequencing.


X!Tandem[5] is open source software that can match tandem mass spectra with peptide sequences.

Unlike some earlier generation search engines, all of the X! Series search engines calculate statistical confidence (expectation values) for all of the individual spectrum-to-sequence assignments. They also reassemble all of the peptide assignments in a data set onto the known protein sequences and assign the statistical confidence that this assembly and alignment is non-random (i.e., did not occur by chance).[6] Therefore, separate assembly and statistical analysis software (e.g. PeptideProphet and ProteinProphet) are not needed. This software has a simple, XML-based input file format.[7] This format is used for all of the X! series search engines, as well as the GPM and GPMDB.


X!!Tandem[8] is a parallel, high performance version of X!Tandem that has been parallelized via MPI to run on clusters or other non-shared memory multiprocessors running Linux. In X!!Tandem the search is parallelized by splitting the input spectra into as many subsets as there are processors, and processing each subset independently. Both compute-intensive stages of the processing (initial and refinement) are parallelized, and overall speedups in excess of 20-fold have been observed on real datasets.

With the exception of the details related to MPI launch, it is run exactly as X!Tandem, and produces exactly the same results using the same input and configuration files. It differs from Parallel Tandem[9] in that the parallelism is handled internally, rather than as an external driver/wrapper.


Phenyx is developed by Geneva Bioinformatics (GeneBio) in collaboration with the Swiss Institute of Bioinformatics (SIB). Phenyx incorporates OLAV, a family of statistical scoring models, to generate and optimize scoring schemes that can be tailored for all kinds of instruments, instrumental set-ups and general sample treatments.[10] Phenyx computes a score to evaluate the quality of a match between a theoretical and experimental peak list (i.e. mass spectrum). A match is thus a collection of observations deduced from this comparison. The basic peptide score is ultimately transformed into a normalized z-score and a p-value. A basic peptide score is the sum of raw scores for up to twelve physico-chemical properties.

In addition to regular peptide and protein identification features, Phenyx proposes a number of additional functionalities, such as: a result comparison interface to visualise side-by-side multiple results; an import functionality to incorporate results from other search engines; a manual validation feature to manually accept/reject identifications and dynamically recalculates protein scores.


OMSSA[11] is an open source database search program developed at NCBI.[12]


Developed at NCBI Robust Accurate Identification (RAId)[13] is a suite of proteomics tools for analyzing tandem mass spectrometry data with accurate statistics.[14]

In addition, RAId offers users with different modes of data analysis: database search, generation of de novo score distribution of all possible peptides using different scoring functions (RAId, XCorr, Hyperscore, Kscore),[15] and statistical confidence reassignment. In particular, RAId's integrated knowledge databases[16] incorporate known single amino polymorphisms, post-translational modifications and disease information, providing dynamic information retrieval for biomedical applications.


MyriMatch[17] is an open source database search program developed at the Vanderbilt Medical Center.[18]

MyriMatch designed to run in a single-computer environment or across an entire cluster of processing nodes. It is able to optimally divide work in a much more efficient way than many other database search programs. This is because it only generates candidate sequences from the known database once for the entire set of spectra instead of once for every spectrum. Thus, for each candidate sequence generated, it is compared against every spectrum. The spectra keep a certain (user-defined) number of candidate sequences that had the highest scores. Myrimatch can calculate several standard scores (Xcorr, dotproduct) but is based on a more sophisticated scoring algorithm that classifies peaks according to intensity (MVH). This leads myrimatch to outperform many of the older scoring algorithms. Full documentation can be found here. A graphical user interface that includes Myrimatch as the database search engine and Pepitome as the library search engine is described here. Direct download links are available from this page.


Greylag is an open source database search program developed at the Stowers Institute for Medical Research.[19] Its scoring algorithm is based on that of MyriMatch, but it includes a novel FDR (false discovery rate) validation algorithm as well. It is designed to perform large searches on computational clusters having hundreds of nodes. Notably, it is largely implemented in an interpreted language, Python, with only the CPU-intensive routines written in a compiled language (C++).


Byonic is a database search program released in 2011 by Protein Metrics Inc.[20] with original developments at PARC.[21] Byonic searches MS/MS data from all types of instruments and internally employs the program Combyne,[22] which combines peptide identifications to produce protein scores and identification probabilities.


A MS-alignment search engine available at the Center for Computational Mass Spectrometry at the University of California, San Diego[23]


SIMS (Sequential Interval Motif Search)[24] is a software tool design to perform unrestrictive PTM search over tandem mass spectra. In other words, users do not have to characterize the potential PTMs. Instead, users only need to specify the range of modification mass for each individual amino acid.[25]


MassWiz[26] is a free, open source search algorithm developed at Institute of Genomics and Integrative Biology. It is available as a windows commandline tool[27] and also as a webserver.[28]


SimTandem is a tool for identification of peptide sequences from LC/MS/MS data.[29][30] The engine can be used as an external tool in OpenMS/TOPP.

De novo sequencing algorithms[edit]

De novo peptide sequencing algorithms are based, in general, on the approach proposed in Bartels et al. (1990).[31]


DeNovoX performs de novo sequencing on CID spectra acquired with ion trap mass spectrometers. The software, launched in 2002 by Thermo Fisher Scientific, was the first commercially available software for low-resolution data.[32] DeNovoX delivers complete and/or partial peptide sequences (sequence tags). Each output sequence comes with a probability indicating how likely is for the sequence to have been obtained by chance. The software implements an algorithm based on probabilistic inference.


DeNoS is part of the software tool Proteinmatching Analysis Software (PAS) which in turn is part of the software package Medicwave Bioinformatics Suite (MBS).[33][34]

DeNoS performs sequencing of peptides with high reliability, using all information from CAD and ECD spectra. It is a hierarchal algorithm. In the first step fragments that are confirmed in both CAD and ECD (so called Golden Complementary Pairs) along with fragments that are only found in CAD (so called Complementary Pairs) are used. After that, step-by-step fragments with low reliability are used. In the last step, if the peptide is still not fully sequenced, the software uses a trivial application from the graph theory to sequence the remaining peptide parts with "unreliable" fragments. It uses DTA files as input.


PEAKS de novo automatically provides a complete sequence for each peptide, confidence scores on individual amino acid assignments, simple reporting for high-throughput analysis, and greater knowledge for scientifically sensitive, in-depth investigations.[35] A de novo, manually assisted mode, is available for users who wish to tweak/optimize their results further. Automated de novo sequencing on an entire LC run processed data faster than 1 spectra per second.[36]


Lutefisk is software for the de novo interpretation of peptide CID spectra.[37]

Homology Searching Algorithms[edit]


For the identification of proteins using MS/MS, de novo sequencing software computes one or several possible amino acid sequences (called sequence tags) for each MS/MS spectrum.[38][39] Those tags are then used to match, accounting amino acid mutations, the sequences in a protein database. If the de novo sequencing gives correct tags, the homologs of the proteins can be identified by this approach and software such as MS-BLAST is available for the matching. The most common error is that a segment of amino acids is replaced by another segment with approximately the same masses. The SPIDER algorithm matches sequence tags with errors to database sequences for the purpose of protein and peptide identification.[40] BLAST (and similar) homology approaches can fail when confronted with common sequence substitutions such as I/L, N/GG, SAT/TAS. SPIDER is designed to avoid these problems. SPIDER can be used in conjunction with PEAKS mass spectrometry data analysis software.

MS/MS peptide quantification[edit]

OpenMS / TOPP[edit]

OpenMS is a software C++ library for LC-MS/MS data management and analysis.[41] It offers an infrastructure for the development of mass spectrometry related software. OpenMS is free software available under the 2-clause BSD licence (previously under the LGPL).

TOPP - The OpenMS Proteomics Pipeline - is a set of small applications that can be chained to create analysis pipelines tailored for a specific problem. TOPP is developed using the datastructures and algorithms provided by OpenMS. TOPP is free software available under the 2-clause BSD licence (previously under the LGPL). TOPP provides ready-to-use applications for peak picking, the finding of peptides features (including de-isotoping), their quantitation and interfaces for most of the database search engines. It supports label-free and isotopic-label based quantification (such as iTRAQ). Furthermore, the TOPP tools provide applications for SILAC pair identification and quantification and it includes a metabolomics analysis pipeline.

TOPPView is a viewer software that allows visualization of mass spectrometric data on MS1 and MS2 level as well as in 3D; additionally it also displays chromatographic data from SRM experiments (in version 1.10). TOPPAS is a graphic integrated workflow manager that allows chaining the TOPP tools into a reusable and reproducible workflow.[42]

OpenMS and TOPP are a joint project of the Algorithmic Bioinformatics group at the Free University of Berlin and the Applied Bioinformatics group at Tübingen University. OpenMS was released in summer 2007 with version 1.0 and as of 2013, 10 further versions have been released (the latest being 1.10 in March 2013).[43]


MaxQuant is a proprietary software for quantitative proteomics developed by Jürgen Cox and others at the Max Planck Institute of Biochemistry in Martinsried, Germany.[44][45] The software is written in C# and released as freeware under the "MaxQuant Freeware Software License Agreement".

The software allows the analysis of label free and SILAC based proteomics experiments. It also provides its own search engine called Andromeda.[46]


Spectronaut is a proprietary software for quantitative proteomics developed by Biognosys AG (Schlieren, Switzerland) on the basis of the mProphet algorithm.[47] The software is written in C# and was first released in January 2013, with two further releases in August 2013 and March 2014, respectively (the latest version being 5.0).[48] [49] It is freely available for academic use.[50]

Spectronaut allows the targeted analysis of data independent acquisition (DIA) data sets for label-free peptide quantitation, also called SWATH acquisition or hyper reaction monitoring (HRM).[51] In contrast to data dependent acquisition (DDA) also known as shotgun proteomics where peptide quantitation is performed on a precursor level (MS1), DIA quantitation is performed on the fragment ion level (MS2), which can be advantageous for certain applications.[52]

The list of Spectronaut features includes: spectral library generation, fully automated peak detection and in-run calibration, peak scoring, estimates of false discovery rates, quality control perspective, interference detection, fragment ion report, multiple run data visualization. Spectronaut currently supports raw data formats from instruments produced by AB Sciex and Thermo Scientific.


MassChroQ [53] is a free software for peptide quantification developed by PAPPSO (Gif sur Yvette, France).

The software is written in C++ and released under the GNU General Public License.

It allows the analysis of label free or various isotopic labeling methods (SILAC, ICAT, N-15, C-13 ...), works with high and low resolution spectrometer systems, supports complex data treatments as peptide or protein fractionation prior to LC-MS analysis (SCX, SDS-PAGE, etc.).

Other software[edit]


AnalyzerPro is a proprietary software by SpectralWorks Limited. It is a vendor independent software application for processing mass spectrometry data. Using proprietary algorithms, AnalyzerPro can analyze both GC-MS and LC-MS using both qualitative and quantitative data processing. It is widely used for metabolomics data processing using MatrixAnalyzer for the comparison of multiple data sets.


Analyst is a proprietary software by AB Sciex, a division of The Danaher Corporation.


RemoteAnalyzer is a proprietary software by SpectralWorks Limited. It is a vendor independent 'Open Access' client/server based solution to provide a walk-up and use LC-MS and GC-MS data system. Instrument control and data processing support for multiple vendors' hardware is provided.

ESIprot 1.0 / ESIprot Online[edit]

Electrospray ionization (ESI) mass spectrometry (MS) devices with relatively low resolution are widely used for proteomics and metabolomics. Ion trap devices like the Agilent MSD/XCT ultra or the Bruker HCT ultra are typical representatives. However, even if ESI-MS data of most of the naturally occurring proteins can be measured, the availability of data evaluation software for such ESI protein spectra with low resolution is quite limited.

ESIprot 1.0 enables the charge state determination and molecular weight calculation for low resolution electrospray ionization (ESI) mass spectrometry (MS) data of proteins.[54][55]

Whereas ESIprot 1.0 is written in Python (GPL v3 License), ESIprot Online is a freely accessible PHP web application.


Spectrolyzer is a Microsoft Windows based software package that provides bioinformatics data analysis tools for different mass spectrometers.

Spectrolyzer focuses on finding protein biomarkers and detecting protein deviations. Spectrolyzer is compatible with most mass spectrometers, i.e. Tandem MS (MS/MS), MALDI-TOF MS and SELDI-TOF MS. Spectrolyzer is a software package that contains several other software tools where each of them focuses on analyzing data from a certain mass spectrometry technology.[56]


ProTrawler is an LC/MS data reduction application that reads raw mass spectrometry vendor data (from a variety of well-known instrument companies) and creates lists of {mass, retention time, integrated signal intensity} triplets summarizing the LC/MS chromatogram. The measurements are reported with errors, which are essential for performing dynamic binning for comparisons between data sets. ProTrawler operates in two modes: a highly visual hands-on (expert) mode for the development of parameters used in data reduction and a fully automated mode for moving through many chromatograms in an automated fashion. ProTrawler's data reduction work flow includes background elimination, noise estimation, peak shape estimation, shape deconvolution, and isotopic and charge-state list deconvolution (factoring in errors and signal noise) to give a list features. Typically, ProTrawler reduces 1 GB of raw data to 10 Kb of processed results with a detection sensitivity of three orders of magnitude in 25% of the data acquisition time. No formal Bayesian methods are used, but sophisticated statistical inference is employed throughout. ProTrawler has been used for bacterial protein biomarker discovery efforts as well as for IPEx-related applications.


Regatta is an LC/MS list comparison application that works hand-in-hand with ProTrawler (but accepts input in Excel/CSV form) to provide an environment for LC/MS results list filtering and normalization {mass, retention time, integrated intensity} lists. To accomplish this, Regatta solves the famous Transitive Property of Equality problem that arises in the comparison of analytical list data, viz., if Peak A in Sample A overlaps Peak B in Sample B, and Peak B overlaps Peak C in Sample C, but Peak A does not overlap Peak C, then can we say that we've measured the same analyte in all three samples or not? Regatta also implements multivariate analysis, e.g., hierarchical cluster analysis, principal component analysis, as well as statistical tests, e.g., coefficients of variation. Input is not necessarily restricted to output from ProTrawler. Regatta has been used for successfully for biomarker discovery.

OmicsHub Proteomics[edit]

OmicsHub Proteomics combines a LIMS for mass spec information management with data analysis functionalities on one platform. The software allows the user to import data files from multiple instruments, and conduct protein peak detection, filtering, protein identification, annotation and exportation of formatted reports. It is a single server platform with a web interface for multiuser access and is proprietary software of Integromics.


PIQMIe[57] or Proteomics Identifications & Quantitations Data Management & Integration Service is a web-based tool that aids in reliable and scalable data management, analysis and visualization of semi-quantitative (SILAC) proteomics experiments. PIQMIe takes the results of the MaxQuant/Andromeda MS data processing and readily integrates the result files, in particular peptide and (non-redundant) protein identifications and quantitations from one or more experiments, into a single database file for dedicated data analyses (e.g. in R) and user-driven queries. The web server provides users with a concise summary of their proteomics experiments as well as with interactive tools to aid in the assessment of the experiments and in the identification of proteins of interest.


The ProteoWizard link Library and Tools are a set of modular and extensible open-source, cross-platform tools and software libraries that facilitate proteomics data analysis.[58]

VIPER and Decon2LS[edit]

The "Proteomics Research Resource for Integrative Biology" distributes software tools (VIPER,[59] Decon2LS, and others) that can be used to perform analysis of accurate mass and chromatography retention time analysis of LC-MS features. Sometimes referred to as the Accurate Mass and Time tag approach (AMT tag approach) generally these tools are used for Proteomics.

Mass Frontier[edit]

Mass Frontier is a software tool for interpretation and management of mass spectra of small molecules. Computer methods for interpretation of mass spectral data in Mass Frontier centre on three fundamental methodologies: library search techniques, expert system procedures and classification methods. Mass Frontier uses automated generation of possible fragments at an expert level, including complete fragmentation and rearrangement mechanisms, starting from a user-supplied chemical structure. This software contains an expert system that automatically extracts a decomposition mechanism for each fragmentation reaction in the fragmentation library and determines the compound class range that the mechanism can be applied to. The expert system applies database mechanisms to a user provided structure and automatically predicts the fragmentation reactions for a given compound. The knowledge base uses around 30,000 fragmentation schemes that contain around 100,000 reactions collected from mass spectrometry literature.

Mass Frontier also incorporates an automated system for detecting chromatographic components in complex GC/MS, LC/MS or MSn runs and extracting mass spectral signals from closely coeluting components (deconvolution).

Classification methods include principal component analysis, neural networks and fuzzy clustering.


The program massXpert[60][61] is a graphical user interface-based (GUI) software for simulating and analyzing mass spectrometric data obtained on known bio-polymer sequences. The software runs in an identical manner on MS-Windows, Mac OS X and GNU/Linux/Unix platforms. massXpert is not for identifying proteins, but is useful when characterizing biopolymer sequences (post-translational modifications, intra-molecular cross-links...). It comprises four modules, all available in the same program interface: XpertDef will let the user define any aspect of the polymer chemistry at hand (atoms/isotopes, monomers, modifications, cleavage agents, fragmentation patterns, cross-links, default ionization...) ; XpertCalc is a desktop calculator with which anything mass is calculatable (the calculation is polymer chemistry definition-aware and is fully programmable; m/z ratios are computable with automatic replacement of the ionization agent ; isotopic patterns are computable starting from an elemental composition, with the possibility to specify the resolution of the mass spectrometer) ; XpertEdit is the central part of the software suite. In it reside all the simulation/analysis functionalities, like polymer sequence editing, sequence/monomer chemical modifications, cleavages, fragmentations, elemental/monomeric composition determinations, pI/net charge calculations, arbitrary mass searches in the polymer sequence; XpertMiner is a module in which it is possible to import lists of (m/z, z) pairs to submit them to any kind of calculation. Typically this module will be used to apply a formula to all the pairs in a single strike, or to perform matches between two lists, one from a simulation and another from the mass spectrometric data actually gotten from the mass spectrometer. All the simulations' results can be exported in the form of text either to the clipboard or to text files.


mMass[62] presents open source multi-platform package of tools for precise mass spectrometric data analysis and interpretation. It is written in Python language and is portable to different computer platforms. It is released under the GNU General Public License, so it can be modified or extended by modules for specific needs.

ms & multiMS-toolbox[edit]

ms-alone and multiMS-toolbox[63] is a tool chain for mass spectrometry data peak extraction and statistical analysis. The ms-alone utility runs mass spectrometry data preprocessing and peak extraction with its peak-shape matching function for exact identification of peak maximum and area, exporting also isotope group number or approximation function parameters for later processing. The multiMS-toolbox R software tool runs matching and reconstruction of unmatched peak data and the detailed statistical analysis on them and their isotope groups. Both tools are released under the GNU General Public License, ms-alone utility is also available as Windows binaries.


ProteoIQ is commercial software for the post-analysis of Mascot, SEQUEST, or X!Tandem database search results. The software provides the means to combine tandem mass spectrometry database search results derived from different instruments/platforms. Since the primary goal of many proteomics projects is to determine thresholds which identify as many real proteins as possible while encountering a minimal number of false positive protein identifications, ProteoIQ incorporates the two most common methods for statistical validation of large proteome datasets: the false discovery rate and protein probability approaches.[64][65][66] For false discovery rate calculations, ProteoIQ incorporates proprietary Protein Validation Technology (ProValT) algorithms licensed from the University of Georgia Research Foundation. Protein and peptide probabilities are generated by independent implementations of the Peptide Prophet and Protein Prophet algorithms. In ProteoIQ, protein relative quantitation is performed via spectral counting, standard deviations are automatically calculated across replicates, and spectral count abundances are normalized between samples. Integrated comparison functions allow user to quickly compare proteomic results across biological samples.

PatternLab for proteomics[edit]

PatternLab is a free software (gratis software, not Free Software according to the Free Software Foundation's definition) for post-analysis of SEQUEST or ProLuCID database search results filtered by DTASelect or Census. It offers several tools that combine false discovery rates with statistical tests and protein fold changes to pinpoint differentially expressed proteins, find trend of proteins having similar expression profiles in time course experiments, generate area proportional Venn diagrams, and even deconvolute mass spectra to enable analysis of top-down / middle-down proteomic data (YADA module). Results can also be analyzed using its Gene Ontology Explorer module.[67]


MolAna was developed by Phenomenome Discoveries Inc, (PDI) for use in IONICS Mass Spectrometry Group's 3Q Molecular Analyzer, Triple quadrupole mass spectrometer

Chromeleon 7[edit]

Chromeleon 7 is a proprietary software by Thermo Fisher Scientific used with mass spectrometry instruments, as well as chromatography instruments.


Xcalibur is a proprietary software by Thermo Fisher Scientific used with mass spectrometry instruments.

LabSolutions LCMS[edit]

LabSolutions LCMS is a proprietary software by Shimadzu Corporation used with mass spectrometry and HPLC instruments.


MassCenter is a proprietary software by JEOL used with mass spectrometry instruments like The JMS AccuTOF T100LC.


MassLynx is a proprietary software by Waters Corporation.


TurboMass is proprietary GC/MS software by PerkinElmer.


MSight is a free software for mass spectrometry imaging developed by the Swiss Institute of Bioinformatics.[68]


Spectromania is a commercial software for analysis and visualization of mass spectrometric data.[69]


Peacock is an open source Mac OS X application developed by Johan Kool that can be used to interpret gas-chromatography/mass-spectrometry (GC/MS) data files.[70]


MSGraph is an open source mass spectrometry software working in MS-DOS.[71]


OpenChrom is an open source chromatography and mass spectrometry software. It can be extended using plug-ins and is available for several operating systems (Microsoft Windows, Linux, Unix, Mac OS X) and processor architectures (x86, x86_64, ppc).[72] Converters for the native access of various data files are available, e.g. converters for mzXML, netCDF, Agilent, Finnigan and Varian file formats (however some of the converts are only available under a proprietary licence).[73]


ms2mz is a free, simple utility for converting between mass spectrometer file formats. The most common use of ms2mz is to convert proprietary binary files to MGF peak list files. This is a handy way to prepare files for upload to Proteome Cluster.[74]


Quantinetix (by ImaBiotech) is software used with mass spectrometry imaging instruments.[75] It is designed to quantify and normalize MS images in various study types, for example ADMET - DMPK (absorption, distribution, metabolism, elimination, toxicology - drug metabolism and pharmacokinetics), toxicity, proteomics, lipidomics, and in conjunction with QWBA (quantitative whole body autoradiography). The software is compatible with a variety of MSI instruments, including Bruker, Sciex, Thermo and with iMZML.

Advanced Chemistry Development[edit]

Advanced Chemistry Development (ACD/Labs) is offering several, commercial solutions for the interpretation of MS and xC/MS data. Besides for processing, the software is used for spectrum/structure matching, identification of known and unknown metabolites, as well as for the identification of compounds through spectral comparison. The software is able to create fully searchable spectral databases keeping structures with the data. Commercial reference databases for MS are available. The software is able to read most of the vendor specific raw data formats, allowing to use one GUI for different data.


pymzML[76] is an open-source Python module to interface mzML data in Python based on cElementTree[77] with additional tools for MS-informatics. It offers access to mass spectrometry data by providing a fast parser for mzML data.[78]


Proteomatic is a free data processing pipeline created for the purpose of evaluating mass spectrometric proteomics experiments. Functionality is provided via scripts, and these scripts can be chained together into a high-throughput data processing pipeline. All functionality can be used either from the command-line or via a graphical user interface.[79]


SimGlycan is a proprietary software by PREMIER Biosoft which predicts the structure of glycans and glycopeptides using mass spectrometry data. The software accepts the experimental MS/MS data, matches them with its own database of theoretical fragments and generates a list of probable candidate structures. Each structure is scored to reflect how closely it matches your experimental data. Apart from the structural information, other biological information for the probable molecular structures such as the glycan class (N-Linked, O-Linked, heparin, lipopolysaccharide etc.), reaction, pathway and enzyme are also made available for easy reference in case of structural elucidation of glycans while in the case of glycopeptide qualitative analysis, information such as Protein ID, Protein Name, Source, Classification, Class, peptide sequence, peptide mass etc. are made available for identified glycopeptides.


mspire is an open-source mass spectrometry informatics developers toolbox written in ruby. Includes an mzML reader/writer, in-silico digestion and isotopic pattern calculation etc. Submodules such as mspire-lipidomics, mspire-sequest, and mspire-simulator extend the functionality.[80]


Universal Mass Calculator (UMC)[81] for Windows written in C++ is a proprietary toolbox for calculating relevant information from sum formulae, e.g isotope distribution, mass differences, mass deviations and mass/isotope information of the elements, degree of deuteration.


StavroX is a proprietary, free-to-use software to identify cross-linked peptides from mass spectrometric data written in Java. It can be used for a wide variety of cross linkers and proteases used in the cross linking MS experiment. It compares theoretical peptide-peptide cross link combinations for the analyzed proteins to MS/MS data.[82]

SCiLS Lab[edit]

SCiLS Lab is a proprietary software for statistical analysis of MALDI imaging mass spectrometry data. SCiLS Lab integrates with the Bruker MALDI imaging solution.


The Scaffold Suite products, developed by Proteome Software Inc. since 2008, are a proprietary suite of proteomics tools for analyzing spectra, peptides and proteins across multiple samples. It is a vendor independent software application for processing mass spectrometry search engine data. X! Tandem is integrated for supplementary search engine validation. The software supports adjustable protein identification probability thresholds, FDR thresholding, PTM site assignment, quantitation of label-free and labeled peptides (including iTRAQ, TMT, and SILAC), Gene Ontology annotation, as well as a variety of statistical tests with accompanying charts and graphs. Free Viewers are available to control and share data between users.


MSiReader is a free, vendor-neutral interface built on the Matlab platform designed to view and perform data analysis of mass spectrometry imaging (MSI) data.[83] A standalone version of MSiReader for which Matlab is not required is also available. MSiReader is compatible with the following file formats: imzML (continuous and processed), img (Analyze 7.5), mzXML and ASCII.


Mass++ is free analysis software for mass spectrometry which has been developed by Eisai Co. Ltd. and Shimadzu Corp. supported by the Japanese public foundations since 2005. It can load files with open-formats (mzXML, mzML, AIA(netCDF)) and some instrument vendors formats. Mass++ has a number of functions for multi-OMICS, in addition, users can develop and add original functions as Mass++ plug-ins.

File formats[edit]

See also[edit]

  • SIMION, an ion optics simulation program


  1. ^ Changjiang Xu, Bin Ma, Software for Computational Peptide Identification from MS-MS data[dead link] Drug Discovery Today, Volume 11, Numbers 13/14, July 2006, p 595-600.
  2. ^ Jimmy K. Eng, Ashley L. McCormack, and John R. Yates, III (1994). "An Approach to Correlate Tandem Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database". J Am Soc Mass Spectrom 5 (11): 976–989. doi:10.1016/1044-0305(94)80016-2. PMID 24226387. 
  3. ^ Perkins, David N.; Pappin, Darryl J. C.; Creasy, David M.; Cottrell, John S. (1999). "Probability-based protein identification by searching sequence databases using mass spectrometry data". Electrophoresis 20 (18): 3551–67. doi:10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2. PMID 10612281. 
  4. ^ Liang, C; Smith, JC; Hendrie, Christopher (2003). A Comparative Study of Peptide Sequencing Software Tools for MS/MS. American Society for Mass Spectrometry. 
  5. ^ "X! Tandem Project". The Global Proteome Machine Organization. Retrieved 2009-10-21. 
  6. ^ "Calculation of protein expectation value from peptide expectation values in X! Tandem" (PDF). GPM Documents. 
  7. ^ "X! series search engine file format" (PDF). GPM Documents. 
  8. ^ "X!!Tandem". Retrieved 2009-11-16. 
  9. ^ Dexter Duncan and Andrew Link, Vanderbilt University School of Medicine. "Parallel Tandem Project". The Global Proteome Machine Organization. Retrieved 2009-11-16. 
  10. ^ Colinge, Jacques; Masselot, Alexandre; Giron, Marc; Dessingy, Thierry; Magnin, Jérôme (2003). "OLAV: Towards high-throughput tandem mass spectrometry data identification". Proteomics 3 (8): 1454–63. doi:10.1002/pmic.200300485. PMID 12923771. 
  11. ^ "OMSSA ms/ms search engine". Retrieved 2011-09-27. 
  12. ^ Geer, Lewis Y.; Markey, Sanford P.; Kowalak, Jeffrey A.; Wagner, Lukas; Xu, Ming; Maynard, Dawn M.; Yang, Xiaoyu; Shi, Wenyao et al. (2004). "Open Mass Spectrometry Search Algorithm". Journal of Proteome Research 3 (5): 958–64. doi:10.1021/pr0499491. PMID 15473683. 
  13. ^ "RAId MS/MS search engine". QMBP NCBI NLM NIH. Retrieved 2008-01-01. 
  14. ^ Alves, Gelio; Ogurtsov, Aleksey Y.; Yu, Yi-Kuo (2007). "RAId_DbS: peptide identification using database searches with realistic statistics". Biol Direct 2: 25. doi:10.1186/1745-6150-2-25. PMC 2211744. PMID 17961253. 
  15. ^ Alves, Gelio; Ogurtsov, Aleksey Y.; Yu, Yi-Kuo (2010). "RAId_aPS: MS/MS analysis with multiple scoring functions and spectrum-specific statistics". In Uversky, Vladimir N. PLoS ONE 5 (11): e15438. Bibcode:2010PLoSO...515438A. doi:10.1371/journal.pone.0015438. PMC 2982831. PMID 21103371. 
  16. ^ Alves, Gelio; Ogurtsov, Aleksey Y.; Yu, Yi-Kuo (2008). "RAId_DbS: mass-spectrometry based peptide identification web server with knowledge integration". BMC Genomics 9: 505. doi:10.1186/1471-2164-9-505. PMC 2605478. PMID 18954448. 
  17. ^ "Software from the Tabb Lab". Retrieved 2011-09-27. 
  18. ^ Tabb, David L.; Fernando, Christopher G.; Chambers, Matthew C. (2007). "MyriMatch:  Highly Accurate Tandem Mass Spectral Peptide Identification by Multivariate Hypergeometric Analysis". Journal of Proteome Research 6 (2): 654–61. doi:10.1021/pr0604054. PMC 2525619. PMID 17269722. 
  19. ^ "greylag: software for tandem mass spectrum peptide identification". Retrieved 2009-11-16. 
  20. ^ "Protein Metrics Inc.". Protein Metrics Incorporated. Retrieved 2011-05-24. 
  21. ^ Bern, Marshall; Cai, Yuhan; Goldberg, David (2007). "Lookup Peaks: A Hybrid of de Novo Sequencing and Database Search for Protein Identification by Tandem Mass Spectrometry". Analytical Chemistry 79 (4): 1393–1400. doi:10.1021/ac0617013. PMID 17243770. 
  22. ^ Bern, Marshall; Goldberg, David (2008). "Improved Ranking Functions for Protein and Modification-Site Identifications". Journal of Computational Biology 15 (7): 705–719. doi:10.1089/cmb.2007.0119. PMID 18651800. 
  23. ^ "Inspect and MS-Alignment". Retrieved 2009-11-16. 
  24. ^ Liu, Jian; Erassov, Alexandre; Halina, Patrick; Canete, Myra; Nguyen, Nguyen Dinh; Chung, Clement; Cagney, Gerard; Ignatchenko, Alexandr; Fong, Vincent; Emili, Andrew (2008). "Sequential Interval Motif Search: Unrestricted Database Surveys of Global MS/MS Data Sets for Detection of Putative Post-Translational Modifications". Analytical Chemistry 80 (20): 7846–54. doi:10.1021/ac8009017. PMID 18788753. 
  25. ^ "EMILILAB TOOLS". Retrieved 2009-11-16. [dead link]
  26. ^ Yadav, Amit Kumar; Kumar, Dhirendra; Dash, Debasis (2011). "Amit Kumar Yadav, Dhirendra Kumar, and Debasis Dash. MassWiz: A Novel Scoring Algorithm with Target-Decoy Based Analysis Pipeline for Tandem Mass Spectrometry". J. Proteome Res. 10 (5): 2154–2160. doi:10.1021/pr200031z. PMID 21417338. Retrieved 2011-09-27. 
  27. ^ "MassWiz Project at Sourceforge". Retrieved 2011-09-27. 
  28. ^ "MassWiz Webserver". Retrieved 2011-09-27. 
  29. ^ Novak, Jiri; Sachsenberg, Timo; Hoksza, David; Skopal, Tomas; Kohlbacher, Oliver (2013). "On Comparison of SimTandem with State-of-the-Art Peptide Identification Tools, Efficiency of Precursor Mass Filter and Dealing with Variable Modifications". Journal of Integrative Bioinformatics 10 (3). 
  30. ^ "". Retrieved 2013-11-16. 
  31. ^ Bartels, Christian (31 May 1990). "Fast algorithm for peptide sequencing by mass spectroscopy". Biological Mass Spectrometry 19 (6): 363–368. doi:10.1002/bms.1200190607. 
  32. ^ thermo finnigan introduces denovox - Search results from HighBeam Business
  33. ^ Savitski, Mikhail M.; Nielsen, Michael L.; Kjeldsen, Frank; Zubarev, Roman A. (2005). "Proteomics-Grade de Novo Sequencing Approach". Journal of Proteome Research 4 (6): 2348–54. doi:10.1021/pr050288x. PMID 16335984. 
  34. ^ MedicWave PAS[dead link]
  35. ^ Ma, Bin; Zhang, Kaizhong; Hendrie, Christopher; Liang, Chengzhi; Li, Ming; Doherty-Kirby, Amanda; Lajoie, Gilles (2003). "PEAKS: powerful software for peptidede novo sequencing by tandem mass spectrometry". Rapid Communications in Mass Spectrometry 17 (20): 2337–42. doi:10.1002/rcm.1196. PMID 14558135. 
  36. ^ Tannu, Nilesh S; Hemby, Scott E (2007). "De novo protein sequence analysis of Macaca mulatta". BMC Genomics 8: 270. doi:10.1186/1471-2164-8-270. PMC 1965481. PMID 17686166. 
  37. ^ "Lutefisk - de novo MS/MS Sequencing". J. Alex Taylor. Retrieved 2009-11-16. 
  38. ^ Yonghua Han; Bin Ma; Kaizhong Zhang (2004). "SPIDER: software for protein identification from sequence tags with de novo sequencing error". Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004. pp. 206–15. doi:10.1109/CSB.2004.1332434. ISBN 0-7695-2194-0. 
  39. ^ Ma, Bin (2007). "Search for the Undiscovered Peptide; Using de novo sequencing and sequence tag homology search to improve protein characterization". Biotechniques Journal 42 (5). 
  40. ^ PEAKS: SPIDER (Sequence Homology Search Tool)[dead link]
  41. ^ "OpenMSt". Retrieved 2009-11-16. 
  42. ^ Junker, J.; Bielow, C.; Bertsch, A.; Sturm, M.; Reinert, K.; Kohlbacher, O. (2012). "TOPPAS: A Graphical Workflow Editor for the Analysis of High-Throughput Proteomics Data". Journal of Proteome Research 11 (7): 3914–3920. doi:10.1021/pr300187f. PMID 22583024.  edit
  43. ^ OpenMS releases
  44. ^ "MaxQuant". Retrieved 1 July 2013. 
  45. ^ Cox, J.; Mann, M. (Dec 2008). "MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification". Nat Biotechnol 26 (12): 1367–72. doi:10.1038/nbt.1511. PMID 19029910. 
  46. ^ Cox, J.; Neuhauser, N.; Michalski, A.; Scheltema, RA.; Olsen, JV.; Mann, M. (Apr 2011). "Andromeda: a peptide search engine integrated into the MaxQuant environment". J Proteome Res 10 (4): 1794–805. doi:10.1021/pr101065j. PMID 21254760. 
  47. ^ Reiter, L; et al. (2011). "mProphet: automated data processing and statistical validation for large-scale SRM experiments". Nat Methods 8 (5): 430–435. doi:10.1038/nmeth.1584. 
  48. ^ "New Spectronaut software now enables Thermo Scientific DIA | Biognosys AG". Retrieved 3 February 2014. 
  49. ^ New Spectronaut release supports spectral library generation | Biognosys AG
  50. ^ "Spectronaut | Biognosys AG". Retrieved 3 February 2014. 
  51. ^ Law, KP; Lim YP (2013). "Recent advances in mass spectrometry: data independent analysis and hyper reaction monitoring". Expert Rev Proteomics 10 (6): 551–566. doi:10.1586/14789450.2013.858022. 
  52. ^ Vowinckel, J; et al. (2013). "The beauty of being (label)-free: sample preparation methods for SWATH-MS and next-generation targeted proteomics". F1000Research 2: 272. doi:10.12688/f1000research.2-272.v1. 
  53. ^ Valot, Benoît; Langella, Olivier; Nano, Edlira; Zivy, Michel (2011). "MassChroQ: A versatile tool for mass spectrometry quantification". Proteomics 11 (17): 3572–3577. doi:10.1002/pmic.201100120. PMID 21751374. 
  54. ^ Robert Winkler. "ESIprot Homepage". Retrieved 2011-09-27. 
  55. ^ Winkler, Robert (2010). "ESIprot: a universal tool for charge state determination and molecular weight calculation of proteins from electrospray ionization mass spectrometry data". Rapid Communications in Mass Spectrometry 24 (3): 285–94. doi:10.1002/rcm.4384. PMID 20049890. 
  56. ^ "Spectrolyzer". Retrieved 2011-09-27. 
  57. ^ Kuzniar, A.; Kanaar, R. (2014). "PIQMIe: a web server for semi-quantitative proteomics data management and analysis.". Nucleic Acids Res 42 (W1): W100–W106. doi:10.1093/nar/gku478. PMID 24861615. 
  58. ^ ProteoWizard: Home
  59. ^ Monroe, M. E.; Tolić, N.; Jaitly, N.; Shaw, J. L.; Adkins, J. N.; Smith, R. D. (2007). "VIPER: an advanced software package to support high-throughput LC-MS peptide identification". Bioinformatics 23 (15): 2021–3. doi:10.1093/bioinformatics/btm281. PMID 17545182. 
  60. ^ Filippo Rusconi, Ph.D. (ed.). "". massXpert version 2.0.2 User Manual (PDF). 
  61. ^ Rusconi, F. (2009). "massXpert 2: a cross-platform software environment for polymer chemistry modelling and simulation/analysis of mass spectrometric data". Bioinformatics 25 (20): 2741–2. doi:10.1093/bioinformatics/btp504. PMID 19740912. 
  62. ^ "Open Source Mass Spectrometry Tool". mMass. Retrieved 2011-09-27. 
  63. ^ "ms-alone & multiMS-toolbox". ms-alone & multiMS-toolbox. Retrieved 2014-06-04. 
  64. ^ Weatherly, D. B.; Atwood Ja, 3rd; Minning, TA; Cavola, C; Tarleton, RL; Orlando, R (2005). "A Heuristic Method for Assigning a False-discovery Rate for Protein Identifications from Mascot Database Search Results". Molecular & Cellular Proteomics 4 (6): 762–72. doi:10.1074/mcp.M400215-MCP200. PMID 15703444. 
  65. ^ Keller, Andrew; Nesvizhskii, Alexey I.; Kolker, Eugene; Aebersold, Ruedi (2002). "Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database Search". Analytical Chemistry 74 (20): 5383–92. doi:10.1021/ac025747h. PMID 12403597. 
  66. ^ Nesvizhskii, AI; Keller, A; Kolker, E; Aebersold, R (2003). "A statistical model for identifying proteins by tandem mass spectrometry". Analytical chemistry 75 (17): 4646–58. doi:10.1021/ac0341261. PMID 14632076. 
  67. ^ Carvalho, Paulo C; Fischer, Juliana SG; Chen, Emily I; Yates, John R; Barbosa, Valmir C (2008). "PatternLab for proteomics: a tool for differential shotgun proteomics". BMC Bioinformatics 9: 316. doi:10.1186/1471-2105-9-316. PMC 2488363. PMID 18644148. 
  68. ^ Palagi, Patricia M.; Walther, Daniel; Quadroni, Manfredo; Catherinet, SéBastien; Burgess, Jennifer; Zimmermann-Ivol, Catherine G.; Sanchez, Jean-Charles; Binz, Pierre-Alain; Hochstrasser, Denis F.; Appel, Ron D. (2005). "MSight: An image analysis software for liquid chromatography-mass spectrometry". Proteomics 5 (9): 2381–4. doi:10.1002/pmic.200401244. PMID 15880814. 
  69. ^ Zucht, Hans-Dieter; Lamerz, Jens; Khamenia, Valery; Schiller, Carsten; Appel, Annette; Tammen, Harald; Crameri, Reto; Selle, Hartmut (2005). "Datamining Methodology for LC-MALDI-MS Based Peptide Profiling". Combinatorial Chemistry & High Throughput Screening 8 (8): 717–23. doi:10.2174/138620705774962481. 
  70. ^ "Peacock on Google Code". Retrieved 2011-09-27. 
  71. ^ "MSGraph on". Retrieved 2011-09-27. 
  72. ^ "OpenChrom on SourceForge". Retrieved 2011-09-27. 
  73. ^ "Agilent ChemStation plug-in on". Retrieved 2013-07-21. 
  74. ^ "ms2mz - A Utility for Converting Mass Spectrometry Files on". Retrieved 2011-12-27. 
  75. ^ MALDI Imaging - ImaBiotech - Accelerating discovery, driving innovation
  76. ^ Welcome to pymzml’s documentation — pymzML 0.7.4 documentation
  77. ^ The cElementTree Module
  78. ^ Bald, Till; Barth, Johannes; Niehues, Anna; Specht, Michael; Hippler, Michael; Fufezan, Christian (2012). "pymzML - Python module for high throughput bioinformatics on mass spectrometry data". Bioinformatics 28 (7): 1052–3. doi:10.1093/bioinformatics/bts066. PMID 22302572. 
  79. ^ Specht, Michael; Kuhlgert, Sebastian; Fufezan, Christian; Hippler, Michael (2011). "Proteomics to go: Proteomatic enables the user-friendly creation of versatile MS/MS data evaluation workflows". Bioinformatics 27 (8): 1183–1184. doi:10.1093/bioinformatics/btr081. PMID 21325302. 
  80. ^ Prince, J. T.; Marcotte, E. M. (2008). "Mspire: Mass spectrometry proteomics in Ruby". Bioinformatics 24 (23): 2796–2797. doi:10.1093/bioinformatics/btn513. PMC 2639276. PMID 18930952.  edit
  81. ^ "WWU Münster - Organische Chemie - mass spectrometry - links". Retrieved 8 August 2013. 
  82. ^ Götze, Michael; Pettelkau J; Schaks S; Bosse K; Ihling CH; Krauth F; Fritzsche R; Kühn U; Sinz A. (January 2012). "StavroX--a software for analyzing crosslinked products in protein interaction studies". J Am Soc Mass Spectrom. 23 (1): 76–87. doi:10.1007/s13361-011-0261-2. PMID 22038510. Retrieved 6 November 2013. 
  83. ^ Robichaud, Guillaume; Garrard, Kenneth P.; Barry, Jeremy A.; Muddiman, David C. (March 2013). "MSiReader: An Open-Source Interface to View and Analyze High Resolving Power MS Imaging Files on Matlab Platform". Journal of The American Society for Mass Spectrometry (Springer) 24 (5): 718–721. doi:10.1007/s13361-013-0607-z. 

External links[edit]