MSFragger is an ultrafast database search tool for peptide identification in mass spectrometry-based proteomics. It has demonstrated excellent performance across a wide range of datasets and applications. The speed of MSFragger makes it particularly suitable for the analysis of large datasets (including timsTOF data), for enzyme unconstrained searches, and for ‘open’ database searches (with the precursor mass tolerance set to hundreds of Daltons) for identification of modified peptides.
MSFragger is implemented in the cross-platform Java programming language, and can be used three different ways:
- With FragPipe GUI (Graphical User Interface)
- Through ProteomeDiscoverer
- As a standalone Java executable (JAR) file
MSFragger writes output in either tabular or pepXML formats, making it fully compatible with downstream data analysis pipelines such as Trans-Proteomic Pipeline and Philosopher. See the complete documentation, including a list of Frequently Asked Questions. Example parameter files can be found here.
Supported instruments and file formats
mzML/mzXML: Data from any instrument in mzML/mzXML format can be used.
Thermo RAW: MSFragger supports direct reading Thermo RAW files. MSFragger ProteomeDiscoverer (PD) node is fully compatible with all downstream PD tools. FragPipe has limited support for RAW files (no MS1-based label-free quantification); conversion to mzML when running FragPipe is recommended.
Bruker TimsTOF: MSFragger support direct reading from Bruker timsTOF raw files (.d folder) and MGF files converted by Bruker DataAnalysis. Note that FragPipe currently supports for timsTOF data.
Whether you run it through FragPipe, PD, or stand-alone, you will need to download the latest MSFragger JAR file. See instructions for downloading or upgrading MSFragger.
The latest version of MSFragger was released on 2019-11-08. Check here for the full list of MSFragger versions and changes.
On Windows, the easiest way to run MSFragger is using FragPipe GUI. A tutorial on how to convert Thermo RAW files to mzML/mzXML (recommended for Thermo data to ensure full FragPipe functionality) can be found here. FragPipe tutorial can be found here.
FragPipe includes post-database search tool Philosopher (for downstream analysis with PeptideProphet and ProteinProphet), label-free quantification, FDR filtering, and report generation (at the PSM/ion/peptide/protein-levels). Additional tools (currently supporting Thermo data in mzML/mzXML format only) include DIA-Umpire SE module for DIA data, PTM-Shepherd for generating global PTM profiles, and SpectraST-based spectral library building module.
MSFragger and Philosopher (PeptideProphet) are also available as processing nodes in Proteome Discoverer (PD, Thermo Scientific). Currently, the MSFragger-PD node can be used in PD v2.2 and v2.3.
Please visit our PD-Nodes page for more information.
See the Launching MSFragger section of the Wiki page.
Complete analyses can be performed with the Philosopher pipeline, a command line tool: Tutorial.
For documentation on MSFragger (hardware requirements, search parameters, etc.), see MSFragger Documentation Wiki page.
Questions and Technical Support
See our Frequently Asked Questions (FAQ) page. Please post all questions/bug reports regarding MSFragger itself on the MSFragger GitHub page, or if more appropriate on FragPipe page or Philosopher page.
Requests for Collaboration
If you would like to propose a new collaboration that can take advantage of MSFragger and related tools, please contact us directly.
How to Cite
Kong AT, Leprevost FV, Avtonomov DM, Mellacheruvu D, Nesvizhskii AI. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nature Methods 14:513–520 (2017). Manuscript.
For other tools developed by the Nesvizhskii lab, see our website www.nesvilab.org
The pepXML files produced by MSFragger may has additional attributes (e.g.,
ion_mobility) no in the schema. According to our test, both PeptideProphet and Philosopher can process those additional attributes.