Dinosaur: A Refined Open-Source Peptide MS Feature Detector
Open Access
- 8 June 2016
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 15 (7), 2143-2151
- https://doi.org/10.1021/acs.jproteome.6b00016
Abstract
In bottom-up mass spectrometry (MS)-based proteomics, peptide isotopic and chromatographic traces (features) are frequently used for label-free quantification in data-dependent acquisition MS but can also be used for the improved identification of chimeric spectra or sample complexity characterization. Feature detection is difficult because of the high complexity of MS proteomics data from biological samples, which frequently causes features to intermingle. In addition, existing feature detection algorithms commonly suffer from compatibility issues, long computation times, or poor performance on high-resolution data. Because of these limitations, we developed a new tool, Dinosaur, with increased speed and versatility. Dinosaur can sample algorithm computations through quality-control plots, which we call a plot trail. From the evaluation of this plot trail, we introduce several algorithmic improvements that further improve the robustness and performance of Dinosaur. Dinosaur detected features for 98% of MS/MS identifications in a benchmark data set, whereas no other algorithm tested in this study exceeded 96% feature detection. Finally, we used Dinosaur to reimplement a published workflow for peptide identification in chimeric spectra, increasing chimeric identification from 26% to 32% over the standard workflow. Dinosaur is operating-system-independent and is freely available as open source at https://github.com/fickludd/dinosaur.
Funding Information
- Vetenskapsrådet (2008:3356, 621-2012-3559)
- European Commission (309831)
- Crafoordska Stiftelsen (2010 0892, 2014 02079)
- Knut och Alice Wallenbergs Stiftelse (2012.0178)
- Stiftelsen för Strategisk Forskning (FFL4)
- Stiftelsen Olle Engkvist Byggmästare
- Stiftelsen för Miljöstrategisk Forskning (Mistra Biotech)
This publication has 45 references indexed in Scilit:
- Large-scale inference of protein tissue origin in gram-positive sepsis plasma using quantitative targeted proteomics. Nature Communications, 2016
- A Human Interactome in Three Quantitative Dimensions Organized by Stoichiometries and Abundances. Cell, 2015
- The complete structure of the 55 S mammalian mitochondrial ribosome. Science, 2015
- MS-GF+ makes progress towards a universal database search tool for proteomics. Nature Communications, 2014
- OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nature Biotechnology, 2014
- TOPPAS: A Graphical Workflow Editor for the Analysis of High-Throughput Proteomics Data. Journal of Proteome Research, 2012
- Proteome-wide selected reaction monitoring assays for the human pathogen Streptococcus pyogenes. Nature Communications, 2012
- More than 100,000 Detectable Peptide Species Elute in Single Shotgun Proteomics Runs but the Majority is Inaccessible to Data-Dependent LC−MS/MS. Journal of Proteome Research, 2011
- Label-free Quantitative Proteomics Using Large Peptide Data Sets Generated by Nanoflow Liquid Chromatography and Mass Spectrometry. Molecular & Cellular Proteomics, 2006
- Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions. Journal of the American Society for Mass Spectrometry, 1995