MetFusion: integration of compound identification strategies
Top Cited Papers
- 27 February 2013
- journal article
- Published by Wiley in Journal of Mass Spectrometry
- Vol. 48 (3), 291-298
- https://doi.org/10.1002/jms.3123
Abstract
Mass spectrometry (MS) is an important analytical technique for the detection and identification of small compounds. The main bottleneck in the interpretation of metabolite profiling or screening experiments is the identification of unknown compounds from tandem mass spectra. Spectral libraries for tandem MS, such as MassBank or NIST, contain reference spectra for many compounds, but their limited chemical coverage reduces the chance for a correct and reliable identification of unknown spectra outside the database domain. On the other hand, compound databases like PubChem or ChemSpider have a much larger coverage of the chemical space, but they cannot be queried with spectral information directly. Recently, computational mass spectrometry methods and in silico fragmentation prediction allow users to search such databases of chemical structures. We present a new strategy called MetFusion to combine identification results from several resources, in particular, from the in silico fragmenter MetFrag with the spectral library MassBank to improve compound identification. We evaluate the performance on a set of 1062 spectra and achieve an improved ranking of the correct compound from rank 28 using MetFrag alone, to rank 7 with MetFusion, even if the correct compound and similar compounds are absent from the spectral library. On the basis of the evaluation, we extrapolate the performance of MetFusion to the KEGG compound database. Copyright © 2013 John Wiley & Sons, Ltd.Keywords
This publication has 30 references indexed in Scilit:
- Environmental Mass Spectrometry: Emerging Contaminants and Current IssuesAnalytical Chemistry, 2011
- MassBank: a public repository for sharing mass spectral data for life sciencesJournal of Mass Spectrometry, 2010
- LC–high resolution MS in environmental analysis: from target screening to the identification of unknownsAnalytical and Bioanalytical Chemistry, 2010
- Comprehensive Analytical Strategy for Biomarker Identification based on Liquid Chromatography Coupled to Mass Spectrometry and New Candidate Confirmation ToolsAnalytical Chemistry, 2009
- Identification of Transformation Products of Organic Contaminants in Natural Waters by Computer-Aided Prediction and High-Resolution Mass SpectrometryEnvironmental Science & Technology, 2009
- A Chloroplastic UDP-Glucose Pyrophosphorylase from Arabidopsis Is the Committed Enzyme for the First Step of Sulfolipid BiosynthesisTHE PLANT CELL ONLINE, 2009
- Metabolome Analysis of Biosynthetic Mutants Reveals a Diversity of Metabolic Changes and Allows Identification of a Large Number of New Compounds in ArabidopsisPlant Physiology, 2008
- Optimized liquid chromatography–mass spectrometry approach for the isolation of minor stress biomarkers in plant extracts and their identification by capillary nuclear magnetic resonanceJournal of Chromatography A, 2008
- HMDB: the Human Metabolome DatabaseNucleic Acids Research, 2007
- Collision‐Induced Dissociation (CID) of Peptides and ProteinsMethods in Enzymology, 2005