A statistical selection strategy for normalization procedures in LC‐MS proteomics experiments through dataset‐dependent ranking of normalization scaling factors
- 28 October 2011
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 11 (24), 4736-4741
- https://doi.org/10.1002/pmic.201100078
Abstract
Quantification of LC‐MS peak intensities assigned during peptide identification in a typical comparative proteomics experiment will deviate from run‐to‐run of the instrument due to both technical and biological variation. Thus, normalization of peak intensities across an LC‐MS proteomics dataset is a fundamental step in pre‐processing. However, the downstream analysis of LC‐MS proteomics data can be dramatically affected by the normalization method selected. Current normalization procedures for LC‐MS proteomics data are presented in the context of normalization values derived from subsets of the full collection of identified peptides. The distribution of these normalization values is unknown a priori. If they are not independent from the biological factors associated with the experiment the normalization process can introduce bias into the data, possibly affecting downstream statistical biomarker discovery. We present a novel approach to evaluate normalization strategies, which includes the peptide selection component associated with the derivation of normalization values. Our approach evaluates the effect of normalization on the between‐group variance structure in order to identify the most appropriate normalization methods that improve the structure of the data without introducing bias into the normalized peak intensities.Keywords
Funding Information
- National Institutes of Health (1R011GM084892, U54-016015, U54-AI081680, HHSN272200800060C)
- U.S. Department of Energy (DE-AC05-76RL01830)
This publication has 11 references indexed in Scilit:
- Improved quality control processing of peptide-centric LC-MS proteomics dataBioinformatics, 2011
- Combined Statistical Analyses of Peptide Intensities and Peptide Occurrences Improves Identification of Significant Peptides from MS-Based Proteomics DataJournal of Proteome Research, 2010
- Development and Evaluation of Normalization Methods for Label-free Relative Quantification of Endogenous PeptidesMolecular & Cellular Proteomics, 2009
- Normalization of peak intensities in bottom-up MS-based proteomics using singular value decompositionBioinformatics, 2009
- Statistical Design of Quantitative Mass Spectrometry-Based Proteomic ExperimentsJournal of Proteome Research, 2009
- Papers on normalization, variable selection, classification or clustering of microarray dataBioinformatics, 2009
- Normalization Approaches for Removing Systematic Biases Associated with Mass Spectrometry and Label-Free ProteomicsJournal of Proteome Research, 2006
- NORMALIZATION REGARDING NON-RANDOM MISSING VALUES IN HIGH-THROUGHPUT MASS SPECTROMETRY DATAPacific Symposium on Biocomputing, 2005
- Microarray data normalization and transformationNature Genetics, 2002
- An accurate mass tag strategy for quantitative and high-throughput proteome measurementsProteomics, 2002