A statistical selection strategy for normalization procedures in LC‐MS proteomics experiments through dataset‐dependent ranking of normalization scaling factors

28 October 2011

journal article
research article
Published by Wiley in Proteomics

Vol. 11 (24), 4736-4741
https://doi.org/10.1002/pmic.201100078

Abstract

Quantification of LC‐MS peak intensities assigned during peptide identification in a typical comparative proteomics experiment will deviate from run‐to‐run of the instrument due to both technical and biological variation. Thus, normalization of peak intensities across an LC‐MS proteomics dataset is a fundamental step in pre‐processing. However, the downstream analysis of LC‐MS proteomics data can be dramatically affected by the normalization method selected. Current normalization procedures for LC‐MS proteomics data are presented in the context of normalization values derived from subsets of the full collection of identified peptides. The distribution of these normalization values is unknown a priori. If they are not independent from the biological factors associated with the experiment the normalization process can introduce bias into the data, possibly affecting downstream statistical biomarker discovery. We present a novel approach to evaluate normalization strategies, which includes the peptide selection component associated with the derivation of normalization values. Our approach evaluates the effect of normalization on the between‐group variance structure in order to identify the most appropriate normalization methods that improve the structure of the data without introducing bias into the normalized peak intensities.

Keywords

Funding Information

National Institutes of Health (1R011GM084892, U54-016015, U54-AI081680, HHSN272200800060C)
U.S. Department of Energy (DE-AC05-76RL01830)

This publication has 11 references indexed in Scilit:

Improved quality control processing of peptide-centric LC-MS proteomics data
Bioinformatics, 2011
Combined Statistical Analyses of Peptide Intensities and Peptide Occurrences Improves Identification of Significant Peptides from MS-Based Proteomics Data
Journal of Proteome Research, 2010
Development and Evaluation of Normalization Methods for Label-free Relative Quantification of Endogenous Peptides
Molecular & Cellular Proteomics, 2009
Normalization of peak intensities in bottom-up MS-based proteomics using singular value decomposition
Bioinformatics, 2009
Statistical Design of Quantitative Mass Spectrometry-Based Proteomic Experiments
Journal of Proteome Research, 2009
Papers on normalization, variable selection, classification or clustering of microarray data
Bioinformatics, 2009
Normalization Approaches for Removing Systematic Biases Associated with Mass Spectrometry and Label-Free Proteomics
Journal of Proteome Research, 2006
NORMALIZATION REGARDING NON-RANDOM MISSING VALUES IN HIGH-THROUGHPUT MASS SPECTROMETRY DATA
Pacific Symposium on Biocomputing, 2005
Microarray data normalization and transformation
Nature Genetics, 2002
An accurate mass tag strategy for quantitative and high-throughput proteome measurements
Proteomics, 2002

Cited by 82 articles