Normalization of oligonucleotide arrays based on the least-variant set of genes

Open Access

5 March 2008

journal article
Published by Springer Science and Business Media LLC in BMC Bioinformatics

Vol. 9 (1), 140
https://doi.org/10.1186/1471-2105-9-140

Abstract

Background It is well known that the normalization step of microarray data makes a difference in the downstream analysis. All normalization methods rely on certain assumptions, so differences in results can be traced to different sensitivities to violation of the assumptions. Illustrating the lack of robustness, in a striking spike-in experiment all existing normalization methods fail because of an imbalance between up- and down-regulated genes. This means it is still important to develop a normalization method that is robust against violation of the standard assumptions Results We develop a new algorithm based on identification of the least-variant set (LVS) of genes across the arrays. The array-to-array variation is evaluated in the robust linear model fit of pre-normalized probe-level data. The genes are then used as a reference set for a non-linear normalization. The method is applicable to any existing expression summaries, such as MAS5 or RMA. Conclusion We show that LVS normalization outperforms other normalization methods when the standard assumptions are not satisfied. In the complex spike-in study, LVS performs similarly to the ideal (in practice unknown) housekeeping-gene normalization. An R package called lvs is available in http://www.meb.ki.se/~yudpaw.

Keywords

This publication has 29 references indexed in Scilit:

Comparison of Affymetrix GeneChip expression measures
Bioinformatics, 2006
Multidimensional local false discovery rate for microarray studies
Bioinformatics, 2005
Standardization strategy for quantitative PCR in human seminoma and normal testis
Journal of Biotechnology, 2005
A benchmark for Affymetrix GeneChip expression measures
Bioinformatics, 2004
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
Statistical Applications in Genetics and Molecular Biology, 2004
Exploration, normalization, and summaries of high density oligonucleotide array probe level data
Biostatistics, 2003
Microarray data normalization and transformation
Nature Genetics, 2002
beta-Actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels
Thorax, 2002
Quantification of mRNA using real-time reverse transcription PCR (RT-PCR): trends and problems
Journal of Molecular Endocrinology, 2002
Regression Quantiles
Econometrica, 1978

Cited by 55 articles