Efficient quadratic regularization for expression arrays

Open Access

1 July 2004

journal article
Published by Oxford University Press (OUP) in Biostatistics

Vol. 5 (3), 329-340
https://doi.org/10.1093/biostatistics/5.3.329

Abstract

Gene expression arrays typically have 50 to 100 samples and 1000 to 20 000 variables (genes). There have been many attempts to adapt statistical models for regression and classification to these data, and in many cases these attempts have challenged the computational resources. In this article we expose a class of techniques based on quadratic regularization of linear models, including regularized (ridge) regression, logistic and multinomial regression, linear and mixture discriminant analysis, the Cox model and neural networks. For all of these models, we show that dramatic computational savings are possible over naive implementations, using standard transformations in numerical linear algebra.

Keywords

EUCLIDEAN METHODS
:E IGENGENES
QUADRATIC REGULARIZATION
SVD.
DISCRIMINANT ANALYSIS
NUMERICAL LINEAR ALGEBRA
LINEAR MODEL
GENE EXPRESSION
RIDGE REGRESSION
NEURAL NETWORK
STATISTICAL MODEL
COX MODEL

Cited by 30 articles