Sparsity and Smoothness Via the Fused Lasso

Abstract

Summary. The lasso penalizes a least squares regression by the sum of the absolute values (L₁-norm) of the coefficients. The form of this penalty encourages sparse solutions (with many coefficients equal to 0). We propose the ‘fused lasso’, a generalization that is designed for problems with features that can be ordered in some meaningful way. The fused lasso penalizes the L₁-norm of both the coefficients and their successive differences. Thus it encourages sparsity of the coefficients and also sparsity of their differences—i.e. local constancy of the coefficient profile. The fused lasso is especially useful when the number of features p is much greater than N, the sample size. The technique is also extended to the ‘hinge’ loss function that underlies the support vector classifier. We illustrate the methods on examples from protein mass spectroscopy and gene expression data.
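To make the penalty described above concrete, the following is a minimal sketch, not the authors' implementation. It assumes a penalized (Lagrangian) form of the criterion with two tuning weights. The names `lam1`, `lam2`, `fused_lasso_penalty` and `fused_lasso_objective` are illustrative and do not come from the abstract, which does not state the exact formulation or the fitting algorithm. The sketch only evaluates the criterion for a given coefficient vector; it does not solve the optimization problem.

```python
def fused_lasso_penalty(beta, lam1, lam2):
    """Weighted sum of the two L1 terms named in the abstract.

    lam1 weights the L1-norm of the coefficients (sparsity);
    lam2 weights the L1-norm of successive differences (local constancy).
    Both weights are assumed, illustrative tuning parameters.
    """
    sparsity = sum(abs(b) for b in beta)
    smoothness = sum(abs(b1 - b0) for b0, b1 in zip(beta, beta[1:]))
    return lam1 * sparsity + lam2 * smoothness


def fused_lasso_objective(X, y, beta, lam1, lam2):
    """Least squares loss plus the penalty above (assumed Lagrangian form).

    X is a list of rows (one per sample), y a list of responses,
    beta a list of coefficients in their meaningful feature order.
    """
    residuals = [
        yi - sum(xij * bj for xij, bj in zip(row, beta))
        for row, yi in zip(X, y)
    ]
    loss = 0.5 * sum(r * r for r in residuals)
    return loss + fused_lasso_penalty(beta, lam1, lam2)


# Two coefficient profiles with the same L1-norm but different smoothness.
blocky = [0.0, 0.0, 2.0, 2.0, 2.0, 0.0]   # locally constant profile
jagged = [2.0, 0.0, 2.0, 0.0, 2.0, 0.0]   # same magnitudes, no local constancy

blocky_penalty = fused_lasso_penalty(blocky, lam1=1.0, lam2=1.0)
jagged_penalty = fused_lasso_penalty(jagged, lam1=1.0, lam2=1.0)
```

In this toy comparison both profiles have the same coefficient L1-norm, so an ordinary lasso penalty would not distinguish them. The successive-difference term is what makes the locally constant profile cheaper, which is the behaviour the abstract describes for ordered features.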

Funding Information

National Science Foundation (DMS-9971405, CCR-0306662)
National Institutes of Health (N01-HV-28183)
Office of Naval Research (N00014-02-1-0076)
