Greedy function approximation: A gradient boosting machine.

Top Cited Papers

Open Access

1 October 2001

journal article
Published by Institute of Mathematical Statistics in The Annals of Statistics

Vol. 29 (5), 1189-1232
https://doi.org/10.1214/aos/1013203451

Abstract

Function estimation/approximation is viewed from the perspective of numerical optimization in function space, rather than parameter space. A connection is made between stagewise additive expansions and steepest-descent minimization. A general gradient descent “boosting” paradigm is developed for additive expansions based on any fitting criterion.Specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification. Special enhancements are derived for the particular case where the individual additive components are regression trees, and tools for interpreting such “TreeBoost” models are presented. Gradient boosting of regression trees produces competitive, highly robust, interpretable procedures for both regression and classification, especially appropriate for mining less than clean data. Connections between this approach and the boosting methods of Freund and Shapire and Friedman, Hastie and Tibshirani are discussed.

Keywords

This publication has 15 references indexed in Scilit:

Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors)
The Annals of Statistics, 2000
A Geometric Approach to Leveraging Weak Learners
Lecture Notes in Computer Science, 1999
Cr-Pyrope Garnets in the Lithospheric Mantle. I. Compositional Systematics and Relations to Tectonic Setting
Journal of Petrology, 1999
The Visual Design and Control of Trellis Display
Journal of Computational and Graphical Statistics, 1996
Matching pursuits with time-frequency dictionaries
IEEE Transactions on Signal Processing, 1993
Nonlinear wavelet methods for recovery of signals, densities, and spectra from indirect and noisy data
Proceedings of Symposia in Applied Mathematics, 1993
Multivariate Adaptive Regression Splines
The Annals of Statistics, 1991
Learning representations by back-propagating errors
Nature, 1986
Robust Estimation of a Location Parameter
The Annals of Mathematical Statistics, 1964
A Mathematical Approach to Medical Diagnosis
JAMA, 1961

Cited by 12416 articles