Sure independence screening in generalized linear models with NP-dimensionality
Top Cited Papers
Open Access
- 1 December 2010
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 38 (6)
- https://doi.org/10.1214/10-aos798
Abstract
Ultrahigh-dimensional variable selection plays an increasingly important role in contemporary scientific discoveries and statistical research. Among others, Fan and Lv [J. R. Stat. Soc. Ser. B Stat. Methodol. 70 (2008) 849-911] propose an independent screening framework by ranking the marginal correlations. They showed that the correlation ranking procedure possesses a sure independence screening property within the context of the linear model with Gaussian covariates and responses. In this paper, we propose a more general version of the independent learning with ranking the maximum marginal likelihood estimates or the maximum marginal likelihood itself in generalized linear models. We show that the proposed methods, with Fan and Lv [J. R. Stat. Soc. Ser. B Stat. Methodol. 70 (2008) 849-911] as a very special case, also possess the sure screening property with vanishing false selection rate. The conditions under which the independence learning possesses a sure screening is surprisingly simple. This justifies the applicability of such a simple method in a wide spectrum. We quantify explicitly the extent to which the dimensionality can be reduced by independence screening, which depends on the interactions of the covariance matrix of covariates and true parameters. Simulation studies are used to illustrate the utility of the proposed approaches. In addition, we establish an exponential inequality for the quasi-maximum likelihood estimator which is useful for high-dimensional statistical learning.Comment: Published in at http://dx.doi.org/10.1214/10-AOS798 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.orgKeywords
Other Versions
This publication has 23 references indexed in Scilit:
- High-dimensional classification using features annealed independence rulesThe Annals of Statistics, 2008
- Sure Independence Screening for Ultrahigh Dimensional Feature SpaceJournal of the Royal Statistical Society Series B: Statistical Methodology, 2008
- High-dimensional generalized linear models and the lassoThe Annals of Statistics, 2008
- Asymptotic properties of bridge estimators in sparse high-dimensional regression modelsThe Annals of Statistics, 2008
- The Adaptive Lasso and Its Oracle PropertiesJournal of the American Statistical Association, 2006
- Persistence in high-dimensional linear predictor selection and the virtue of overparametrizationBernoulli, 2004
- M-estimation using penalties or sievesJournal of Statistical Planning and Inference, 2002
- A Statistical View of Some Chemometrics Regression ToolsTechnometrics, 1993
- Projection Pursuit RegressionJournal of the American Statistical Association, 1981
- Probability Inequalities for Sums of Bounded Random VariablesJournal of the American Statistical Association, 1963