A new data adaptive elastic net predictive model using hybridized smoothed covariance estimators with information complexity
- 11 February 2019
- journal article
- conference paper
- Published by Taylor & Francis Ltd in Journal of Statistical Computation and Simulation
- Vol. 89 (6), 1060-1089
- https://doi.org/10.1080/00949655.2019.1576683
Abstract
We develop a novel Adaptive Elastic Net (AEN) modelling using a new covariance-regularization approach via the Hybridized Smoothed Covariance Estimators (HSCEs) to identify and select the best subset of predictors in undersized high-dimensional data sets. We introduce and score the Consistent and Misspecification Resistant Information Measure of Complexity criterion, and the Extended Consistent Akaike's Information Criterion with Fisher Information in AEN models for the first time. We carry out a large Monte Carlo simulation study using the medianmean-squared-error (MMSE) to demonstrate and compare the performance of the MMSE prediction. This is done using Cross-validated Fit Adaptive Elastic Net (CV-AEN) to avoid double shrinkage by varying both the error variance and the correlation structure of the model. Later, the new proposed AEN model is applied to a real undersized benchmark data set to predict the Riboflavin (Vitamin B2) production to select the best subset of predictors to predict the production rate of vitamin B2 and provide the best predictive model. The proposed new approach enables a simple and reliable identification of the best subset of predictive genes of the production rate of Riboflavin (Vitamin B2) without an exhaustive search of all possible subset selection in undersized high-dimensional data. It is a new and novel approach that has generalizability to other regularized General Linear Regression (GLM) models to determine the best predictor space for undersized data.
Keywords
This publication has 34 references indexed in Scilit:
- Nearly unbiased variable selection under minimax concave penaltyThe Annals of Statistics, 2010
- The Adaptive Lasso and Its Oracle PropertiesJournal of the American Statistical Association, 2006
- Regularization and Variable Selection Via the Elastic NetJournal of the Royal Statistical Society Series B: Statistical Methodology, 2005
- Least angle regressionThe Annals of Statistics, 2004
- Akaike's Information Criterion and Recent Developments in Information ComplexityJournal of Mathematical Psychology, 2000
- Empirical Bayes Estimation of the Multivariate Normal Covariance MatrixThe Annals of Statistics, 1980
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978
- A Monte Carlo Evaluation of Some Ridge-Type EstimatorsJournal of the American Statistical Association, 1975
- Sufficient statistics and minimum variance estimatesMathematical Proceedings of the Cambridge Philosophical Society, 1949
- Minimum variance and the estimation of several parametersMathematical Proceedings of the Cambridge Philosophical Society, 1947