l2-Penalized temporal logit-mixed models for the estimation of regional obesity prevalence over time
- 2 June 2021
- journal article
- research article
- Published by SAGE Publications in Statistical Methods in Medical Research
- Vol. 30 (7), 1744-1768
- https://doi.org/10.1177/09622802211017583
Abstract
Obesity is considered to be one of the primary health risks in modern industrialized societies. Estimating the evolution of its prevalence over time is an essential element of public health reporting. This requires the application of suitable statistical methods on epidemiologic data with substantial local detail. Generalized linear-mixed models with medical treatment records as covariates mark a powerful combination for this purpose. However, the task is methodologically challenging. Disease frequencies are subject to both regional and temporal heterogeneity. Medical treatment records often show strong internal correlation due to diagnosis-related grouping. This frequently causes excessive variance in model parameter estimation due to rank-deficiency problems. Further, generalized linear-mixed models are often estimated via approximate inference methods as their likelihood functions do not have closed forms. These problems combined lead to unacceptable uncertainty in prevalence estimates over time. We propose an l2-penalized temporal logit-mixed model to solve these issues. We derive empirical best predictors and present a parametric bootstrap to estimate their mean-squared errors. A novel penalized maximum approximate likelihood algorithm for model parameter estimation is stated. With this new methodology, the regional obesity prevalence in Germany from 2009 to 2012 is estimated. We find that the national prevalence ranges between 15 and 16%, with significant regional clustering in eastern Germany.
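As an illustration of the rank-deficiency problem the abstract mentions, the following is a minimal sketch of how an l2 (ridge) penalty keeps a logistic fit well defined when covariates are perfectly correlated. It is not the authors' penalized maximum approximate likelihood algorithm: it omits the random effects and the temporal structure entirely, uses simulated data, and the function name `fit_ridge_logit`, the penalty weight `lam`, and all other identifiers are assumptions made for this example.

```python
import numpy as np


def fit_ridge_logit(X, y, lam, n_iter=50):
    """Newton-Raphson for logistic regression with penalty (lam / 2) * ||beta||^2.

    Hypothetical helper for illustration only; fixed effects, no intercept.
    """
    n, p = X.shape
    beta = np.zeros(p)
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-(X @ beta)))   # fitted probabilities
        w = mu * (1.0 - mu)                      # Bernoulli variance weights
        grad = X.T @ (y - mu) - lam * beta       # penalized score
        # Without lam * I this matrix is singular for collinear columns.
        hess = X.T @ (X * w[:, None]) + lam * np.eye(p)
        beta = beta + np.linalg.solve(hess, grad)
    return beta


# Simulated data (an assumption, not from the paper): two identical
# covariates, mimicking strongly correlated treatment-record variables.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
X = np.column_stack([x1, x1.copy()])
prob = 1.0 / (1.0 + np.exp(-x1))
y = (rng.uniform(size=n) < prob).astype(float)

rank = np.linalg.matrix_rank(X)          # design matrix is rank-deficient
beta = fit_ridge_logit(X, y, lam=1.0)    # penalty makes the solution unique
print("rank of X:", rank)
print("ridge coefficients:", beta)
```

In this toy setting the unpenalized problem has infinitely many solutions, since only the sum of the two coefficients is identified. The penalty selects the one with the smallest norm, which splits the effect evenly between the duplicated covariates.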
Funding Information
- Spanish Grant (PGC2018-096840-B-I00)
- German Federal Statistical Office (RIFOSS)