Dealing with missing information on covariates for excess mortality hazard regression models – Making the imputation model compatible with the substantive model
- 2 September 2021
- journal article
- research article
- Published by SAGE Publications in Statistical Methods in Medical Research
- Vol. 30 (10), 2256-2268
- https://doi.org/10.1177/09622802211031615
Abstract
Missing data is a common issue in epidemiological databases. Among the different ways of dealing with missing data, multiple imputation has become more available in common statistical software packages. However, the incompatibility between the imputation and substantive model, which can arise when the associations between variables in the substantive model are not taken into account in the imputation models or when the substantive model is itself nonlinear, can lead to invalid inference. Aiming at analysing population-based cancer survival data, we extended the multiple imputation substantive model compatible-fully conditional specification (SMC-FCS) approach, proposed by Bartlett et al. in 2015, to accommodate excess hazard regression models. The proposed approach was compared with the standard fully conditional specification multiple imputation procedure and with the complete-case analysis using a simulation study. The SMC-FCS approach produced unbiased estimates in both scenarios tested, while the fully conditional specification produced biased estimates and poor empirical coverage probabilities. The SMC-FCS algorithm was then used for handling missing data in the evaluation of socioeconomic inequalities in survival of colorectal cancer patients diagnosed in the North Region of Portugal. The analysis using SMC-FCS showed a clearer trend towards higher excess hazards for patients coming from more deprived areas. The proposed algorithm was implemented in R software and is presented as Supplementary Material.
This publication has 30 references indexed in Scilit:
- The influence of geographical access to health care and material deprivation on colorectal cancer survival: Evidence from France and England. Health & Place, 2014
- Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model. Statistical Methods in Medical Research, 2014
- Construction of an adaptable European transnational ecological deprivation index: the French version. Journal of Epidemiology and Community Health, 2012
- On Estimation in Relative Survival. Biometrics, 2011
- Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study. BMC Medical Research Methodology, 2010
- Modelling relative survival in the presence of incomplete data: a tutorial. International Journal of Epidemiology, 2009
- Imputing missing covariate values for the Cox model. Statistics in Medicine, 2009
- The performance of multiple imputation for missing covariate data within the context of regression relative survival analysis. Statistics in Medicine, 2008
- Imputations of Missing Values in Practice: Results from Imputations of Serum Cholesterol in 28 Cohort Studies. American Journal of Epidemiology, 2004
- Developing a prognostic model in the presence of missing data: an ovarian cancer case study. Journal of Clinical Epidemiology, 2003