Non-parametric approach for frequentist multiple imputation in survival analysis with missing covariates
- 10 June 2021
- journal article
- research article
- Published by SAGE Publications in Statistical Methods in Medical Research
- Vol. 30 (7), 1691-1707
- https://doi.org/10.1177/09622802211011197
Abstract
In clinical and epidemiological studies using survival analysis, some explanatory variables are often missing. When this occurs, multiple imputation (MI) is frequently used in practice. In many cases, simple parametric imputation models are routinely adopted without checking the validity of the model specification. Misspecified imputation models can cause biased parameter estimates. In this study, we describe novel frequentist type MI procedures for survival analysis using proportional and additive hazards models. The procedures are based on non-parametric estimation techniques and do not require the correct specification of parametric imputation models. For continuous missing covariates, we first sample imputation values from a parametric imputation model. Then, we obtain estimates by solving the estimating equation modified by non-parametrically estimated conditional densities. For categorical missing covariates, we directly sample imputation values from a non-parametrically estimated conditional distribution and then obtain estimates by solving the corresponding estimating equation. We evaluate the performance of the proposed procedures using simulation studies: one uses simulated data; another uses data informed by parameters generated from a real-world medical claims database. We also applied the procedures to a pharmacoepidemiological study that examined the effect of antihyperlipidemics on hyperglycemia incidence.Funding Information
- Institute for Health Economics and Policy, Tokyo, Japan (No grant number)
This publication has 36 references indexed in Scilit:
- Desmopressin and the risk of hyponatremia: A population-based cohort studyPLoS Medicine, 2019
- Sodium glucose cotransporter 2 inhibitors and risk of serious adverse events: nationwide register based cohort studyBMJ, 2018
- Multiple Imputation: A Review of Practical and Theoretical FindingsStatistical Science, 2018
- Multiple Imputation for Incomplete Data in Epidemiologic StudiesAmerican Journal of Epidemiology, 2017
- Principled Approaches to Missing Data in Epidemiologic StudiesAmerican Journal of Epidemiology, 2017
- Lipid-lowering drugs and risk of new-onset diabetes: a cohort study using Japanese healthcare data linked to clinical data for health screeningBMJ Open, 2017
- Missing laboratory test data in electronic general practice records: analysis of rheumatoid factor recording in the clinical practice research datalinkPharmacoepidemiology and Drug Safety, 2015
- Exploring the effect of erythropoietin on mortality using USRDS dataPharmacoepidemiology and Drug Safety, 2013
- Multiple Imputation after 18+ YearsJournal of the American Statistical Association, 1996
- Multiple Imputation for Nonresponse in SurveysWiley Series in Probability and Statistics, 1987