Combining Multiple Imputation and Inverse‐Probability Weighting
Open Access
- 3 November 2011
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 68 (1), 129-137
- https://doi.org/10.1111/j.1541-0420.2011.01666.x
Abstract
Two approaches commonly used to deal with missing data are multiple imputation (MI) and inverse-probability weighting (IPW). IPW is also used to adjust for unequal sampling fractions. MI is generally more efficient than IPW but more complex. Whereas IPW requires only a model for the probability that an individual has complete data (a univariate outcome), MI needs a model for the joint distribution of the missing data (a multivariate outcome) given the observed data. Inadequacies in either model may lead to important bias if large amounts of data are missing. A third approach combines MI and IPW to give a doubly robust estimator. A fourth approach (IPW/MI) combines MI and IPW but, unlike doubly robust methods, imputes only isolated missing values and uses weights to account for remaining larger blocks of unimputed missing data, such as would arise, e.g., in a cohort study subject to sample attrition, and/or unequal sampling fractions. In this article, we examine the performance, in terms of bias and efficiency, of IPW/MI relative to MI and IPW alone and investigate whether the Rubin's rules variance estimator is valid for IPW/MI. We prove that the Rubin's rules variance estimator is valid for IPW/MI for linear regression with an imputed outcome, we present simulations supporting the use of this variance estimator in more general settings, and we demonstrate that IPW/MI can have advantages over alternatives. IPW/MI is applied to data from the National Child Development Study.Keywords
This publication has 25 references indexed in Scilit:
- Multiple imputation using chained equations: Issues and guidance for practiceStatistics in Medicine, 2010
- Bias and efficiency of multiple imputation compared with complete‐case analysis for missing covariate valuesStatistics in Medicine, 2010
- Analysis of Incomplete Data Using Inverse Probability Weighting and Doubly Robust EstimatorsMethodology, 2010
- Psychosocial work characteristics and anxiety and depressive disorders in midlife: the effects of prior psychological distressOccupational and Environmental Medicine, 2008
- On the Bias of the Multiple-Imputation Variance Estimator in Survey SamplingJournal of the Royal Statistical Society Series B: Statistical Methodology, 2006
- Proper and Improper Multiple ImputationInternational Statistical Review, 2003
- Large-sample theory for parametric multiple imputation proceduresBiometrika, 1998
- Indicator and Stratification Methods for Missing Explanatory Variables in Multiple Linear RegressionJournal of the American Statistical Association, 1996
- Estimation of Regression Coefficients When Some Regressors are not Always ObservedJournal of the American Statistical Association, 1994
- Asymptotic Results for Multiple ImputationThe Annals of Statistics, 1988