Combining Multiple Imputation and Inverse‐Probability Weighting

Open Access

3 November 2011

journal article
Published by Oxford University Press (OUP) in Biometrics

Vol. 68 (1), 129-137
https://doi.org/10.1111/j.1541-0420.2011.01666.x

Abstract

Two approaches commonly used to deal with missing data are multiple imputation (MI) and inverse-probability weighting (IPW). IPW is also used to adjust for unequal sampling fractions. MI is generally more efficient than IPW but more complex. Whereas IPW requires only a model for the probability that an individual has complete data (a univariate outcome), MI needs a model for the joint distribution of the missing data (a multivariate outcome) given the observed data. Inadequacies in either model may lead to important bias if large amounts of data are missing. A third approach combines MI and IPW to give a doubly robust estimator. A fourth approach (IPW/MI) also combines MI and IPW but, unlike doubly robust methods, imputes only isolated missing values and uses weights to account for the remaining larger blocks of unimputed missing data, such as arise in a cohort study subject to sample attrition, and/or for unequal sampling fractions. In this article, we examine the performance, in terms of bias and efficiency, of IPW/MI relative to MI and IPW alone and investigate whether Rubin's rules variance estimator is valid for IPW/MI. We prove that Rubin's rules variance estimator is valid for IPW/MI for linear regression with an imputed outcome, we present simulations supporting the use of this variance estimator in more general settings, and we demonstrate that IPW/MI can have advantages over alternatives. IPW/MI is applied to data from the National Child Development Study.
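
For readers unfamiliar with the variance estimator discussed in the abstract, the following is a minimal sketch of Rubin's rules for pooling results across M imputed datasets. It is an illustration only and is not taken from the article: the function name `rubin_pool` and the input values are hypothetical. It assumes each per-imputation estimate and its variance estimate have already been computed, which under IPW/MI would presumably come from a weighted analysis of each imputed dataset.

```python
from statistics import mean, variance


def rubin_pool(estimates, variances):
    """Pool M per-imputation estimates using Rubin's rules.

    estimates: point estimates, one per imputed dataset (hypothetical inputs)
    variances: the corresponding estimated variances of those estimates
    Returns (pooled estimate, total variance).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need M >= 2 estimates with matching variances")
    q_bar = mean(estimates)      # pooled point estimate
    w = mean(variances)          # within-imputation variance
    b = variance(estimates)      # between-imputation variance (divisor M - 1)
    t = w + (1 + 1 / m) * b      # total variance under Rubin's rules
    return q_bar, t


# Illustrative values only, not results from the article.
pooled, total_var = rubin_pool([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
print(pooled, total_var)
```

The total variance adds the average within-imputation variance to the between-imputation variance inflated by a factor of (1 + 1/M), which reflects the finite number of imputations.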

