An Empirical Evaluation of the Predictive Mean Matching Method for Imputing Missing Values

1 August 1997

journal article
research article
Published by SAGE Publications in Sociological Methods & Research

Vol. 26 (1), 3-33
https://doi.org/10.1177/0049124197026001001

Abstract

This article reports empirical explorations of how well the predictive mean matching method for imputing missing data works for an often problematic variable—income—when income is used as an explanatory variable in a substantive regression model. It is found that the performance of the predictive mean method varies considerably with the predictive power of the imputation regression model and the percentage of cases with missing data on income. In comparisons of single-value with multiple-imputation methods, it also is found that the amount of bias and the loss of precision associated with single-value methods is considerably less than that associated with a weak imputation model. Situations in which using imputed data can lead to seriously biased estimates of regression coefficients (and related statistics) and situations in which the bias is so minimal as to be nonproblematic are identified.

Keywords

MISSING VALUES

This publication has 5 references indexed in Scilit:

Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable Nonresponse
Journal of the American Statistical Association, 1986
Alternative Methods for CPS Income Imputation
Journal of the American Statistical Association, 1986
Some efficient random imputation methods
Communications in Statistics - Theory and Methods, 1984
An Introduction to Sample Selection Bias in Sociological Data
American Sociological Review, 1983
Occupational Prestige in the United States, 1925-63
American Journal of Sociology, 1964

Cited by 120 articles