Principled missing data methods for researchers
Top Cited Papers
Open Access
- 14 May 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in SpringerPlus
- Vol. 2 (1), 1-17
- https://doi.org/10.1186/2193-1801-2-222
Abstract
The impact of missing data on quantitative research can be serious, leading to biased estimates of parameters, loss of information, decreased statistical power, increased standard errors, and weakened generalizability of findings. In this paper, we discussed and demonstrated three principled missing data methods: multiple imputation, full information maximum likelihood, and expectation-maximization algorithm, applied to a real-world data set. Results were contrasted with those obtained from the complete data set and from the listwise deletion method. The relative merits of each method are noted, along with common features they share. The paper concludes with an emphasis on the importance of statistical assumptions, and recommendations for researchers. Quality of research will be enhanced if (a) researchers explicitly acknowledge missing data problems and the conditions under which they occurred, (b) principled methods are employed to handle missing data, and (c) the appropriate treatment of missing data is incorporated into review standards of manuscripts submitted for publication.Keywords
This publication has 59 references indexed in Scilit:
- Best practices for missing data management in counseling psychology.Journal of Counseling Psychology, 2010
- Missing Data Analysis: Making It Work in the Real WorldAnnual Review of Psychology, 2009
- Plausibility of multivariate normality assumption when multiply imputing non-Gaussian continuous outcomes: a simulation assessmentJournal of Statistical Computation and Simulation, 2008
- How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation TheoryPrevention Science, 2007
- Much Ado About NothingThe American Statistician, 2007
- Robustness of a multivariate normal approximation for imputation of incomplete binary dataStatistics in Medicine, 2006
- Using the Expectation Maximization Algorithm to Estimate Coefficient Alpha for Scales With Item-Level Missing Data.Psychological Methods, 2003
- How can I deal with missing data in my study?Australian and New Zealand Journal of Public Health, 2001
- Multiple Imputation after 18+ YearsJournal of the American Statistical Association, 1996
- The Calculation of Posterior Distributions by Data AugmentationJournal of the American Statistical Association, 1987