Spatial autocorrelation among automated geocoding errors and its effects on testing for disease clustering
- 19 January 2010
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 29 (9), 1025-1036
- https://doi.org/10.1002/sim.3836
Abstract
Automated geocoding of patient addresses is an important data assimilation component of many spatial epidemiologic studies. Inevitably, the geocoding process results in positional errors. Positional errors incurred by automated geocoding tend to reduce the power of tests for disease clustering and otherwise affect spatial analytic methods. However, there are reasons to believe that the errors may often be positively spatially correlated and that this may mitigate their deleterious effects on spatial analyses. In this article, we demonstrate explicitly that the positional errors associated with automated geocoding of a data set of more than 6000 addresses in Carroll County, Iowa are spatially autocorrelated. Furthermore, through two simulation studies of disease processes, including one in which the disease process is overlain upon the Carroll County addresses, we show that spatial autocorrelation among geocoding errors maintains the power of two tests for disease clustering at a level higher than that which would occur if the errors were independent. Implications of these results for cluster detection, privacy protection, and measurement error modeling of geographic health data are discussed.Keywords
Funding Information
- National Cancer Institute (N01-PC-35143)
- National Institutes of Health
- U.S. Department of Health and Human Services
This publication has 32 references indexed in Scilit:
- Accuracy of commercial geocoding: assessment and implicationsEpidemiologic Perspectives & Innovations, 2006
- Positional Accuracy of Two Methods of GeocodingEpidemiology, 2005
- Accuracy and Repeatability of Commercial GeocodingAmerican Journal of Epidemiology, 2004
- Improving Geocoding Practices: Evaluation of Geocoding ToolsJournal of Medical Systems, 2004
- Applied Spatial Statistics for Public Health DataWiley Series in Probability and Statistics, 2004
- Geocoding Addresses from a Large Population-based Study: Lessons LearnedEpidemiology, 2003
- Positional Accuracy of Geocoded Addresses in Epidemiologic ResearchEpidemiology, 2003
- Positional error in automated geocoding of residential addressesInternational Journal of Health Geographics, 2003
- Locational uncertainty in georeferencing public health datasetsJournal of Exposure Science & Environmental Epidemiology, 2001
- On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health researchAmerican Journal of Public Health, 2001