Interpreting Posterior Relative Risk Estimates in Disease-Mapping Studies

Top Cited Papers

1 June 2004

journal article
review article
Published by Environmental Health Perspectives in Environmental Health Perspectives

Vol. 112 (9), 1016-1025
https://doi.org/10.1289/ehp.6740

Abstract

There is currently much interest in conducting spatial analyses of health outcomes at the small-area scale. This requires sophisticated statistical techniques, usually involving Bayesian models, to smooth the underlying risk estimates because the data are typically sparse. However, questions have been raised about the performance of these models for recovering the “true” risk surface, about the influence of the prior structure specified, and about the amount of smoothing of the risks that is actually performed. We describe a comprehensive simulation study designed to address these questions. Our results show that Bayesian disease-mapping models are essentially conservative, with high specificity even in situations with very sparse data but low sensitivity if the raised-risk areas have only a moderate (< 2-fold) excess or are not based on substantial expected counts (> 50 per area). Semiparametric spatial mixture models typically produce less smoothing than their conditional autoregressive counterpart when there is sufficient information in the data (moderate-size expected count and/or high true excess risk). Sensitivity may be improved by exploiting the whole posterior distribution to try to detect true raised-risk areas rather than just reporting and mapping the mean posterior relative risk. For the widely used conditional autoregressive model, we show that a decision rule based on computing the probability that the relative risk is above 1 with a cutoff between 70 and 80% gives a specific rule with reasonable sensitivity for a range of scenarios having moderate expected counts (~ 20) and excess risks (~1.5- to 2-fold). Larger (3-fold) excess risks are detected almost certainly using this rule, even when based on small expected counts, although the mean of the posterior distribution is typically smoothed to about half the true value.

Keywords

This publication has 18 references indexed in Scilit:

Proper multivariate conditional autoregressive models for spatial data analysis
Biostatistics, 2003
Hidden Markov Models and Disease Mapping
Journal of the American Statistical Association, 2002
Geographical epidemiology of prostate cancer in Great Britain
International Journal of Cancer, 2002
Clustering, cluster detection, and spatial variation in risk
Published by Oxford University Press (OUP) ,2001
A Shared Component Model for Detecting Joint and Selective Clustering of Two Diseases
Journal of the Royal Statistical Society Series A: Statistics in Society, 2001
Disease mapping models: an empirical evaluation
Statistics in Medicine, 2000
Bayesian Detection of Clusters and Discontinuities in Disease Maps
Biometrics, 2000
Triple-goal Estimates in Two-stage Hierarchical Models
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1998
Bayesian methods for mapping disease risk
Published by Oxford University Press (OUP) ,1996
Spatial Correlation in Ecological Analysis
International Journal of Epidemiology, 1993

Cited by 392 articles