Interpreting Posterior Relative Risk Estimates in Disease-Mapping Studies
Top Cited Papers
- 1 June 2004
- journal article
- review article
- Published by Environmental Health Perspectives in Environmental Health Perspectives
- Vol. 112 (9), 1016-1025
- https://doi.org/10.1289/ehp.6740
Abstract
There is currently much interest in conducting spatial analyses of health outcomes at the small-area scale. This requires sophisticated statistical techniques, usually involving Bayesian models, to smooth the underlying risk estimates because the data are typically sparse. However, questions have been raised about the performance of these models for recovering the “true” risk surface, about the influence of the prior structure specified, and about the amount of smoothing of the risks that is actually performed. We describe a comprehensive simulation study designed to address these questions. Our results show that Bayesian disease-mapping models are essentially conservative, with high specificity even in situations with very sparse data but low sensitivity if the raised-risk areas have only a moderate (< 2-fold) excess or are not based on substantial expected counts (> 50 per area). Semiparametric spatial mixture models typically produce less smoothing than their conditional autoregressive counterpart when there is sufficient information in the data (moderate-size expected count and/or high true excess risk). Sensitivity may be improved by exploiting the whole posterior distribution to try to detect true raised-risk areas rather than just reporting and mapping the mean posterior relative risk. For the widely used conditional autoregressive model, we show that a decision rule based on computing the probability that the relative risk is above 1 with a cutoff between 70 and 80% gives a specific rule with reasonable sensitivity for a range of scenarios having moderate expected counts (~ 20) and excess risks (~1.5- to 2-fold). Larger (3-fold) excess risks are detected almost certainly using this rule, even when based on small expected counts, although the mean of the posterior distribution is typically smoothed to about half the true value.Keywords
This publication has 18 references indexed in Scilit:
- Proper multivariate conditional autoregressive models for spatial data analysisBiostatistics, 2003
- Hidden Markov Models and Disease MappingJournal of the American Statistical Association, 2002
- Geographical epidemiology of prostate cancer in Great BritainInternational Journal of Cancer, 2002
- Clustering, cluster detection, and spatial variation in riskPublished by Oxford University Press (OUP) ,2001
- A Shared Component Model for Detecting Joint and Selective Clustering of Two DiseasesJournal of the Royal Statistical Society Series A: Statistics in Society, 2001
- Disease mapping models: an empirical evaluationStatistics in Medicine, 2000
- Bayesian Detection of Clusters and Discontinuities in Disease MapsBiometrics, 2000
- Triple-goal Estimates in Two-stage Hierarchical ModelsJournal of the Royal Statistical Society Series B: Statistical Methodology, 1998
- Bayesian methods for mapping disease riskPublished by Oxford University Press (OUP) ,1996
- Spatial Correlation in Ecological AnalysisInternational Journal of Epidemiology, 1993