The re-identification risk of Canadians from longitudinal demographics
Open Access
- 22 June 2011
- journal article
- Published by Springer Science and Business Media LLC in BMC Medical Informatics and Decision Making
- Vol. 11 (1), 46
- https://doi.org/10.1186/1472-6947-11-46
Abstract
The public is less willing to allow their personal health information to be disclosed for research purposes if they do not trust researchers and how researchers manage their data. However, the public is more comfortable with their data being used for research if the risk of re-identification is low. There are few studies on the risk of re-identification of Canadians from their basic demographics, and no studies on their risk from their longitudinal data. Our objective was to estimate the risk of re-identification from the basic cross-sectional and longitudinal demographics of Canadians. Uniqueness is a common measure of re-identification risk. Demographic data on a 25% random sample of the population of Montreal were analyzed to estimate population uniqueness on postal code, date of birth, and gender as well as their generalizations, for periods ranging from 1 year to 11 years. Almost 98% of the population was unique on full postal code, date of birth and gender: these three variables are effectively a unique identifier for Montrealers. Uniqueness increased for longitudinal data. Considerable generalization was required to reach acceptably low uniqueness levels, especially for longitudinal data. Detailed guidelines and disclosure policies on how to ensure that the re-identification risk is low are provided. A large percentage of Montreal residents are unique on basic demographics. For non-longitudinal data sets, the three-character postal code, gender, and month/year of birth represent sufficiently low re-identification risk. Data custodians need to generalize their demographic information further for longitudinal data sets.
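As a rough illustration of the uniqueness measure described in the abstract, the sketch below counts the share of records whose combination of postal code, date of birth, and gender occurs exactly once, first on the full values and then on the generalization the abstract mentions (three-character postal code, month/year of birth, gender). The function names, field names, and the five toy records are hypothetical and are not taken from the paper. The sketch also measures uniqueness directly within the given records, which is simpler than estimating population uniqueness from a 25% sample as the study did.

```python
from collections import Counter


def uniqueness(records, key):
    """Fraction of records whose quasi-identifier combination occurs exactly once."""
    keys = [key(r) for r in records]
    if not keys:
        return 0.0
    counts = Counter(keys)
    return sum(1 for k in keys if counts[k] == 1) / len(keys)


def full_key(r):
    # Full postal code, full date of birth, gender.
    return (r["postal_code"], r["dob"], r["gender"])


def generalized_key(r):
    # Three-character postal code, year and month of birth, gender.
    return (r["postal_code"][:3], r["dob"][:7], r["gender"])


# Hypothetical toy records (illustrative values only, not study data).
records = [
    {"postal_code": "H2X 1Y4", "dob": "1970-03-14", "gender": "F"},
    {"postal_code": "H2X 3K2", "dob": "1970-03-02", "gender": "F"},
    {"postal_code": "H3A 0G4", "dob": "1985-11-30", "gender": "M"},
    {"postal_code": "H3A 2T5", "dob": "1985-11-09", "gender": "M"},
    {"postal_code": "H4B 1R6", "dob": "1992-07-21", "gender": "F"},
]

full = uniqueness(records, full_key)
generalized = uniqueness(records, generalized_key)
print(f"unique on full values:        {full:.0%}")
print(f"unique on generalized values: {generalized:.0%}")
```

In this toy data every record is unique on the full values, while generalization leaves only one of the five records unique. A longitudinal variant could, under the same assumptions, build the key from the tuple of a person's postal codes across several years, which tends to make combinations rarer and so raises uniqueness.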