Privacy-preserving record linkage using Bloom filters
Open Access
- 25 August 2009
- journal article
- Published by Springer Science and Business Media LLC in BMC Medical Informatics and Decision Making
- Vol. 9 (1), 41
- https://doi.org/10.1186/1472-6947-9-41
Abstract
Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns. A new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers. Tests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings. We proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage.Keywords
This publication has 35 references indexed in Scilit:
- Privacy preserving record linkage approachesInternational Journal of Data Mining, Modelling and Management, 2009
- Privacy-Preserving String Comparisons in Record Linkage Systems: A ReviewInformation Security Journal: A Global Perspective, 2008
- A Distributed Patient Identification Protocol Based on Control Numbers with Semantic AnnotationInternational Journal on Semantic Web and Information Systems, 2005
- Network Applications of Bloom Filters: A SurveyInternet Mathematics, 2004
- Business Survey MethodsTechnometrics, 1996
- Tolerating spelling errors during patient validationComputers and Biomedical Research, 1992
- SEARCHING FOR HISTORICAL WORD FORMS IN TEXT DATABASES USING SPELLING‐CORRECTION METHODS: REVERSE ERROR AND PHONETIC CODING METHODSJournal of Documentation, 1991
- Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, FloridaJournal of the American Statistical Association, 1989
- Space/time trade-offs in hash coding with allowable errorsCommunications of the ACM, 1970
- Measures of the Amount of Ecologic Association Between SpeciesEcology, 1945