A System for Automated Lexical Mapping
Open Access
- 1 May 2006
- journal article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 13 (3), 334-343
- https://doi.org/10.1197/jamia.m1823
Abstract
Objective: To automate the mapping of disparate databases to standardized medical vocabularies. Background: Merging of clinical systems and medical databases, or aggregation of information from disparate databases, frequently requires a process whereby vocabularies are compared and similar concepts are mapped. Design: Using a normalization phase followed by a novel alignment stage inspired by DNA sequence alignment methods, automated lexical mapping can map terms from various databases to standard vocabularies such as the UMLS (Unified Medical Language System) and LOINC (Logical Observation Identifier Names and Codes). Measurements: This automated lexical mapping was evaluated using three real-world laboratory databases from different health care institutions. The authors report the sensitivity, specificity, percentage correct (true positives plus true negatives divided by total number of terms), and true positive and true negative rates as measures of system performance. Results: The alignment algorithm was able to map 57% to 78% (average of 63% over all runs and databases) of equivalent concepts through lexical mapping alone. True positive rates ranged from 18% to 70%; true negative rates ranged from 5% to 52%. Conclusion: Lexical mapping can facilitate the integration of data from diverse sources and decrease the time and cost required for manual mapping and integration of clinical systems and medical databases.
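The two-stage design named in the abstract (normalization, then an alignment step inspired by DNA sequence alignment) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give their normalization rules, scoring scheme, or thresholds. The abbreviation table, the function names `normalize`, `local_alignment_score`, and `best_match`, the scoring parameters, and the threshold are all assumptions made for illustration, and a token-level Smith-Waterman local alignment stands in for whatever alignment method the paper actually uses.

```python
import re

# Hypothetical abbreviation table; the paper's real normalization rules
# are not described in the abstract.
ABBREVIATIONS = {"hgb": "hemoglobin", "ser": "serum", "gluc": "glucose"}


def normalize(term):
    """Lowercase, drop punctuation, and expand known abbreviations."""
    tokens = re.findall(r"[a-z0-9]+", term.lower())
    return [ABBREVIATIONS.get(token, token) for token in tokens]


def local_alignment_score(a, b, match=2, mismatch=-1, gap=-1):
    """Smith-Waterman local alignment score over two token lists.

    The match, mismatch, and gap values are assumed, not taken from the paper.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            step = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                0,                        # start a new local alignment
                score[i - 1][j - 1] + step,  # align the two tokens
                score[i - 1][j] + gap,    # skip a token in a
                score[i][j - 1] + gap,    # skip a token in b
            )
            best = max(best, score[i][j])
    return best


def best_match(local_term, vocabulary, threshold=0.5, match=2):
    """Return (candidate, ratio) for the best-scoring vocabulary entry,
    or (None, ratio) if no candidate reaches the assumed threshold."""
    query = normalize(local_term)
    best_name, best_ratio = None, 0.0
    for candidate in vocabulary:
        target = normalize(candidate)
        longest = max(len(query), len(target))
        if longest == 0:
            continue
        # Scale the raw score by the best score a full-length match could earn.
        raw = local_alignment_score(query, target, match=match)
        ratio = raw / (match * longest)
        if ratio > best_ratio:
            best_name, best_ratio = candidate, ratio
    if best_ratio < threshold:
        return None, best_ratio
    return best_name, best_ratio


if __name__ == "__main__":
    standard_terms = ["Hemoglobin blood", "Glucose serum"]
    print(best_match("Gluc, Ser", standard_terms))
```

Aligning tokens rather than characters keeps the example short. A character-level alignment would tolerate misspellings better, at the cost of more spurious partial matches.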