Discovering Disease Associations by Integrating Electronic Clinical Data and Medical Literature
Open Access
- 23 June 2011
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 6 (6), e21132
- https://doi.org/10.1371/journal.pone.0021132
Abstract
Electronic health record (EHR) systems offer an exceptional opportunity for studying many diseases and their associated medical conditions within a population. The increasing number of clinical record entries that have become available electronically provides access to rich, large sets of patients' longitudinal medical information. By integrating and comparing relations found in the EHRs with those already reported in the literature, we are able to verify existing associations and to identify rare or novel ones. Of particular interest is the identification of rare disease co-morbidities, where the small numbers of diagnosed patients make robust statistical analysis difficult. Here, we introduce ADAMS, an Application for Discovering Disease Associations using Multiple Sources, which contains various statistical and language processing operations. We apply ADAMS to the New York-Presbyterian Hospital's EHR to combine the information from the relational diagnosis tables and textual discharge summaries with that from PubMed and Wikipedia in order to investigate the co-morbidities of the rare diseases Kaposi sarcoma, toxoplasmosis, and Kawasaki disease. In addition to finding well-known characteristics of diseases, ADAMS can identify rare or previously unreported associations. In particular, we report a statistically significant association between Kawasaki disease and diagnosis of autistic disorder.