Phenotypic signatures in clinical data enable systematic identification of patients for genetic testing
- 3 June 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Medicine
- Vol. 27 (6), 1097-1104
- https://doi.org/10.1038/s41591-021-01356-z
Abstract
Around 5% of the population is affected by a rare genetic disease, yet most endure years of uncertainty before receiving a genetic test. A common feature of genetic diseases is the presence of multiple rare phenotypes that often span organ systems. Here, we use diagnostic billing information from longitudinal clinical data in the electronic health records (EHRs) of 2,286 patients who received a chromosomal microarray test, and 9,144 matched controls, to build a model to predict who should receive a genetic test. The model achieved high prediction accuracies in a held-out test sample (area under the receiver operating characteristic curve (AUROC), 0.97; area under the precision–recall curve (AUPRC), 0.92), in an independent hospital system (AUROC, 0.95; AUPRC, 0.62), and in an independent set of 172,265 patients in which cases were broadly defined as having an interaction with a genetics provider (AUROC, 0.9; AUPRC, 0.63). Patients carrying a putative pathogenic copy number variant were also accurately identified by the model. Compared with current approaches for genetic test determination, our model could identify more patients for testing while also increasing the proportion of those tested who have a genetic disease. We demonstrate that phenotypic patterns representative of a wide range of genetic diseases can be captured from EHRs to systematize decision-making for genetic testing, with the potential to speed up diagnosis, improve care and reduce costs.Funding Information
- U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (R01MH113362, R01MH111776, R01MH113362)
- U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (U01HG009068)
- U.S. Department of Health & Human Services | NIH | U.S. National Library of Medicine (R01LM010685)
This publication has 22 references indexed in Scilit:
- Clinical Genetic Testing in Autism Spectrum Disorder in a Large Community-Based Population SampleJAMA Psychiatry, 2020
- Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet databaseEuropean Journal of Human Genetics, 2019
- The burden of rare diseasesAmerican Journal of Medical Genetics Part A, 2019
- Barriers to Lynch Syndrome Testing and Preoperative Result Availability in Earlyonset Colorectal Cancer: A National Physician Survey StudyClinical and Translational Gastroenterology, 2018
- Resolution of Disease Phenotypes Resulting from Multilocus Genomic VariationThe New England Journal of Medicine, 2017
- Where genotype is not predictive of phenotype: towards an understanding of the molecular basis of reduced penetrance in human inherited diseaseHuman Genetics, 2013
- Phenotypic Heterogeneity of Genomic Disorders and Rare Copy-Number VariantsThe New England Journal of Medicine, 2012
- Barriers to the use of genetic testing: A study of racial and ethnic disparitiesGenetics in Medicine, 2009
- Mendelian Inheritance in Man and Its Online Version, OMIMAmerican Journal of Human Genetics, 2007
- Parental attitudes toward a diagnosis in children with unidentified multiple congenital anomaly syndromesAmerican Journal of Medical Genetics, 2001