Developing and validating COVID-19 adverse outcome risk prediction models from a bi-national European cohort of 5594 patients
Open Access
- 5 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Scientific Reports
- Vol. 11 (1), 1-12
- https://doi.org/10.1038/s41598-021-81844-x
Abstract
Patients with severe COVID-19 have overwhelmed healthcare systems worldwide. We hypothesized that machine learning (ML) models could be used to predict risks at different stages of management and thereby provide insights into drivers and prognostic markers of disease progression and death. From a cohort of approximately 2.6 million citizens in Denmark, SARS-CoV-2 PCR tests were performed on subjects suspected of having COVID-19; 3944 cases had at least one positive test and were subjected to further analysis. SARS-CoV-2 positive cases from the United Kingdom Biobank were used for external validation. The ML models predicted the risk of death with a receiver operating characteristic area under the curve (ROC-AUC) of 0.906 at diagnosis, 0.818 at hospital admission and 0.721 at intensive care unit (ICU) admission. Similar metrics were achieved for predicted risks of hospital and ICU admission and use of mechanical ventilation. Common risk factors included age, body mass index and hypertension, although the top risk features shifted towards markers of shock and organ dysfunction in ICU patients. External validation indicated fair predictive performance for mortality, but suboptimal performance for predicting ICU admission. ML may be used to identify drivers of progression to more severe disease and for prognostication in patients with COVID-19. We provide access to an online risk calculator based on these findings.
Funding Information
- Novo Nordisk Fonden (#NNF20SA0062879)