Short‐term prediction of mortality in patients with systemic lupus erythematosus: Classification of outcomes using random forests

Open Access

6 February 2006

journal article
research article
Published by Wiley in Arthritis Care & Research

Vol. 55 (1), 74-80
https://doi.org/10.1002/art.21695

Abstract

Objective To identify demographic and clinical characteristics that classify patients with systemic lupus erythematosus (SLE) at risk for in-hospital mortality. Methods Patients hospitalized in California from 1996 to 2000 with a principal diagnosis of SLE (N = 3,839) were identified from a state hospitalization database. As candidate predictors of mortality, we used patient demographic characteristics; the presence or absence of 40 different clinical conditions listed among the discharge diagnoses; and 2 summary indexes derived from the discharge diagnoses, the Charlson Index and the SLE Comorbidity Index. Predictors of patients at increased risk of mortality were identified and validated using random forests, a statistical procedure that is a generalization of single classification trees. Random forests use bootstrapped samples of patients and randomly selected subsets of predictors to create individual classification trees, and this process is repeated to generate multiple trees (a forest). Classification is then done by majority vote across all trees. Results Of the 3,839 patients, 109 died during hospitalization. Selecting from all available predictors, the random forests had excellent predictive accuracy for classification of death. The mean classification error rate, averaged over 10 forests of 500 trees each, was 11.9%. The most important predictors were the Charlson Index, respiratory failure, SLE Comorbidity Index, age, sepsis, nephritis, and thrombocytopenia. Conclusion Information on clinical diagnoses can be used to accurately predict mortality among hospitalized patients with SLE. Random forests represent a useful technique to identify the most important predictors from a larger (often much larger) number and to validate the classification.

Keywords

This publication has 32 references indexed in Scilit:

Accuracy of identification of patients with immune thrombocytopenic purpura through administrative records: A data validation study
American Journal of Hematology, 2003
Hospitalizations for coronary artery disease among patients with systemic lupus erythematosus
Arthritis & Rheumatism, 2003
Contribution of the initial features of systemic lupus erythematosus to the clinical evolution and survival of a cohort of Mediterranean patients
Annals Of The Rheumatic Diseases, 2003
SLICC/ACR DAMAGE INDEX IS VALID, AND RENAL AND PULMONARY ORGAN SCORES ARE PREDICTORS OF SEVERE OUTCOME IN PATIENTS WITH SYSTEMIC LUPUS ERYTHEMATOSUS
Rheumatology, 1996
Bias in the Coding of Hospital Discharge Data and Its Implications for Quality Assessment
Medical Care, 1994
Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases
Journal of Clinical Epidemiology, 1992
Using Clinical Variables to Estimate the Risk of Patient Mortality
Medical Care, 1991
Predicting In-Hospital Survival of Myocardial Infarction
Medical Care, 1990
Prognosis in systemic lupus erythematosus
Arthritis & Rheumatism, 1990
Risk adjustment in claims-based research: The search for efficient approaches
Journal of Clinical Epidemiology, 1989

Cited by 77 articles