An Automated Line-of-Therapy Algorithm for Adults With Metastatic Non–Small Cell Lung Cancer: Validation Study Using Blinded Manual Chart Review (Preprint)

Preprint

22 March 2021

preprint
Published by JMIR Publications Inc.

https://doi.org/10.2196/preprints.29017

Abstract

BACKGROUND Extraction of line-of-therapy (LOT) information from electronic health record and claims data is essential for determining longitudinal changes in systemic anticancer therapy in real-world clinical settings. OBJECTIVE The aim of this retrospective cohort analysis is to validate and refine our previously described open-source LOT algorithm by comparing the output of the algorithm with results obtained through blinded manual chart review. METHODS We used structured electronic health record data and clinical documents to identify 500 adult patients treated for metastatic non–small cell lung cancer with systemic anticancer therapy from 2011 to mid-2018; we assigned patients to training (n=350) and test (n=150) cohorts, randomly divided proportional to the overall ratio of simple:complex cases (n=254:246). Simple cases were patients who received one LOT and no maintenance therapy; complex cases were patients who received more than one LOT and/or maintenance therapy. Algorithmic changes were performed using the training cohort data, after which the refined algorithm was evaluated against the test cohort. RESULTS For simple cases, 16 instances of discordance between the LOT algorithm and chart review prerefinement were reduced to 8 instances postrefinement; in the test cohort, there was no discordance between algorithm and chart review. For complex cases, algorithm refinement reduced the discordance from 68 to 62 instances, with 37 instances in the test cohort. The percentage agreement between LOT algorithm output and chart review for patients who received one LOT was 89% prerefinement, 93% postrefinement, and 93% for the test cohort, whereas the likelihood of precise matching between algorithm output and chart review decreased with an increasing number of unique regimens. Several areas of discordance that arose from differing definitions of LOTs and maintenance therapy could not be objectively resolved because of a lack of precise definitions in the medical literature. CONCLUSIONS Our findings identify common sources of discordance between the LOT algorithm and clinician documentation, providing the possibility of targeted algorithm refinement.

Keywords

Other Versions

Published version: Version JMIR Public Health and Surveillance, 9, preprints

This publication has 15 references indexed in Scilit:

Lung cancer
Nature, 2020
Visualization of Sequential Treatments in Metastatic Breast Cancer
JCO Clinical Cancer Informatics, 2020
Temporal phenotyping by mining healthcare data to derive lines of therapy for cancer
Journal of Biomedical Informatics, 2019
Global Epidemiology of Lung Cancer
Annals of Global Health, 2019
Precision Diagnosis and Treatment for Advanced Non–Small-Cell Lung Cancer
The New England Journal of Medicine, 2017
Real-world first-line treatment and overall survival in non-small cell lung cancer without known EGFR mutations or ALK rearrangements in US community oncology setting
PLOS ONE, 2017
Lung cancer: current therapies and new targeted treatments
The Lancet, 2016
Opportunities and challenges in leveraging electronic health record data in oncology
Future Oncology, 2016
Targeted therapies for treatment of non‐small cell lung cancer—Recent advances and future perspectives
International Journal of Cancer, 2015
Consensus recommendations for the uniform reporting of clinical trials: report of the International Myeloma Workshop Consensus Panel 1
Blood, 2011