International Validation Study for Interim PET in ABVD-Treated, Advanced-Stage Hodgkin Lymphoma: Interpretation Criteria and Concordance Rate Among Reviewers
Open Access
- 20 March 2013
- journal article
- research article
- Published by Society of Nuclear Medicine in Journal of Nuclear Medicine
- Vol. 54 (5), 683-690
- https://doi.org/10.2967/jnumed.112.110890
Abstract
At present, there are no standard criteria that have been validated for interim PET reporting in lymphoma. In 2009, an international workshop attended by hematologists and nuclear medicine experts in Deauville, France, proposed to develop simple and reproducible rules for interim PET reporting in lymphoma. Accordingly, an international validation study was undertaken with the primary aim of validating the prognostic role of interim PET using the Deauville 5-point score to evaluate images and with the secondary aim of measuring concordance rates among reviewers using the same 5-point score. This paper focuses on the criteria for interpretation of interim PET and on concordance rates. Methods: A cohort of advanced-stage Hodgkin lymphoma patients treated with doxorubicin, bleomycin, vinblastine, and dacarbazine (ABVD) were enrolled retrospectively from centers worldwide. Baseline and interim scans were reviewed by an international panel of 6 nuclear medicine experts using the 5-point score. Results: Complete scan datasets of acceptable diagnostic quality were available for 260 of 440 (59%) enrolled patients. Independent agreement among reviewers was reached on 252 of 260 patients (97%), for whom at least 4 reviewers agreed the findings were negative (score of 1–3) or positive (score of 4–5). After discussion, consensus was reached in all cases. There were 45 of 260 patients (17%) with positive interim PET findings and 215 of 260 patients (83%) with negative interim PET findings. Thirty-three interim PET–positive scans were true-positive, and 12 were false-positive. Two hundred three interim PET–negative scans were true-negative, and 12 were false-negative. Sensitivity, specificity, and accuracy were 0.73, 0.94, and 0.91, respectively. Negative predictive value and positive predictive value were 0.94 and 0.73, respectively. The 3-y failure-free survival was 83%, 28%, and 95% for the entire population and for interim PET–positive and –negative patients, respectively (P < 0.0001). The agreement between pairs of reviewers was good or very good, ranging from 0.69 to 0.84 as measured with the Cohen kappa. Overall agreement was good at 0.76 as measured with the Krippendorf α. Conclusion: The 5-point score proposed at Deauville for reviewing interim PET scans in advanced Hodgkin lymphoma is accurate and reproducible enough to be accepted as a standard reporting criterion in clinical practice and for clinical trials.This publication has 16 references indexed in Scilit:
- Interim positron emission tomography scans in diffuse large B-cell lymphoma: an independent expert nuclear medicine evaluation of the Eastern Cooperative Oncology Group E3404 studyBlood, 2010
- FDG PET and PET/CT: EANM procedure guidelines for tumour PET imaging: version 1.0European Journal of Nuclear Medicine and Molecular Imaging, 2009
- Fluorine-18-Fluorodeoxyglucose Positron Emission Tomography for Interim Response Assessment of Advanced-Stage Hodgkin's Lymphoma and Diffuse Large B-Cell Lymphoma: A Systematic ReviewJournal of Clinical Oncology, 2009
- Report on the First International Workshop on interim-PET scan in lymphomaLeukemia & Lymphoma, 2009
- Early Interim 2-[18F]Fluoro-2-Deoxy-D-Glucose Positron Emission Tomography Is Prognostically Superior to International Prognostic Score in Advanced-Stage Hodgkin's Lymphoma: A Report From a Joint Italian-Danish StudyJournal of Clinical Oncology, 2007
- Answering the Call for a Standard Reliability Measure for Coding DataCommunication Methods and Measures, 2007
- Revised Response Criteria for Malignant LymphomaJournal of Clinical Oncology, 2007
- FDG-PET after two cycles of chemotherapy predicts treatment failure and progression-free survival in Hodgkin lymphomaBlood, 2006
- A Coefficient of Agreement for Nominal ScalesEducational and Psychological Measurement, 1960
- Nonparametric Estimation from Incomplete ObservationsJournal of the American Statistical Association, 1958