Classification and Mutation Prediction from Non-Small Cell Lung Cancer Histopathology Images using Deep Learning
Open Access
- 3 October 2017
- preprint content
- other
- Published by Cold Spring Harbor Laboratory
- p. 197574
- https://doi.org/10.1101/197574
Abstract
Visual analysis of histopathology slides of lung cell tissues is one of the main methods used by pathologists to assess the stage, types and sub-types of lung cancers. Adenocarcinoma and squamous cell carcinoma are two most prevalent sub-types of lung cancer, but their distinction can be challenging and time-consuming even for the expert eye. In this study, we trained a deep learning convolutional neural network (CNN) model (inception v3) on histopathology images obtained from The Cancer Genome Atlas (TCGA) to accurately classify whole-slide pathology images into adenocarcinoma, squamous cell carcinoma or normal lung tissue. Our method slightly outperforms a human pathologist, achieving better sensitivity and specificity, with ∼0.97 average Area Under the Curve (AUC) on a held-out population of whole-slide scans. Furthermore, we trained the neural network to predict the ten most commonly mutated genes in lung adenocarcinoma. We found that six of these genes – STK11, EGFR, FAT1, SETBP1, KRAS and TP53 – can be predicted from pathology images with an accuracy ranging from 0.733 to 0.856, as measured by the AUC on the held-out population. These findings suggest that deep learning models can offer both specialists and patients a fast, accurate and inexpensive detection of cancer types or gene mutations, and thus have a significant impact on cancer treatment.Keywords
This publication has 21 references indexed in Scilit:
- Deep Learning in Medical Image AnalysisAnnual Review of Biomedical Engineering, 2017
- Oncology Drug Approvals: Evaluating Endpoints and Evidence in an Era of Breakthrough TherapiesThe Oncologist, 2017
- Accurate and reproducible invasive breast cancer detection in whole-slide images: A Deep Learning approach for quantifying tumor extentScientific Reports, 2017
- Comprehensive Computational Pathological Image Analysis Predicts Lung Cancer PrognosisJournal of Thoracic Oncology, 2016
- Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image featuresNature Communications, 2016
- Molecular characterization of pulmonary sarcomatoid carcinoma: analysis of 33 casesLaboratory Investigation, 2016
- Guest Editorial Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New TechniqueIEEE Transactions on Medical Imaging, 2016
- Deep learning in neural networks: An overviewNeural Networks, 2015
- Selumetinib plus docetaxel for KRAS-mutant advanced non-small-cell lung cancer: a randomised, multicentre, placebo-controlled, phase 2 studyThe Lancet Oncology, 2013
- A Fast Learning Algorithm for Deep Belief NetsNeural Computation, 2006