Classification and Mutation Prediction from Non-Small Cell Lung Cancer Histopathology Images using Deep Learning

Open Access

3 October 2017

preprint content
Published by Cold Spring Harbor Laboratory

p. 197574
https://doi.org/10.1101/197574

Abstract

Visual analysis of histopathology slides of lung tissue is one of the main methods pathologists use to assess the stage, type and sub-type of lung cancers. Adenocarcinoma and squamous cell carcinoma are the two most prevalent sub-types of lung cancer, but distinguishing them can be challenging and time-consuming even for the expert eye. In this study, we trained a deep convolutional neural network (CNN) model (Inception v3) on histopathology images obtained from The Cancer Genome Atlas (TCGA) to classify whole-slide pathology images as adenocarcinoma, squamous cell carcinoma or normal lung tissue. Our method slightly outperforms a human pathologist, achieving better sensitivity and specificity, with an average Area Under the Curve (AUC) of ∼0.97 on a held-out population of whole-slide scans. Furthermore, we trained the neural network to predict the mutation status of the ten most commonly mutated genes in lung adenocarcinoma. We found that mutations in six of these genes (STK11, EGFR, FAT1, SETBP1, KRAS and TP53) can be predicted from pathology images, with AUCs ranging from 0.733 to 0.856 on the held-out population. These findings suggest that deep learning models can offer both specialists and patients a fast, accurate and inexpensive means of detecting cancer types or gene mutations, and could thus have a significant impact on cancer treatment.
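The abstract reports performance as an AUC averaged over the three tissue classes. As an illustration only, the following minimal Python sketch shows one common way such a figure can be computed: a one-vs-rest AUC per class, then an unweighted mean. This is an assumption about the averaging scheme, not the authors' evaluation code; the function names (`roc_auc`, `macro_auc`), the class keys and the scores below are hypothetical.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random
    negative; tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def macro_auc(true_classes, class_scores, classes):
    """One-vs-rest AUC per class, plus their unweighted mean.

    true_classes: list of class names, one per slide.
    class_scores: list of dicts mapping class name -> predicted probability.
    """
    per_class = {}
    for c in classes:
        labels = [1 if t == c else 0 for t in true_classes]
        scores = [row[c] for row in class_scores]
        per_class[c] = roc_auc(labels, scores)
    return sum(per_class.values()) / len(per_class), per_class


# Hypothetical slide-level predictions (made-up numbers, not data from the study).
CLASSES = ["adenocarcinoma", "squamous", "normal"]
truth = ["adenocarcinoma", "squamous", "normal", "adenocarcinoma"]
preds = [
    {"adenocarcinoma": 0.80, "squamous": 0.15, "normal": 0.05},
    {"adenocarcinoma": 0.30, "squamous": 0.60, "normal": 0.10},
    {"adenocarcinoma": 0.10, "squamous": 0.10, "normal": 0.80},
    {"adenocarcinoma": 0.55, "squamous": 0.40, "normal": 0.05},
]
mean_auc, per_class_auc = macro_auc(truth, preds, CLASSES)
```

The same `roc_auc` function applies unchanged to a binary mutation predictor, with labels marking mutated versus wild-type cases for a single gene.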
