Comparison of convolutional neural networks for detecting large vessel occlusion on computed tomography angiography

22 August 2021

journal article
research article
Published by Wiley in Medical Physics

Vol. 48 (10), 6060-6068
https://doi.org/10.1002/mp.15122

Abstract

Purpose: Artificial intelligence diagnosis and triage of large vessel occlusion may quicken clinical response for a subset of time-sensitive acute ischemic stroke patients, improving outcomes. Differences in architectural elements within data-driven convolutional neural network (CNN) models impact performance. Foreknowledge of effective model architectural elements for domain-specific problems can narrow the search for candidate models and inform strategic model design and adaptation to optimize performance on available data. Here, we study CNN architectures that span a range of learnable parameter counts and differ in their architectural elements, such as parallel processing branches and residual connections with varying methods of recombining residual information.

Methods: We compare five CNNs (ResNet-50, DenseNet-121, EfficientNet-B0, PhiNet, and an Inception module-based network) on a computed tomography angiography large vessel occlusion detection task. The models were trained and preliminarily evaluated with 10-fold cross-validation on preprocessed scans (n = 240). An ablation study was performed on PhiNet because of its superior cross-validated test performance across accuracy, precision, recall, specificity, and F1 score. The final evaluation of all models was performed on a withheld external validation set (n = 60), and these predictions were subsequently calibrated with sigmoid curves.

Results: Uncalibrated results on the withheld external validation set show that DenseNet-121 had the best average performance on accuracy, precision, recall, specificity, and F1 score. After calibration, DenseNet-121 maintained superior performance on all metrics except recall.

Conclusions: The number of learnable parameters in our five models and the best-ablated PhiNet was directly related to cross-validated test performance: the smaller the model, the better. However, this pattern did not hold for generalization on the withheld external validation set. DenseNet-121 generalized the best; we posit this was due to its heavy use of residual connections utilizing concatenation, which causes feature maps from earlier layers to be used deeper in the network while aiding gradient flow and regularization.
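
The five metrics named in the abstract have standard definitions in terms of confusion-matrix counts. The following is a minimal sketch of those definitions, not the authors' evaluation code: the function names and the toy labels are hypothetical, labels are assumed to be 0 (no occlusion) or 1 (occlusion), and undefined ratios are reported as 0.0 by assumption.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def binary_metrics(y_true, y_pred):
    """Return accuracy, precision, recall, specificity, and F1 as a dict."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)

    def ratio(num, den):
        # Assumption: report 0.0 when a ratio is undefined (zero denominator).
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)  # also called sensitivity
    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "precision": precision,
        "recall": recall,
        "specificity": ratio(tn, tn + fp),
        "f1": ratio(2 * precision * recall, precision + recall),
    }


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    toy_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
    toy_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
    for name, value in binary_metrics(toy_true, toy_pred).items():
        print(f"{name}: {value:.3f}")
```

Recall and specificity are computed over the true positives and true negatives respectively, so a calibration step that shifts the decision threshold can raise one while lowering the other.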

Funding Information

Patient-Centered Outcomes Research Institute (CDRN‐1306‐04869)
National Institutes of Health (R01GM120484)
National Center for Research Resources (1UL1RR024975‐01)
National Center for Advancing Translational Sciences (2UL1TR000445‐06)

Cited by 6 articles