Comparison of convolutional neural networks for detecting large vessel occlusion on computed tomography angiography
- 22 August 2021
- journal article
- research article
- Published by Wiley in Medical Physics
- Vol. 48 (10), 6060-6068
- https://doi.org/10.1002/mp.15122
Abstract
Purpose Artificial intelligence diagnosis and triage of large vessel occlusion may quicken clinical response for a subset of time-sensitive acute ischemic stroke patients, improving outcomes. Differences in architectural elements within data-driven convolutional neural network (CNN) models impact performance. Foreknowledge of effective model architectural elements for domain-specific problems can narrow the search for candidate models and inform strategic model design and adaptation to optimize performance on available data. Here, we study CNN architectures with a range of learnable parameters and which span the inclusion of architectural elements, such as parallel processing branches and residual connections with varying methods of recombining residual information. Methods We compare five CNNs: ResNet-50, DenseNet-121, EfficientNet-B0, PhiNet, and an Inception module-based network, on a computed tomography angiography large vessel occlusion detection task. The models were trained and preliminarily evaluated with 10-fold cross-validation on preprocessed scans (n = 240). An ablation study was performed on PhiNet due to superior cross-validated test performance across accuracy, precision, recall, specificity, and F1 score. The final evaluation of all models was performed on a withheld external validation set (n = 60) and these predictions were subsequently calibrated with sigmoid curves. Results Uncalibrated results on the withheld external validation set show that DenseNet-121 had the best average performance on accuracy, precision, recall, specificity, and F1 score. After calibration DenseNet-121 maintained superior performance on all metrics except recall. Conclusions The number of learnable parameters in our five models and best-ablated PhiNet directly related to cross-validated test performance-the smaller the model the better. However, this pattern did not hold when looking at generalization on the withheld external validation set. DenseNet-121 generalized the best; we posit this was due to its heavy use of residual connections utilizing concatenation, which causes feature maps from earlier layers to be used deeper in the network, while aiding in gradient flow and regularization.Keywords
Funding Information
- Patient-Centered Outcomes Research Institute (CDRN‐1306‐04869)
- National Institutes of Health (R01GM120484)
- National Center for Research Resources (1UL1RR024975‐01)
- National Center for Advancing Translational Sciences (2UL1TR000445‐06)
This publication has 24 references indexed in Scilit:
- Epidemiology, Natural History, and Clinical Presentation of Large Vessel Ischemic StrokeNeurosurgery, 2019
- Machine Learning in Acute Ischemic Stroke NeuroimagingFrontiers in Neurology, 2018
- Structurally-Sensitive Multi-Scale Deep Neural Network for Low-Dose CT DenoisingIEEE Access, 2018
- Classifying magnetic resonance image modalities with convolutional neural networksPublished by SPIE-Intl Soc Optical Eng ,2018
- Ischemic Strokes Due to Large-Vessel Occlusions Contribute Disproportionately to Stroke-Related Dependence and Death: A ReviewFrontiers in Neurology, 2017
- Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challengeMedical Image Analysis, 2017
- Densely Connected Convolutional NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2017
- Deep Residual Learning for Image RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2016
- Validated automatic brain extraction of head CT imagesNeuroImage, 2015
- Going deeper with convolutionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015