DPANet: Dual Pooling‐aggregated Attention Network for fish segmentation

Open Access

10 September 2021

journal article
research article
Published by Institution of Engineering and Technology (IET) in IET Computer Vision

Vol. 16 (1), 67-82
https://doi.org/10.1049/cvi2.12065

Abstract

The sustainable development of marine fisheries depends on accurate measurement of fish stocks. Semantic segmentation methods based on deep learning can automatically produce segmentation masks of fish in images, from which measurement data can be derived. However, general semantic segmentation methods cannot accurately segment fish in underwater images. This study proposes a Dual Pooling-aggregated Attention Network (DPANet) that adaptively captures long-range dependencies in an efficient, computation-friendly manner to enhance feature representation and improve segmentation performance. Specifically, a novel pooling-aggregate position attention module and a pooling-aggregate channel attention module are designed to aggregate context in the spatial dimension and the channel dimension, respectively. These two modules use pooling operations along the channel dimension and along the spatial dimension, respectively, to aggregate information, thereby reducing computational cost. In these modules, attention maps are generated along four different paths and then aggregated into one. Extensive experiments validate the effectiveness of DPANet, which achieves new state-of-the-art segmentation performance on the well-known fish image dataset DeepFish and on the underwater image dataset SUIM, with Mean IoU scores of 91.08% and 85.39%, respectively, while reducing the FLOPs of the attention modules by about 93%.
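
The Mean IoU metric reported above can be illustrated with a minimal sketch. It assumes the common definition (per-class intersection over union, averaged over the classes that occur in either mask); the abstract does not state the exact evaluation protocol used for DeepFish or SUIM, so the function name `mean_iou` and the handling of absent classes are assumptions made for illustration only.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union between two label masks.

    pred and target are equally sized 2-D lists of integer class labels
    in the range [0, num_classes). Classes that appear in neither mask
    are skipped (an assumption; evaluation protocols differ on this).
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                # Pixel agrees: it lies in both the intersection and the union.
                intersection[p] += 1
                union[p] += 1
            else:
                # Pixel disagrees: it extends the union of both classes involved.
                union[p] += 1
                union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: class 0 is background, class 1 is fish.
predicted = [[0, 0, 1],
             [0, 1, 1]]
ground_truth = [[0, 1, 1],
                [0, 1, 1]]
score = mean_iou(predicted, ground_truth, num_classes=2)
```

In the toy example one fish pixel is predicted as background, giving an IoU of 2/3 for the background class and 3/4 for the fish class, so the mean is 17/24.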

This publication has 66 references indexed in Scilit.

Cited by 14 articles