An end-to-end deep context gate convolutional visual odometry system based on lightweight attention mechanism

13 September 2021

journal article
research article
Published by Emerald in Industrial Robot: the international journal of robotics research and application

Vol. 49 (1), 47-53
https://doi.org/10.1108/ir-01-2021-0019

Abstract

Purpose: Conventional learning-based visual odometry (VO) systems usually use convolutional neural networks (CNNs) to extract features, so some important context-related and attention-holding global features might be ignored. Without these essential global features, a VO system is sensitive to various environmental perturbations. The purpose of this paper is to design a novel learning-based framework that improves the accuracy of learning-based VO without decreasing its generalization ability.

Design/methodology/approach: Instead of a conventional CNN, a context-gated convolution is adopted to build an end-to-end learning framework. It enables convolutional layers to dynamically capture representative local patterns and compose local features of interest under the guidance of global context. In addition, an attention mechanism module is introduced to further improve the learning ability and enhance the robustness and generalization ability of the VO system.

Findings: The proposed system is evaluated on the public data set KITTI and on self-collected data sets of our college building, where it shows competitive performance compared with some classical and state-of-the-art learning-based methods. Quantitative experimental results on KITTI show that, compared with CNN-based VO methods, the average translational error and rotational error over all the test sequences are reduced by 45.63% and 37.22%, respectively.

Originality/value: The main contribution of this paper is an end-to-end deep context gate convolutional VO system based on a lightweight attention mechanism, which effectively improves accuracy compared with other learning-based methods.
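The general idea of letting global context modulate local convolutional features can be illustrated with a small sketch. The code below is not the architecture from the paper, which the abstract does not specify. It is a generic channel-gating example in NumPy, assuming a single feature map of shape (C, H, W). The function name `channel_gate`, the bottleneck ratio `r` and the random weights `w1` and `w2` are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, maps any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_gate(features, w1, w2):
    """Rescale each channel of a (C, H, W) feature map using global context.

    Illustrative only: w1 has shape (C // r, C) and w2 has shape (C, C // r),
    where r is an assumed bottleneck ratio.
    """
    # Global context: one spatial average per channel, shape (C,)
    context = features.mean(axis=(1, 2))
    # Small bottleneck with ReLU, then a sigmoid to get per-channel gates
    hidden = np.maximum(w1 @ context, 0.0)
    gate = sigmoid(w2 @ hidden)  # shape (C,), values in (0, 1)
    # Broadcast the gate over the spatial dimensions
    return features * gate[:, None, None], gate


# Toy input with random weights (no trained model is implied)
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
features = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1

gated, gate = channel_gate(features, w1, w2)
print(gated.shape)
```

Here every channel is scaled by a value in (0, 1) derived from the whole feature map, so the local responses are weighted by a global summary. A context-gated convolution goes further by modulating the convolution kernels themselves, which this sketch does not attempt.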
