Video super‐resolution with non‐local alignment network

Abstract
Video super-resolution (VSR) aims at recovering high-resolution frames from their low-resolution counterparts. Over the past few years, deep neural networks have dominated the video super-resolution task because of their strong non-linear representational ability. To exploit temporal correlations, most deep neural networks face two challenges: (1) how to align consecutive frames containing motion, occlusion, and blur, and establish accurate temporal correspondences; (2) how to effectively fuse the aligned frames and balance their contributions. In this work, a novel video super-resolution network, named NLVSR, is proposed to solve the above problems in an efficient and effective manner. For alignment, a temporal-spatial non-local operation is employed to align each frame to the reference frame. Compared with existing alignment approaches, the proposed temporal-spatial non-local operation integrates the global information of each frame through a weighted sum, leading to better alignment. For fusion, an attention-based progressive fusion framework is designed to integrate the aligned frames gradually. To penalize low-quality points in the aligned features, an attention mechanism is employed for robust reconstruction. Experimental results demonstrate the superiority of the proposed network in both quantitative and qualitative evaluation: it surpasses other state-of-the-art methods by at least 0.33 dB.
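The abstract does not give the exact form of the temporal-spatial non-local operation, but the description (each output position is a weighted sum over all positions of a neighboring frame, so global information is integrated during alignment) matches a standard non-local/attention block. The following is a minimal NumPy sketch under that assumption; the function name `nonlocal_align`, the dot-product similarity, and the scaling factor are illustrative choices, not the paper's specification.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def nonlocal_align(ref, neighbor, scale=None):
    """Align `neighbor` features to `ref` via a non-local weighted sum.

    ref, neighbor: (H, W, C) feature maps of the reference and a
    neighboring frame. Each output position is a softmax-weighted sum
    over ALL spatial positions of the neighbor frame, so the alignment
    draws on global rather than local information (hypothetical sketch
    of the paper's temporal-spatial non-local operation).
    """
    H, W, C = ref.shape
    q = ref.reshape(-1, C)        # (HW, C) queries from the reference frame
    k = neighbor.reshape(-1, C)   # (HW, C) keys from the neighbor frame
    v = k                         # values share the neighbor's features
    if scale is None:
        scale = 1.0 / np.sqrt(C)  # common dot-product scaling (assumption)
    # (HW, HW) soft correspondences between reference and neighbor positions.
    attn = softmax(q @ k.T * scale, axis=-1)
    aligned = attn @ v            # weighted sum over all neighbor positions
    return aligned.reshape(H, W, C)
```

Because each attention row sums to one, a spatially constant neighbor frame is reproduced exactly; for real features, positions of the neighbor most similar to the reference dominate the sum, which is what lets the operation handle large motion without explicit flow estimation.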
