Video super‐resolution with non‐local alignment network
Open Access
- 11 February 2021
- journal article
- research article
- Published by Institution of Engineering and Technology (IET) in IET Image Processing
- Vol. 15 (8), 1655-1667
- https://doi.org/10.1049/ipr2.12134
Abstract
Video super‐resolution (VSR) aims at recovering high‐resolution frames from their low‐resolution counterparts. Over the past few years, deep neural networks have dominated the video super‐resolution task because of its strong non‐linear representational ability. To exploit temporal correlations, most deep neural networks have to face two challenges: (1) how to align consecutive frames containing motions, occlusions and blurring, and establish accurate temporal correspondences, (2) how to effectively fuse aligned frames and balance their contributions. In this work, a novel video super‐resolution network, named NLVSR, is proposed to solve above problems in an efficient and effective manner. For alignment, a temporal‐spatial non‐local operation is employed to align each frame to the reference frame. Compared with existing alignment approaches, the proposed temporal‐spatial non‐local operation is able to integrate the global information of each frame by a weighted sum, leading to a better performance in alignment. For fusion, an attention‐based progressive fusion framework was designed to integrate aligned frames gradually. To penalize the points with low‐quality in aligned features, an attention mechanism was employed for a robust reconstruction. Experimental results demonstrate the superiority of the proposed network in terms of quantitative and qualitative evaluation, and surpasses other state‐of‐the‐art methods by 0.33 dB at least.Keywords
This publication has 59 references indexed in Scilit:
- Image Super-Resolution Using Deep Convolutional NetworksIEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
- Deep learningNature, 2015
- Facial Image Hallucination Through Coupled-Layer Neighbor EmbeddingIEEE Transactions on Circuits and Systems for Video Technology, 2015
- Deep learning in neural networks: An overviewNeural Networks, 2015
- Super-resolution: a comprehensive surveyMachine Vision and Applications, 2014
- Image Interpolation via Graph-Based Bayesian Label PropagationIEEE Transactions on Image Processing, 2013
- A Comprehensive Survey to Face HallucinationInternational Journal of Computer Vision, 2013
- PatchMatchACM Transactions on Graphics, 2009
- High Accuracy Optical Flow Estimation Based on a Theory for WarpingLecture Notes in Computer Science, 2004
- Example-based super-resolutionIEEE Computer Graphics and Applications, 2002