Presentation + Paper
4 October 2023 Switchable CNNs for in-loop restoration and super-resolution for AV2
Urvang Joshi, Yue Chen, Innfarn Yoo, Shan Li, Feng Yang, Debargha Mukherjee
Author Affiliations +
Abstract
In video codecs, CNN-based models have shown huge promise in two related tasks: in-loop restoration and frame super-resolution. In our previous work, we presented a framework that uses a common CNN architecture with downloadable model parameters for both these tasks with a preliminary performance study, where encoderside selection of scale factor was left as future work. The advantage of a common architecture with switchable parameters is that a single hardware inference engine can be utilized in all cases of same-resolution and super-resolution restoration, thereby limiting implementation costs. In this paper, we fully integrate this framework into the under-development AV2 video codec from the Alliance for Open Media (AOM). We also implement an algorithm for encoder-side selection of the super-resolution scale factor. With this implementation, we are able to achieve combined compression improvement up to −3.5% (AI) and −3.9% (RA) in BDRATE PSNR-Y and up to −7.8% (AI) and −7.9% (RA) in BDRATE VMAF, with inference cost as low as 1500 MACs/pixel.
Conference Presentation
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Urvang Joshi, Yue Chen, Innfarn Yoo, Shan Li, Feng Yang, and Debargha Mukherjee "Switchable CNNs for in-loop restoration and super-resolution for AV2", Proc. SPIE 12674, Applications of Digital Image Processing XLVI, 126740I (4 October 2023); https://doi.org/10.1117/12.2681954
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Super resolution

Video

Video coding

Image compression

Network architectures

Video compression

RELATED CONTENT

Perceptually-inspired super-resolution of compressed videos
Proceedings of SPIE (September 06 2019)
Video coding algorithm based on singularities reconstruction
Proceedings of SPIE (October 13 1998)
Compression of full-parallax integral 3D-TV image data
Proceedings of SPIE (May 15 1997)
A new video codec based on 3D DTCWT and vector...
Proceedings of SPIE (October 01 2011)
Standards-compliant watermarking for access management
Proceedings of SPIE (November 12 2001)

Back to Top