Traditional vs. Neural Video Codecs: Compression Efficiency, Visual Artifacts, and Quality Analysis Beyond PSNR

  • Leandro Tavares UFPel
  • Ruhan Conceição UFPel
  • Victor Costa UFPel
  • Luciano Agostini UFPel
  • Marcelo Porto UFPel
  • Guilherme Corrêa UFPel

Resumo


Video compression technology is rapidly evolving, with end-to-end learned Neural Video Codecs (NVCs) emerging as powerful alternatives to traditional, block-based standards. This paper presents a comparative analysis of two traditional codecs, HEVC and VVC, against two leading NVCs, DCVC-FM and DCVC-RT. We conduct a thorough rate distortion analysis using multiple objective metrics (PSNR, SSIM, VMAF, and LPIPS) and introduce a formal method to identify rate-matched operating points for fair visual comparison. Our results demonstrate that NVCs offer substantial gains not only in signal fidelity but also in perceptual quality, achieving up to 37.82% bitrate savings over HEVC. Furthermore, our analysis reveals fundamental differences in distortion patterns: NVCs excel at preserving structural and color fidelity, producing visually pleasing results free of blocking artifacts, but can sometimes smooth over fine textures. Conversely, traditional codecs are prone to blockiness but can occasionally retain more high-frequency detail. These findings confirm the superior efficiency of NVCs and highlight the need for evaluation methodologies that look beyond PSNR.

Palavras-chave: video compression, neural networks, neural codecs, quality assessment, HEVC, VVC, DCVC-FM, DCVC-RT

Referências

2003. Recommendation H.264: Advanced video coding for generic audiovisual services. International Telecommunication Union (ITU). [link] Available: [link].

2013. Recommendation H.265: High efficiency video coding. International Telecommunication Union (ITU). [link] Available: [link].

2020. Recommendation H.266: Versatile video coding. International Telecommunication Union (ITU). [link] Available: [link].

Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RDcurves. ITU SG16 Doc. VCEG-M33 (2001).

Frank Bossen, Bin Li, Kyeongkeun Lim, and Alexei Norkin. 2013. Common test conditions and software reference configurations. Technical Report JCTVC-L1100. Joint Collaborative Team on Video Coding (JCT-VC). Meeting 12, Geneva, Switzerland.

Yi-Hsin Chen, Kuan-Wei Ho, Martin Benjak, Jörn Ostermann, and Wen-Hsiao Peng. 2024. On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding. In 2024 IEEE 26th International Workshop on Multimedia Signal Processing (MMSP). 1–6. DOI: 10.1109/MMSP61759.2024.10743250

Zhaoyang Jia, Bin Li, Jiahao Li, Wenxuan Xie, Linfeng Qi, Houqiang Li, and Yan Lu. 2025. Towards Practical Real-Time Neural Video Compression. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-25, 2024.

Jiahao Li, Bin Li, and Yan Lu. 2021. Deep Contextual Video Compression. Advances in Neural Information Processing Systems 34 (2021).

Jiahao Li, Bin Li, and Yan Lu. 2024. Neural Video Compression with Feature Modulation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 17-21, 2024.

Xiaodong Liao, Yuanyi Yang, Shiqi Li, Kai Huang, Zhenyu Li, Shiqi Lu, Zhan Zhang, and Liang Ma. 2021. Advances in video compression system using deep neural network: A review and case studies. Neurocomputing 462 (2021), 364–385. DOI: 10.1016/j.neucom.2021.07.045

Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, and Zhiyong Gao. 2019. DVC: An End-to-end Deep Video Compression Framework. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 11006–11015. DOI: 10.1109/CVPR.2019.01126

Aksels Mercat, Mika Vaittinen, and Jarno Vanne. 2020. UVG Dataset: 50/120fps 4K Sequences for Video Codec Analysis and Development. In Proceedings of the 11th ACM Multimedia Systems Conference (MMSys). 297–302. DOI: 10.1145/3339825.3394937

Netflix Technology Blog. 2018. VMAF: The Journey Continues. [link]. Accessed: 2025-07-07.

Farhad Pakdaman, Mohammad Ali Adelimanesh, Moncef Gabbouj, and Mahmoud Reza Hashemi. 2020. Complexity Analysis Of Next-Generation VVC Encoding And Decoding. In 2020 IEEE International Conference on Image Processing (ICIP). 3134–3138. DOI: 10.1109/ICIP40778.2020.9190983

Icaro Siqueira, Guilherme Correa, and Mateus Grellert. 2021. Complexity and Coding Efficiency Assessment of the Versatile Video Coding Standard. In IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 1–5. DOI: 10.1109/ISCAS51556.2021.9401714

Zhou Wang, Alan C. Bovik, Hamid R. Sheikh, and Eero P. Simoncelli. 2004. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Transactions on Image Processing 13, 4 (2004), 600–612. DOI: 10.1109/TIP.2003.819861

Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang. 2018. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018), 586–595. DOI: 10.1109/CVPR.2018.00068
Publicado
10/11/2025
TAVARES, Leandro; CONCEIÇÃO, Ruhan; COSTA, Victor; AGOSTINI, Luciano; PORTO, Marcelo; CORRÊA, Guilherme. Traditional vs. Neural Video Codecs: Compression Efficiency, Visual Artifacts, and Quality Analysis Beyond PSNR. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 31. , 2025, Rio de Janeiro/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025 . p. 446-454. DOI: https://doi.org/10.5753/webmedia.2025.16011.

Artigos mais lidos do(s) mesmo(s) autor(es)

1 2 > >>