skip to main content
10.1145/3428658.3430966acmconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
research-article

Decoder-Side Quality Enhancement of JPEG Images Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients

Published:30 November 2020Publication History

ABSTRACT

Many recent works have successfully applied some types of Convolutional Neural Networks (CNNs) to reduce the noticeable distortion resulting from the lossy JPEG compression technique. Most of them are built upon the processing made on the spatial domain. In this work, we propose a JPEG image decoder that is purely based on the frequency-to-frequency domain: it reads the quantized DCT coefficients received from a low-quality image bitstream and, using a deep learning-based model, predicts the missing coefficients in order to recompose the same image with enhanced quality. In experiments with two datasets, our best model was able to improve from images with quantized DCT coefficients corresponding to a Qualityz Factor (QF) of 10 to enhanced quality images with QF slightly higher than 20.

References

  1. Carlo "zED" Caputo. 2008. Optimization of Video Compression Parameters through Genetic Algorithms. In Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web (Vila Velha, Espírito Santo, Brazil) (WebMedia '08). Association for Computing Machinery, New York, NY, USA, 33--36. https://doi.org/10.1145/1809980.1809990Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chao Dong, Yubin Deng, Chen Change Loy, and Xiaoou Tang. 2015. Compression artifacts reduction by a deep convolutional network. In Proceedings of the IEEE International Conference on Computer Vision. 576--584.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2015. Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence 38, 2 (2015), 295--307.Google ScholarGoogle Scholar
  4. Joint Photographic Experts Group et al. 2004. JPEG standards: ISO/IEC IS 10918-1, ITU-T Recommendation T.81.Google ScholarGoogle Scholar
  5. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  6. Jingru Hou, Yujuan Si, and Liangliang Li. 2019. Image Super-Resolution Reconstruction Method Based on Global and Local Residual Learning. In 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC). IEEE, 341--348.Google ScholarGoogle Scholar
  7. N Instruments. 2013. Peak signal-to-noise ratio as an image quality metric.Google ScholarGoogle Scholar
  8. Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 (2015).Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hossam M Kasem, Kwok-Wai Hung, and Jianmin Jiang. 2018. Revised Spatial Transformer Network towards Improved Image Super-resolutions. In 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 2688--2692.Google ScholarGoogle ScholarCross RefCross Ref
  10. T. Kim, H. Lee, H. Son, and S. Lee. 2019. SF-CNN: A Fast Compression Artifacts Removal via Spatial-To-Frequency Convolutional Neural Networks. In 2019 IEEE International Conference on Image Processing (ICIP). 3606--3610.Google ScholarGoogle Scholar
  11. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  12. Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, et al. 2017. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4681--4690.Google ScholarGoogle ScholarCross RefCross Ref
  13. Sheng Li, Fengxiang He, Bo Du, Lefei Zhang, Yonghao Xu, and Dacheng Tao. 2019. Fast spatio-temporal residual network for video super-resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10522--10531.Google ScholarGoogle ScholarCross RefCross Ref
  14. Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. 2017. Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 136--144.Google ScholarGoogle ScholarCross RefCross Ref
  15. Bumjun Park, Songhyun Yu, and Jechang Jeong. 2019. Densely connected hierarchical network for image denoising. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.Google ScholarGoogle ScholarCross RefCross Ref
  16. RajeshMehra Pinki. 2016. Estimation of the image quality under different distortions. International Journal of Engineering and Computer Science 5, 7 (2016), 17291--17296.Google ScholarGoogle Scholar
  17. Bulla Rajesh, Mohammed Javed, Ratnesh, and Shubham Srivastava. 2019. DCT-CompCNN: A Novel Image Classification Network Using JPEG Compressed DCT Coefficients. arXiv:cs.CV/1907.11503Google ScholarGoogle Scholar
  18. Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234--241.Google ScholarGoogle ScholarCross RefCross Ref
  19. Gabriel N. P. dos Santos, Pedro V. A. de Freitas, Antonio José G Busson, Álan LV Guedes, Ruy Milidiú, and Sérgio Colcher. 2019. Deep learning methods for video understanding. In Proceedings of the 25th Brazillian Symposium on Multimedia and the Web. 21--23. https://doi.org/10.1145/3323503.3345029Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Vinay Verma, Nikita Agarwal, and Nitin Khanna. 2018. DCT-domain deep convolutional neural networks for multiple JPEG compression classification. Signal Processing: Image Communication 67 (2018), 22--33.Google ScholarGoogle ScholarCross RefCross Ref
  21. John Watkinson. 2004. The MPEG Handbook: MPEG-1, MPEG-2, MPEG-4. Taylor & Francis.Google ScholarGoogle Scholar
  22. Qi Ye, Shanxin Yuan, and Tae-Kyun Kim. 2016. Spatial attention deep net with partial pso for hierarchical hybrid hand pose estimation. In European conference on computer vision. Springer, 346--361.Google ScholarGoogle ScholarCross RefCross Ref
  23. Ke Yu, Chao Dong, Chen Change Loy, and Xiaoou Tang. 2016. Deep convolution networks for compression artifacts reduction. arXiv preprint arXiv:1608.02778 (2016).Google ScholarGoogle Scholar
  24. Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. 2017. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE Transactions on Image Processing 26, 7 (2017), 3142--3155.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xiaoshuai Zhang, Wenhan Yang, Yueyu Hu, and Jiaying Liu. 2018. Dmcnn: Dual-domain multi-scale convolutional neural network for compression artifacts removal. In 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 390--394.Google ScholarGoogle ScholarCross RefCross Ref
  26. Leilei Zhu, Shu Zhan, and Haiyan Zhang. 2019. Stacked U-shape networks with channel-wise attention for image super-resolution. Neurocomputing 345 (2019), 58--66.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Decoder-Side Quality Enhancement of JPEG Images Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WebMedia '20: Proceedings of the Brazilian Symposium on Multimedia and the Web
        November 2020
        364 pages
        ISBN:9781450381963
        DOI:10.1145/3428658

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 November 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        WebMedia '20 Paper Acceptance Rate34of87submissions,39%Overall Acceptance Rate270of873submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader