Fast Method for Dynamic Point Cloud Coding Based on Block Partition Prediction
Abstract
Point clouds are a versatile 3D data representation format that captures spatial and attribute information about objects or environments. It has been widely used in applications like autonomous driving, augmented and virtual reality, and 3D scanning. However, the large volume of data involved in point cloud processing poses challenges in terms of storage, transmission, and real-time processing. The Video-based Point Cloud Compression (V-PCC) standard addresses these challenges by employing 2D video compression techniques to encode dynamic 3D point clouds. Despite its effectiveness, V-PCC demands a high computational cost, particularly in encoding geometry and attribute video sub-streams. This paper presents a machine learning-based approach to accelerate the V-PCC encoding, focusing on the Coding Unit partitioning process in the geometry and attribute sub-streams deployed in the reference software (TMC2). The proposed method significantly reduces the encoding complexity by using decision tree models with negligible impact on coding efficiency. Experimental results demonstrate an average encoding time reduction of 36.12% for geometry and attribute sub-streams, with a minimal impact on coding efficiency between -0.43% and 0.83% in terms of BD-Rate.
References
Frank Bossen, Benjamin Bross, Karsten Suhring, and David Flynn. 2012. HEVC Complexity and Implementation Analysis. IEEE Transactions on Circuits and Systems for Video Technology 22, 12 (2012), 1685–1696. DOI: 10.1109/TCSVT.2012.2221255
Benjamin Bross, Ye-Kui Wang, Yan Ye, Shan Liu, Jianle Chen, Gary J Sullivan, and Jens-Rainer Ohm. 2021. Overview of the versatile video coding (VVC) standard and its applications. IEEE Transactions on Circuits and Systems for Video Technology 31, 10 (2021), 3736–3764.
Wei Gao, Hang Yuan, Ge Li, Zhu Li, and Hui Yuan. 2023. Low Complexity Coding Unit Decision for Video-Based Point Cloud Compression. IEEE Transactions on Image Processing 33 (2023), 149–162.
Guillaume Gautier, Alexandre Mercat, Louis Fréneau, Mikko Pitkänen, and Jarno Vanne. 2023. UVG-VPC: Voxelized Point Cloud Dataset for Visual Volumetric Video-based Coding. In 2023 15th International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 244–247.
Danillo Graziosi, Ohji Nakagami, Satoru Kuma, Alexandre Zaghetto, Teruhiko Suzuki, and Ali Tabatabai. 2020. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Transactions on Signal and Information Processing 9 (2020), e13.
Dan Grois, Detlev Marpe, Amit Mulayoff, Benaya Itzhaky, and Ofer Hadar. 2013. Performance comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC encoders. In 2013 Picture Coding Symposium (PCS). 394–397.
Il-Koo Kim, Junghye Min, Tammy Lee, Woo-Jin Han, and JeongHoon Park. 2012. Block partitioning structure in the HEVC standard. IEEE Transactions on Circuits and Systems for Video Technology 22, 12 (2012), 1697–1706.
Yue Li, Jun Huang, Chaofeng Wang, and Hongyue Huang. 2024. Unsupervised learning-based fast CU size decision for geometry videos in V-PCC. Journal of Real-Time Image Processing 21, 1 (2024), 11.
Ting-Lan Lin, Hong-Bin Bu, Yan-Cheng Chen, Jun-Rui Yang, Chi-Fu Liang, Kun-Hu Jiang, Ching-Hsuan Lin, and Xiao-Feng Yue. 2021. Efficient quadtree search for HEVC coding units for V-PCC. IEEE Access 9 (2021), 139109–139121.
Loren Merritt and Rahul Vanam. 2006. x264: A high performance H. 264/AVC encoder. [link] (2006).
MPEG. 2020. Common Test Conditions for V3C and V-PCC. ISO/IEC JTC 1/SC 29/WG 11 (2020).
MPEG. 2024. Video Point Cloud Compression - VPCC - mpeg-pcc-tmc2 test model candidate software. [link].
Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. the Journal of machine Learning research 12 (2011), 2825–2830.
Marius Preda. 2020. V-PCC codec description. [link].
Shicheng Que and Yue Li. 2023. Portable Perceptron Network-Based Fast Mode Decision for Video-Based Point Cloud Compression. CAAI Artificial Intelligence Research 2 (2023).
Gustavo Rehbein, Eduardo Costa, Guilherme Corrêa, Cristiano Santos, and Marcelo Porto. 2024. A Machine-Learning-Driven Fast Video-based Point Cloud Compression (V-PCC). In Proceedings of the 30th Brazilian Symposium on Multimedia and the Web (Juiz de Fora/MG). SBC, Porto Alegre, RS, Brasil, 20–27. DOI: 10.5753/webmedia.2024.242069
R Schaefer. 2017. Call for proposals for point cloud compression V2. In ISO/IEC JTC1 SC29/WG11 MPEG, 117th Meeting. Hobart, TAS.
Wenjun Song, Xinqi Liu, and Qiuwen Zhang. 2025. Fast Coding Unit Partitioning Method for Video-Based Point Cloud Compression: Combining Convolutional Neural Networks and Bayesian Optimization. Electronics 14, 7 (2025), 1295.
Gary J Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand. 2012. Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on circuits and systems for video technology 22, 12 (2012), 1649–1668.
