Analysing Hybrid Neural and Ray Tracing Perception for Foveated Rendering

Resumo


Foveated Rendering is a fundamental approach for optimizing latency and achieving high fidelity graphics in Virtual Reality. With the advent of real-time ray tracing, different global illumination effects are becoming feasible in interactive media, although they still require solutions for optimally displaying such effects. At the same time, neural rendering techniques, such as Neural Radiance Fields (NeRF) and Gaussian Splatting, are featuring realistic reconstruction of objects and scenes through real or pre-rendered images. Previous works introduced the possibility of hybrid rendering approaches, using full Ray Tracing techniques for the Fovea regions of a Head Mounted Display and Neural Rendering approaches, using baked and previously trained Networks for inferring realistic light effects in a more optimal and faster way for the peripheral region. In this work, we make a deep analysis of human perception related to five different lighting conditions, commonly achieved by Ray Tracing techniques when combining NeRF for approaching the peripheral region. To this end, we conducted a Double-Stimulus Impairment Scale (DSIS) test with 42 users, half of whom were tested on their perception of lighting effects with the reticule present in the image to fixate the user’s gaze, and half of whom were tested without the reticule. Results analysis enhances our understanding of visual perception in virtual reality and the limitations of NeRF on reproducing ray tracing effects for VR environments.
Palavras-chave: perception-based rendering, neural 3d rendering, real-time raytracing, virtual reality

Referências

Statista Market Forecast. "AR & VR - Worldwide." [link]. Accessed June 2, 2023.

MirrorGaussian. "Reflecting 3D Gaussians for Reconstructing Mirror Reflections." arXiv preprint arXiv:2405.11921 (2024).

Albert, R., Patney, A., Luebke, D., & Kim, J. "Latency requirements for foveated rendering in virtual reality." ACM Transactions on Applied Perception (TAP), 14(4), 2017. DOI: 10.1145/3127589.

Banks, M. S., Sekuler, A. B., & Anderson, S. J. "Peripheral spatial vision: Limits imposed by optics, photoreceptors, and receptor pooling." JOSA A, 8(11), 1991.

Barron, J. T., Mildenhall, B., Verbin, D., Srinivasan, P. P., & Hedman, P. "Mip-nerf 360: Unbounded anti-aliased neural radiance fields." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022. DOI: 10.1109/CVPR52688.2022.00539.

Bates, D., Mächler, M., Bolker, B. M., & Walker, S. C. "Fitting linear mixed-effects models using lme4." Journal of Statistical Software, 67(1), 2015. DOI: 10.18637/jss.v067.i01.

Caulfield, B. "What Is Path Tracing?" NVIDIA Blog. [link]. Accessed December 26, 2022.

Chen, X., Liu, J., Zhao, H., Zhou, G., & Zhang, Y. "Nerrf: 3d reconstruction and view synthesis for transparent and specular objects with neural refractive-reflective fields." arXiv preprint arXiv:2309.13039 (2023).

Chen, Z., Funkhouser, T., Hedman, P., & Tagliasacchi, A. "Mobilenerf: Exploiting the polygon rasterization pipeline for efficient neural field rendering on mobile architectures." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.

Cui, Z., Gu, L., Sun, X., Ma, X., Qiao, Y., & Harada, T. "Aleth-nerf: Illumination adaptive nerf with concealing field assumption." In Proceedings of the AAAI Conference on Artificial Intelligence, 38, 2024.

Deng, N., He, Z., Ye, J., Duinkharjav, B., Chakravarthula, P., Yang, X., & Sun, Q. "Fov-nerf: Foveated neural radiance fields for virtual reality." IEEE Transactions on Visualization and Computer Graphics, 28(11), 2022. DOI: 10.1109/TVCG.2022.3203102.

Deng, W., Campbell, D., Sun, C., Kanitkar, S., Shaffer, M., & Gould, S. "Ray Deformation Networks for Novel View Synthesis of Refractive Objects." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024.

Dix, A., Finlay, J., Abowd, G. D., & Beale, R. "Human-computer interaction." Pearson Education, 2003.

Duchowski, A. T., Bate, D., Stringfellow, P., Thakur, K., Melloy, B. J., & Gramopadhye, A. K. "On spatiochromatic visual sensitivity and peripheral color LOD management." ACM Transactions on Applied Perception (TAP), 6(2), 2009.

Einhorn, E., & Mawdsley, J. "Generate Groundbreaking Ray-Traced Images with Next-Generation NVIDIA DLSS." NVIDIA Blog, 2023. [link].

Elkin, L. A., Kay, M., Higgins, J. J., & Wobbrock, J. O. "An Aligned Rank Transform Procedure for Multifactor Contrast Tests." UIST 2021 - Proceedings of the 34th Annual ACM Symposium on User Interface Software and Technology, 2021. DOI: 10.1145/3472749.3474784.

Feng, Y., Feng, X., Shang, Y., Jiang, Y., Yu, C., Zong, Z., Shao, T., Wu, H., Zhou, K., & Yang, Y. "Gaussian Splashing: Dynamic Fluid Synthesis with Gaussian Splatting." arXiv:2401.15318 [cs.GR], 2024.

Fridovich-Keil, S., Yu, A., Tancik, M., Chen, Q., Recht, B., & Kanazawa, A. "Plenoxels: Radiance fields without neural networks." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022. DOI: 10.1109/CVPR52688.2022.00542.

Gao, Q., Xu, Q., Cao, Z., Mildenhall, B., Ma, W., Chen, L., Tang, D., & Neumann, U. "GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation." 2024.

Guenter, B., Finch, M., Drucker, S., Tan, D., & Snyder, J. "Foveated 3D graphics." ACM Transactions on Graphics (TOG), 31(6), 2012. DOI: 10.1145/2366145.2366183.

Guo, Z., Zhou, W., Li, L., Wang, M., & Li, H. "Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction." arXiv:2403.11447 [cs.CV], 2024.

Henriques, H., Oliveira, E., Clua, E., & Trevisan, D. "A mixed path tracing and NeRF approach for optimizing rendering in XR Displays." In Proceedings of the 25th Symposium on Virtual and Augmented Reality, 2023.

Holm, S. "A simple sequentially rejective multiple test procedure." Scandinavian journal of statistics, 1979.

Jiang, Y., Tu, J., Liu, Y., Gao, X., Long, X., Wang, W., & Ma, Y. "GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces." arXiv:2311.17977 [cs.CV], 2023.

Kajiya, J. T. "The rendering equation." In Proceedings of the 13th annual conference on Computer graphics and interactive techniques, 1986. DOI: 10.1145/15922.15902.

Kerbl, B., Kopanas, G., Leimkühler, T., & Drettakis, G. "3d gaussian splatting for real-time radiance field rendering." ACM Transactions on Graphics, 42(4), 2023, 1–14.

Kilgariff, E., Moreton, H., Stam, N., & Bell, B. "NVIDIA Turing Architecture In-Depth." NVIDIA Technical Blog, 2018. [link]. Accessed December 19, 2022.

Kim, J., Kim, J., Jung, M., Kwon, T., & Kim, K. K. "Individualized foveated rendering with eye-tracking head-mounted display." Virtual Reality, 28(1), January 2024, 25. DOI: 10.1007/s10055-023-00931-8.

Koskela, M. "Foveated path tracing with fast reconstruction and efficient sample distribution." 2020.

Koskela, M., Lotvonen, A., Mäkitalo, M., Kivi, P., Viitanen, T., & Jääskeläinen, P. "Foveated Real-Time Path Tracing in Visual-Polar Space." In Eurographics Symposium on Rendering - DL-only and Industry Track, Boubekeur, T., & Sen, P. (Eds.), The Eurographics Association, 2019. DOI: 10.2312/sr.20191219.

Levoy, M., & Whitaker, R. "Gaze-directed volume rendering." In Proceedings of the 1990 symposium on interactive 3d graphics, 1990. DOI: 10.1145/91385.91449.

Liu, L., Gu, J., Lin, K. Z., Chua, T. S., & Theobalt, C. "Neural Sparse Voxel Fields." In Advances in Neural Information Processing Systems, Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M. F., & Lin, H. (Eds.), Vol. 33, Curran Associates, Inc., 2020. [link].

Mildenhall, B., Srinivasan, P. P., Tancik, M., Barron, J. T., Ramamoorthi, R., & Ng, R. "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis." Commun. ACM, 65(1), December 2021, 99–106. DOI: 10.1145/3503250.

Mohanto, B., Islam, A. B. T., Gobbetti, E., & Staadt, O. "An integrative view of foveated rendering." Computers & Graphics, 102, 2022, 474–501. DOI: 10.1016/j.cag.2021.10.010.

Müller, T., Evans, A., Schied, C., & Keller, A. "Instant Neural Graphics Primitives with a Multiresolution Hash Encoding." ACM Trans. Graph., 41(4), Article 102, July 2022, 15 pages. DOI: 10.1145/3528223.3530127.

Ogboso, Y. U., & Bedell, H. E. "Magnitude of lateral chromatic aberration across the retina of the human eye." JOSA A, 8(8), 1987, 1775–1787.

Patney, A., Salvi, M., Kim, J., Kaplanyan, A., Wyman, C., Benty, N., Luebke, D., & Lefohn, A. "Towards foveated rendering for gaze-tracked virtual reality." ACM Transactions on Graphics (TOG), 35(6), 2016, 1–12.

Porcino, T., Trevisan, D., & Clua, E. "Minimizing cybersickness in head-mounted display systems: causes and strategies review." In 2020 22nd Symposium on Virtual and Augmented Reality (SVR). IEEE, 154–163. DOI: 10.1109/SVR51698.2020.00035.

Qian, S., Kirschstein, T., Schoneveld, L., Davoli, D., Giebenhain, S., & Nießner, M. "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians." arXiv preprint arXiv:2312.02069 (2023).

Reiser, C., Peng, S., Liao, Y., & Geiger, A. "Kilonerf: Speeding up neural radiance fields with thousands of tiny mlps." arXiv e-prints, 2021, 1–11. DOI: 10.48550/arXiv.2103.13744.

Schütz, M., Krösl, K., & Wimmer, M. "Real-time continuous level of detail rendering of point clouds." In 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). IEEE, 103–110.

BT Series. "Methodology for the subjective assessment of the quality of television pictures." Recommendation ITU-R BT 500, 2012, 1–46.

Sorokin, A., & Forsyth, D. "Utility data annotation with amazon mechanical turk." In 2008 IEEE computer society conference on computer vision and pattern recognition workshops. IEEE, 1–8.

Swafford, N. T., Iglesias-Guitian, J. A., Koniaris, C., Moon, B., Cosker, D., & Mitchell, K. "User, metric, and computational evaluation of foveated rendering methods." In Proceedings of the ACM Symposium on Applied Perception, ACM, Anaheim, California, 7–14. DOI: 10.1145/2931002.2931011.

Tursun, O. T., Arabadzhiyska-Koleva, E., Wernikowski, M., Mantiuk, R., Seidel, H. P., Myszkowski, K., & Didyk, P. "Luminance-contrast-aware foveated rendering." ACM Transactions on Graphics (TOG), 38(4), 2019, 1–14.

Ujjainkar, N., Shahan, E., Chen, K., Duinkharjav, B., Sun, Q., & Zhu, Y. "Exploiting Human Color Discrimination for Memory- and Energy-Efficient Image Encoding in Virtual Reality." In Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, ACM, La Jolla, CA, USA, 166–180. DOI: 10.1145/3617232.3624860.

Verbin, D., Hedman, P., Mildenhall, B., Zickler, T., Barron, J. T., & Srinivasan, P. P. "Ref-nerf: Structured view-dependent appearance for neural radiance fields." In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 5481–5490.

Wandell, B. A. "Foundations of vision." Sinauer Associates, 1995.

Wang, L., Shi, X., & Liu, Y. "Foveated rendering: A state-of-the-art survey." Computational Visual Media, 9(2), 2023, 195–228. DOI: 10.1007/s41095-022-0306-4.

Weier, M., Roth, T., Kruijff, E., Hinkenjann, A., Pérard-Gayot, A., Slusallek, P., & Li, Y. "Foveated Real-Time Ray Tracing for Head-Mounted Displays." Computer Graphics Forum, 2016, 289–298. DOI: 10.1111/cgf.13026.

Whitted, T. "An Improved Illumination Model for Shaded Display." In Proceedings of the 6th Annual Conference on Computer Graphics and Interactive Techniques (Chicago, Illinois, USA) (SIGGRAPH ’79). Association for Computing Machinery, New York, NY, USA, 14. DOI: 10.1145/800249.807419.

Wobbrock, J. O., Findlater, L., Gergle, D., & Higgins, J. J. "The Aligned Rank Transform for nonparametric factorial analyses using only ANOVA procedures." Conference on Human Factors in Computing Systems - Proceedings, 2011, 143–146. DOI: 10.1145/1978942.1978963.

Yan, X., Xu, J., Huo, Y., & Bao, H. "Neural Rendering and Its Hardware Acceleration: A Review." arXiv preprint arXiv:2402.00028 (2024).

Yee, H., Pattanaik, S., & Greenberg, D. P. "Spatiotemporal sensitivity and visual attention for efficient rendering of dynamic environments." ACM Transactions on Graphics (TOG), 20(1), 2001, 39–65.
Publicado
30/09/2024
HENRIQUES, Horácio; OLIVEIRA, Eder; CLUA, Esteban; TREVISAN, Daniela. Analysing Hybrid Neural and Ray Tracing Perception for Foveated Rendering. In: SIMPÓSIO DE REALIDADE VIRTUAL E AUMENTADA (SVR), 26. , 2024, Manaus/AM. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 21-30.