An evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes

Gauthier, AlbanDeschaintre, ValentinLanvin, AlexandreDurand, FredoBousseau, AdrienDrettakis, GeorgeWang, BeibeiWilkie, Alexander2025-06-202025-06-202025978-3-03868-292-91727-3463https://doi.org/10.2312/sr.20251186https://diglib.eg.org/handle/10.2312/sr20251186Digital content creation is experiencing a profound change with the advent of deep generative models. For texturing, conditional image generators now allow the synthesis of realistic RGB images of a 3D scene that align with the geometry of that scene. For appearance modeling, SVBRDF prediction networks recover material parameters from RGB images. Combining these technologies allows us to quickly generate SVBRDF maps for multiple views of a 3D scene, which can be merged to form a SVBRDF texture atlas of that scene. In this paper, we analyze the challenges and opportunities for SVBRDF prediction in the context of such a fast appearance modeling pipeline. On the one hand, single-view SVBRDF predictions might suffer from multiview incoherence and yield inconsistent texture atlases. On the other hand, generated RGB images, and the different modalities on which they are conditioned, can provide additional information for SVBRDF estimation compared to photographs. We compare neural architectures and conditions to identify designs that achieve high accuracy and coherence. We find that, surprisingly, a standard UNet is competitive with more complex designs.Attribution 4.0 International LicenseCCS Concepts: Computing methodologies -> Texturing; Reflectance modelingComputing methodologiesTexturingReflectance modelingAn evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes10.2312/sr.2025118615 pages