G-SplatGAN: Disentangled 3D Gaussian Generation for Complex Shapes via Multi-Scale Patch Discriminators

dc.contributor.authorLi, Jiaqien_US
dc.contributor.authorDang, Haochuanen_US
dc.contributor.authorZhou, Zhien_US
dc.contributor.authorZhu, Junkeen_US
dc.contributor.authorHuang, Zhangjinen_US
dc.contributor.editorChristie, Marcen_US
dc.contributor.editorPietroni, Nicoen_US
dc.contributor.editorWang, Yu-Shuenen_US
dc.date.accessioned2025-10-07T05:02:46Z
dc.date.available2025-10-07T05:02:46Z
dc.date.issued2025
dc.description.abstractGenerating 3D objects with complex topologies from monocular images remains a challenge in computer graphics, due to the difficulty of modeling varying 3D shapes with disentangled, steerable geometry and visual attributes. While NeRF-based methods suffer from slow volumetric rendering and limited structural controllability. Recent advances in 3D Gaussian Splatting provide a more efficient alternative and its generative modeling with separate control over structure and appearance remains underexplored. In this paper, we propose G-SplatGAN, a novel 3D-aware generation framework that combines the rendering efficiency of 3D Gaussian Splatting with disentangled latent modeling. Starting from a shared Gaussian template, our method uses dual modulation branches to modulate geometry and appearance from independent latent codes, enabling precise shape manipulation and controllable generation. We adopt a progressive adversarial training scheme with multi-scale and patchbased discriminators to capture both global structure and local detail. Our model requires no 3D supervision and is trained on monocular images with known camera poses, reducing data reliance while supporting real image inversion through a geometryaware encoder. Experiments show that G-SplatGAN achieves superior performance in rendering speed, controllability and image fidelity, offering a compelling solution for controllable 3D generation using Gaussian representations.en_US
dc.description.number7
dc.description.sectionheadersGaussian Splatting
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume44
dc.identifier.doi10.1111/cgf.70256
dc.identifier.issn1467-8659
dc.identifier.pages12 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.70256
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf70256
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Computing methodologies → Shape modeling; Rendering
dc.subjectComputing methodologies → Shape modeling
dc.subjectRendering
dc.titleG-SplatGAN: Disentangled 3D Gaussian Generation for Complex Shapes via Multi-Scale Patch Discriminatorsen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
cgf70256.pdf
Size:
3.27 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
paper1285_mm1.mp4
Size:
35.75 MB
Format:
Video MP4
Collections