Multimodal 3D Few-Shot Classification via Gaussian Mixture Discriminant Analysis

Wu, Yiqi; Wu, Huachao; Hu, Ronglei; Chen, Yilin; Zhang, Dejun

Multimodal 3D Few-Shot Classification via Gaussian Mixture Discriminant Analysis

dc.contributor.author	Wu, Yiqi	en_US
dc.contributor.author	Wu, Huachao	en_US
dc.contributor.author	Hu, Ronglei	en_US
dc.contributor.author	Chen, Yilin	en_US
dc.contributor.author	Zhang, Dejun	en_US
dc.contributor.editor	Christie, Marc	en_US
dc.contributor.editor	Pietroni, Nico	en_US
dc.contributor.editor	Wang, Yu-Shuen	en_US
dc.date.accessioned	2025-10-07T05:03:24Z
dc.date.available	2025-10-07T05:03:24Z
dc.date.issued	2025
dc.description.abstract	While pre-trained 3D vision-language models are becoming increasingly available, there remains a lack of frameworks that can effectively harness their capabilities for few-shot classification. In this work, we propose PointGMDA, a training-free framework that combines Gaussian Mixture Models (GMMs) with Gaussian Discriminant Analysis (GDA) to perform robust classification using only a few labeled point cloud samples. Our method estimatesGMMparameters per class from support data and computes mixture-weighted prototypes, which are then used in GDA with a shared covariance matrix to construct decision boundaries. This formulation allows us to model intra-class variability more expressively than traditional single-prototype approaches, while maintaining analytical tractability. To incorporate semantic priors, we integrate CLIP-style textual prompts and fuse predictions from geometric and textual modalities through a hybrid scoring strategy. We further introduce PointGMDA-T, a lightweight attention-guided refinement module that learns residuals for fast feature adaptation, improving robustness under distribution shift. Extensive experiments on ModelNet40 and ScanObjectNN demonstrate that PointGMDA outperforms strong baselines across a variety of few-shot settings, with consistent gains under both training-free and fine-tuned conditions. These results highlight the effectiveness and generality of our probabilistic modeling and multimodal adaptation framework. Our code is publicly available at https://github.com/djzgroup/PointGMDA.	en_US
dc.description.number	7
dc.description.sectionheaders	Creating and Processing Point Clouds
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	44
dc.identifier.doi	10.1111/cgf.70268
dc.identifier.issn	1467-8659
dc.identifier.pages	10 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.70268
dc.identifier.uri	https://diglib.eg.org/handle/10.1111/cgf70268
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies → Computer graphics; 3D imaging; Computer vision; Machine learning
dc.subject	Computing methodologies → Computer graphics
dc.subject	3D imaging
dc.subject	Computer vision
dc.subject	Machine learning
dc.title	Multimodal 3D Few-Shot Classification via Gaussian Mixture Discriminant Analysis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: cgf70268.pdf
Size:: 4.52 MB
Format:: Adobe Portable Document Format

Download

Collections

44-Issue 7