Multimodal 3D Few-Shot Classification via Gaussian Mixture Discriminant Analysis
dc.contributor.author | Wu, Yiqi | en_US |
dc.contributor.author | Wu, Huachao | en_US |
dc.contributor.author | Hu, Ronglei | en_US |
dc.contributor.author | Chen, Yilin | en_US |
dc.contributor.author | Zhang, Dejun | en_US |
dc.contributor.editor | Christie, Marc | en_US |
dc.contributor.editor | Pietroni, Nico | en_US |
dc.contributor.editor | Wang, Yu-Shuen | en_US |
dc.date.accessioned | 2025-10-07T05:03:24Z | |
dc.date.available | 2025-10-07T05:03:24Z | |
dc.date.issued | 2025 | |
dc.description.abstract | While pre-trained 3D vision-language models are becoming increasingly available, there remains a lack of frameworks that can effectively harness their capabilities for few-shot classification. In this work, we propose PointGMDA, a training-free framework that combines Gaussian Mixture Models (GMMs) with Gaussian Discriminant Analysis (GDA) to perform robust classification using only a few labeled point cloud samples. Our method estimatesGMMparameters per class from support data and computes mixture-weighted prototypes, which are then used in GDA with a shared covariance matrix to construct decision boundaries. This formulation allows us to model intra-class variability more expressively than traditional single-prototype approaches, while maintaining analytical tractability. To incorporate semantic priors, we integrate CLIP-style textual prompts and fuse predictions from geometric and textual modalities through a hybrid scoring strategy. We further introduce PointGMDA-T, a lightweight attention-guided refinement module that learns residuals for fast feature adaptation, improving robustness under distribution shift. Extensive experiments on ModelNet40 and ScanObjectNN demonstrate that PointGMDA outperforms strong baselines across a variety of few-shot settings, with consistent gains under both training-free and fine-tuned conditions. These results highlight the effectiveness and generality of our probabilistic modeling and multimodal adaptation framework. Our code is publicly available at https://github.com/djzgroup/PointGMDA. | en_US |
dc.description.number | 7 | |
dc.description.sectionheaders | Creating and Processing Point Clouds | |
dc.description.seriesinformation | Computer Graphics Forum | |
dc.description.volume | 44 | |
dc.identifier.doi | 10.1111/cgf.70268 | |
dc.identifier.issn | 1467-8659 | |
dc.identifier.pages | 10 pages | |
dc.identifier.uri | https://doi.org/10.1111/cgf.70268 | |
dc.identifier.uri | https://diglib.eg.org/handle/10.1111/cgf70268 | |
dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | en_US |
dc.subject | CCS Concepts: Computing methodologies → Computer graphics; 3D imaging; Computer vision; Machine learning | |
dc.subject | Computing methodologies → Computer graphics | |
dc.subject | 3D imaging | |
dc.subject | Computer vision | |
dc.subject | Machine learning | |
dc.title | Multimodal 3D Few-Shot Classification via Gaussian Mixture Discriminant Analysis | en_US |
Files
Original bundle
1 - 1 of 1