Learning Transformation-Isomorphic Latent Space for Accurate Hand Pose Estimation

Date
2025
Publisher
The Eurographics Association
Abstract
Vision-based regression tasks, such as hand pose estimation, have achieved higher accuracy and faster convergence through representation learning. However, existing representation learning methods often suffer from two issues: the features extracted from images are too semantically high-level to regress low-level information, and they contain task-irrelevant information, which reduces their compactness and interferes with regression. To address these challenges, we propose TI-Net, a highly versatile visual network backbone designed to construct a Transformation-Isomorphic latent space. Specifically, we employ linear transformations to model geometric transformations in the latent space and train TI-Net to align them with their counterparts in image space. This ensures that the latent features capture compact, low-level information beneficial to pose estimation. We evaluated TI-Net on the hand pose estimation task to demonstrate the network's superiority. On the DexYCB dataset, TI-Net achieved a 10% improvement in the PA-MPJPE metric over specialized state-of-the-art (SOTA) hand pose estimation methods. Our code is available at https://github.com/Mine268/TI-Net.
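To make the abstract's central idea concrete, below is a minimal, hypothetical PyTorch sketch of a transformation-isomorphism objective: the latent code of a transformed image should equal a learned linear map applied to the latent code of the original image. The names TransformationHead and isomorphism_loss, the parameterization of the transform, and all shapes are illustrative assumptions rather than the authors' API; the actual implementation lives in the repository linked above.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TransformationHead(nn.Module):
    """Predicts a linear operator on latent features from transform parameters."""

    def __init__(self, latent_dim: int, param_dim: int):
        super().__init__()
        self.latent_dim = latent_dim
        self.mlp = nn.Sequential(
            nn.Linear(param_dim, 256),
            nn.ReLU(),
            nn.Linear(256, latent_dim * latent_dim),
        )

    def forward(self, params: torch.Tensor) -> torch.Tensor:
        # params: (B, param_dim), e.g. angle and translation of a 2D transform.
        batch = params.shape[0]
        return self.mlp(params).view(batch, self.latent_dim, self.latent_dim)

def isomorphism_loss(encoder, head, images, transformed_images, params):
    """Align latent-space linear maps with image-space geometric transforms.

    Encourages encoder(T(x)) == M_T @ encoder(x), where T is the image-space
    transform with parameters `params` and M_T is its predicted linear map.
    """
    z = encoder(images)                # (B, D): latents of the original images
    z_t = encoder(transformed_images)  # (B, D): latents of the transformed images
    m = head(params)                   # (B, D, D): linear maps in latent space
    z_pred = torch.bmm(m, z.unsqueeze(-1)).squeeze(-1)
    return F.mse_loss(z_pred, z_t)

# Example usage (shapes only; the encoder is any backbone returning (B, D)):
# encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 128))
# head = TransformationHead(latent_dim=128, param_dim=3)
# loss = isomorphism_loss(encoder, head, x, x_rotated, rot_params)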
CCS Concepts: Computing methodologies → Motion capture; Image representations; Tracking

        
@inproceedings{10.2312:pg.20251270,
  booktitle = {Pacific Graphics Conference Papers, Posters, and Demos},
  editor    = {Christie, Marc and Han, Ping-Hsuan and Lin, Shih-Syun and Pietroni, Nico and Schneider, Teseo and Tsai, Hsin-Ruey and Wang, Yu-Shuen and Zhang, Eugene},
  title     = {{Learning Transformation-Isomorphic Latent Space for Accurate Hand Pose Estimation}},
  author    = {Ren, Kaiwen and Hu, Lei and Zhang, Zhiheng and Ye, Yongjing and Xia, Shihong},
  year      = {2025},
  publisher = {The Eurographics Association},
  ISBN      = {978-3-03868-295-0},
  DOI       = {10.2312/pg.20251270}
}