Abstract
We study multi-dataset training (MDT) for pose estimation, where skeletal heterogeneity presents a unique challenge that existing methods have yet to address. In traditional domains, e.g. regression and classification, MDT typically relies on dataset merging or multi-head supervision. However, the diversity of skeleton types and limited cross-dataset supervision complicate integration in pose estimation. To address these challenges, we introduce PoseBH, a new MDT framework that tackles keypoint heterogeneity and limited supervision through two key techniques. First, we propose nonparametric keypoint prototypes that learn within a unified embedding space, enabling seamless integration across skeleton types. Second, we develop a cross-type self-supervision mechanism that aligns keypoint predictions with keypoint embedding prototypes, providing supervision without relying on teacher-student models or additional augmentations. PoseBH substantially improves generalization across whole-body and animal pose datasets, including COCO-WholeBody, AP-10K, and APT-36K, while preserving performance on standard human pose benchmarks (COCO, MPII, and AIC). Furthermore, our learned key-point embeddings transfer effectively to hand shape estimation (InterHand2.6M) and human body shape estimation (3DPW). The code for PoseBH is available at: https://github.com/uyoung-jeong/PoseBH.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
| Publisher | IEEE |
| Pages | 12278-12288 |
| Number of pages | 11 |
| ISBN (Electronic) | 9798331543655 |
| ISBN (Print) | 9798331543655 (PoD) |
| DOIs | |
| Publication status | Published - 13 Aug 2025 |
| Event | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition - Music City Center, Nashville, United States Duration: 11 Jun 2025 → 15 Jun 2025 https://cvpr.thecvf.com/virtual/2025/index.html |
Publication series
| Name | IEEE Conference on Computer Vision and Pattern Recognition |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 1063-6919 |
| ISSN (Electronic) | 2575-7075 |
Conference
| Conference | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
|---|---|
| Abbreviated title | CVPR 2025 |
| Country/Territory | United States |
| City | Nashville |
| Period | 11/06/25 → 15/06/25 |
| Internet address |
Keywords
- Training
- Hands
- Shape
- Animals
- Pose estimation
- Prototypes
- Predictive models
- Skeleton
- Pattern recognition
- Standards