PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation

  • Uyoung Jeong
  • , Jonathan Freer
  • , Seungryul Baek
  • , Hyung Jin Chang
  • , Kwang In Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We study multi-dataset training (MDT) for pose estimation, where skeletal heterogeneity presents a unique challenge that existing methods have yet to address. In traditional domains, e.g. regression and classification, MDT typically relies on dataset merging or multi-head supervision. However, the diversity of skeleton types and limited cross-dataset supervision complicate integration in pose estimation. To address these challenges, we introduce PoseBH, a new MDT framework that tackles keypoint heterogeneity and limited supervision through two key techniques. First, we propose nonparametric keypoint prototypes that learn within a unified embedding space, enabling seamless integration across skeleton types. Second, we develop a cross-type self-supervision mechanism that aligns keypoint predictions with keypoint embedding prototypes, providing supervision without relying on teacher-student models or additional augmentations. PoseBH substantially improves generalization across whole-body and animal pose datasets, including COCO-WholeBody, AP-10K, and APT-36K, while preserving performance on standard human pose benchmarks (COCO, MPII, and AIC). Furthermore, our learned key-point embeddings transfer effectively to hand shape estimation (InterHand2.6M) and human body shape estimation (3DPW). The code for PoseBH is available at: https://github.com/uyoung-jeong/PoseBH.
Original languageEnglish
Title of host publication2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PublisherIEEE
Pages12278-12288
Number of pages11
ISBN (Electronic)9798331543655
ISBN (Print)9798331543655 (PoD)
DOIs
Publication statusPublished - 13 Aug 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition - Music City Center, Nashville, United States
Duration: 11 Jun 202515 Jun 2025
https://cvpr.thecvf.com/virtual/2025/index.html

Publication series

NameIEEE Conference on Computer Vision and Pattern Recognition
PublisherIEEE
ISSN (Print)1063-6919
ISSN (Electronic)2575-7075

Conference

Conference2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Abbreviated titleCVPR 2025
Country/TerritoryUnited States
CityNashville
Period11/06/2515/06/25
Internet address

Keywords

  • Training
  • Hands
  • Shape
  • Animals
  • Pose estimation
  • Prototypes
  • Predictive models
  • Skeleton
  • Pattern recognition
  • Standards

Fingerprint

Dive into the research topics of 'PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation'. Together they form a unique fingerprint.

Cite this