BoIR: Box-Supervised Instance Representation for Multi Person Pose Estimation

Uyoung Jeong, Seungryul Baek, Hyung Jin Chang, Kwang In Kim*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Single-stage multi-person human pose estimation (MPPE) methods have shown great performance improvements, but existing methods fail to disentangle features by individual instances under crowded scenes. In this paper, we propose a bounding box-level instance representation learning called BoIR, which simultaneously solves instance detection, instance disentanglement and instance-keypoint association problems. Our new instance embedding loss provides learning signal on the entire area of the image with bounding box annotations, achieving globally consistent and disentangled instance representation. Our method exploits multi-task learning of bottom-up keypoint estimation, bounding box regression and contrastive instance embedding learning, without additional computational cost during inference. We demonstrate that BoIR outperforms state-of-the-arts on COCO (0.5 AP), CrowdPose (4.9 AP) and OCHuman (3.5 AP).
Original languageEnglish
Title of host publicationThe 34th British Machine Vision Conference Proceedings
PublisherBritish Machine Vision Association
Publication statusAccepted/In press - 25 Aug 2023
EventThe 34th British Machine Vision Conference - Aberdeen, United Kingdom
Duration: 20 Nov 202324 Nov 2023

Conference

ConferenceThe 34th British Machine Vision Conference
Country/TerritoryUnited Kingdom
CityAberdeen
Period20/11/2324/11/23

Bibliographical note

Not yet published as of 29/02/2024.

Fingerprint

Dive into the research topics of 'BoIR: Box-Supervised Instance Representation for Multi Person Pose Estimation'. Together they form a unique fingerprint.

Cite this