Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

  • Zicong Fan
  • , Takehiko Ohkawa
  • , Linlin Yang
  • , Nie Lin
  • , Zhishan Zhou
  • , Shihao Zhou
  • , Jiajun Liang
  • , Zhong Gao
  • , Xuanyang Zhang
  • , Xue Zhang
  • , Fei Li
  • , Zheng Liu
  • , Feng Lu
  • , Karim Abou Zeid
  • , Bastian Leibe
  • , Jeongwan On
  • , Seungryul Baek
  • , Aditya Prakash
  • , Saurabh Gupta
  • , Kun He
  • Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We interact with the world with our hands and see it through our own (egocentric) perspective. A holistic 3Dunderstanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation. Accurately reconstructing such interactions in 3D is challenging due to heavy occlusion, viewpoint bias, camera distortion, and motion blur from the head movement. To this end, we designed the HANDS23 challenge based on the AssemblyHands and ARCTIC datasets with carefully designed training and testing splits. Based on the results of the top submitted methods and more recent baselines on the leaderboards, we perform a thorough analysis on 3D hand(-object) reconstruction tasks. Our analysis demonstrates the effectiveness of addressing distortion specific to egocentric cameras, adopting high-capacity transformers to learn complex hand-object interactions, and fusing predictions from different views. Our study further reveals challenging scenarios intractable with state-of-the-art methods, such as fast hand motion, object reconstruction from narrow egocentric views, and close contact between two hands and objects. Our efforts will enrich the community's knowledge foundation and facilitate future hand studies on egocentric hand-object interactions.
Original languageEnglish
Title of host publicationComputer Vision – ECCV 2024
Subtitle of host publication18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXV
PublisherSpringer
Pages428–448
ISBN (Electronic)9783031726989
ISBN (Print)9783031726972
DOIs
Publication statusPublished - 26 Oct 2024
Event18th European Conference on Computer Vision, ECCV 2024 - MiCo, Milan, Italy
Duration: 29 Sept 20244 Oct 2024
https://eccv.ecva.net/

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume15083
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference18th European Conference on Computer Vision, ECCV 2024
Abbreviated titleECCV 2024
Country/TerritoryItaly
CityMilan
Period29/09/244/10/24
Internet address

Keywords

  • cs.CV

Fingerprint

Dive into the research topics of 'Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects'. Together they form a unique fingerprint.

Cite this