Abstract
Reconstructing a 3D hand from a single-view RGB image is challenging due to various hand configurations and depth ambiguity. To reliably reconstruct a 3D hand from a monocular image, most state-of-the-art methods heavily rely on 3D annotations at the training stage, but obtaining 3D annotations is expensive. To alleviate reliance on labeled training data, we propose S2HAND, a self-supervised 3D hand reconstruction network that can jointly estimate pose, shape, texture, and the camera viewpoint. Specifically, we obtain geometric cues from the input image through easily accessible 2D detected keypoints. To learn an accurate hand reconstruction model from these noisy geometric cues, we utilize the consistency between 2D and 3D representations and propose a set of novel losses to rationalize outputs of the neural network. For the first time, we demonstrate the feasibility of training an accurate 3D hand reconstruction network without relying on manual annotations. Our experiments show that the proposed self-supervised method achieves comparable performance with recent fully-supervised methods. The code is available at https://github.com/TerenceCYJ/S2HAND.
Original language | English |
---|---|
Title of host publication | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
Publisher | IEEE |
Pages | 10446-10455 |
Number of pages | 10 |
ISBN (Electronic) | 9781665445092 |
ISBN (Print) | 9781665445108 |
DOIs | |
Publication status | Published - 2 Nov 2021 |
Event | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - Nashville, TN, USA Duration: 20 Jun 2021 → 25 Jun 2021 |
Publication series
Name | Conference on Computer Vision and Pattern Recognition (CVPR) |
---|---|
Publisher | IEEE |
ISSN (Print) | 1063-6919 |
ISSN (Electronic) | 2575-7075 |
Conference
Conference | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
---|---|
Period | 20/06/21 → 25/06/21 |
Bibliographical note
Funding Agency:10.13039/501100012166-National Key Research and Development Program of China
10.13039/501100012226-Fundamental Research Funds for the Central Universities
10.13039/100000001-National Science Foundation
Keywords
- Training
- Solid modeling
- Surface reconstruction
- Three-dimensional displays
- Annotations
- Shape
- Training data