Abstract
In this paper we present the Latent Regression Forest (LRF), a novel framework for real-time, 3D hand pose estimation from a single depth image. In contrast to prior forest-based methods, which take dense pixels as input, classify them independently and then estimate joint positions afterwards, our method can be considered as a structured coarse-to-fine search, starting from the centre of mass of a point cloud until locating all the skeletal joints. The searching process is guided by a learnt Latent Tree Model which reflects the hierarchical topology of the hand. Our main contributions can be summarised as follows: (i) Learning the topology of the hand in an unsupervised, data-driven manner. (ii) A new forest-based, discriminative framework for structured search in images, as well as an error regression step to avoid error accumulation. (iii) A new multi-view hand pose dataset containing 180K annotated images from 10 different subjects. Our experiments show that the LRF out-performs state-of-the-art methods in both accuracy and efficiency.
Original language | English |
---|---|
Title of host publication | Proceedigs of 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), |
Publisher | IEEE Computer Society |
Pages | 3786-3793 |
ISBN (Electronic) | 978-1-4799-5118-5 |
DOIs | |
Publication status | Published - 23 Jun 2014 |
Event | 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - Columbus, OH, USA Duration: 23 Jun 2014 → 28 Jun 2014 |
Conference
Conference | 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) |
---|---|
Period | 23/06/14 → 28/06/14 |