ABSTRACT
We present an integrated, real-time approach for 2D hand pose detection from a monocular RGB image, with a common backbone shared between the bounding box detector and the keypoint detector subnets. This is in contrast to traditional methods which use two separate models for hand localization and keypoint detection with no sharing of features. We build on the popular RetinaNet architecture for object detection and introduce an integrated model which performs both hand localization and keypoint detection in real-time. We evaluate our approach on two different datasets and show evidence that our model obtains accurate results.
- [1] 2020. https://github.com/nihal-rao/Integrated-Real-time-2D-hand-pose.Google Scholar
- A. Boukhayma, R. de Bem, and P. H. S. Torr. 2019. 3D Hand Shape and Pose From Images in the Wild. In CVPR’19. 10835–10844.Google Scholar
- L. Ge, Z. Ren, Y. Li, Z. Xue, Y. Wang, J. Cai, and J. Yuan. 2019. 3D Hand Shape and Pose Estimation from a Single RGB Image. In IEEE CVPR’19.Google Scholar
- F. Gomez-Donoso, S. Orts-Escolano, and M. Cazorla. 2019. Large-scale multiview 3D hand pose dataset. Image and Vision Computing 81 (2019), 25 – 33.Google ScholarDigital Library
- U. Iqbal, P. Molchanov, T. Breuel, J. Gall, and J. Kautz. 2018. Hand Pose Estimation via Latent 2.5D Heatmap Regression. In Computer Vision – ECCV 2018. 125–143.Google Scholar
- Y. Li, X. Wang, W. Liu, and B. Feng. 2020. Pose Anchor: A Single-Stage Hand Keypoint Detection Network. IEEE TCSVT’20 30, 7 (2020), 2104–2113.Google Scholar
- T. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár. 2017. Focal Loss for Dense Object Detection. In IEEE ICCV’17. 2999–3007.Google Scholar
- T. Simon, H. Joo, I. Matthews, and Y. Sheikh. 2017. Hand Keypoint Detection in Single Images Using Multiview Bootstrapping. In IEEE CVPR’17. 4645–4653.Google Scholar
- F. Zhang, V. Bazarevsky, A. Vakunov, A. Tkachenka, G. Sung, Chuo-Ling Chang, and M. Grundmann. 2020. MediaPipe Hands: On-device Real-time Hand Tracking. arxiv:2006.10214 [cs.CV]Google Scholar
Recommendations
Robust hand detection for augmented reality interface
VRCAI '09: Proceedings of the 8th International Conference on Virtual Reality Continuum and its Applications in IndustryFor interactive augmented reality, vision-based and hand-gesture-based interface are most desirable due to being natural and human-friendly. However, detecting hands and recognizing hand gestures in cluttered background are still challenging. Especially,...
Automatic Hand Detection in Color Images based on skin region verification
Among the modern means of communications that appeared recently, there is the natural computer interaction using hands. Several methods have been proposed for their detection in the literature, and the common methods are based on skin color. The ...
2D Hand Detection Using Multi-Feature Skin Model Supervised Cascaded CNN
AbstractHand gesture recognition is one of the most popular Human Computer Interface. The first step in most vision-based gesture recognition system is the hand detection and segmentation. Since hands are involved in a variety of daily tasks, the ...
Comments