KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

Authors

  • Liu Liu Hefei University of Technology
  • Anran Huang Hefei University of Technology
  • Qi Wu Shanghai Jiao Tong University
  • Dan Guo Hefei University of Technology
  • Xun Yang University of Science and Technology of China
  • Meng Wang Hefei University of Technology

DOI:

https://doi.org/10.1609/aaai.v38i4.28158

Keywords:

CV: Vision for Robotics & Autonomous Driving, CV: 3D Computer Vision, CV: Representation Learning for Vision

Abstract

Our life is populated with articulated objects. Current category-level articulation estimation works largely focus on predicting part-level 6D poses on static point cloud observations. In this paper, we tackle the problem of category-level online robust and real-time 6D pose tracking of articulated objects, where we propose KPA-Tracker, a novel 3D KeyPoint based Articulated object pose Tracker. Given an RGB-D image or a partial point cloud at the current frame as well as the estimated per-part 6D poses from the last frame, our KPA-Tracker can effectively update the poses with learned 3D keypoints between the adjacent frames. Specifically, we first canonicalize the input point cloud and formulate the pose tracking as an inter-frame pose increment estimation task. To learn consistent and separate 3D keypoints for every rigid part, we build KPA-Gen that outputs the high-quality ordered 3D keypoints in an unsupervised manner. During pose tracking on the whole video, we further propose a keypoint-based articulation tracking algorithm that mines keyframes as reference for accurate pose updating. We provide extensive experiments on validating our KPA-Tracker on various datasets ranging from synthetic point cloud observation to real-world scenarios, which demonstrates the superior performance and robustness of the KPA-Tracker. We believe that our work has the potential to be applied in many fields including robotics, embodied intelligence and augmented reality. All the datasets and codes are available at https://github.com/hhhhhar/KPA-Tracker.

Published

2024-03-24

How to Cite

Liu, L., Huang, A., Wu, Q., Guo, D., Yang, X., & Wang, M. (2024). KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3684-3692. https://doi.org/10.1609/aaai.v38i4.28158

Issue

Section

AAAI Technical Track on Computer Vision III