Event camera shows great potential in 3D hand pose estimation, especially addressing the challenges of fast motion and high dynamic range in a low-power way. However, due to the asynchronous differential imaging mechanism, it is challenging to design event representation to encode hand motion information especially when the hands are not moving (causing motion ambiguity), and it is infeasible to fully annotate the temporally dense event stream. In this paper, we propose EvHandPose with novel hand flow representations in Event-to-Pose module for accurate hand pose estimation and alleviating the motion ambiguity issue. To solve the problem under sparse annotation, we design contrast maximization and edge constraints in Pose-to-IWE (Image with Warped Events) module and formulate EvHandPose in a self-supervision framework. We further build EvRealHands, the first large-scale real-world event-based hand pose dataset on several challenging scenes to bridge the domain gap due to relying on synthetic data and facilitate future research. Experiments on EvRealHands demonstrate that EvHandPose outperforms previous event-based method under all evaluation scenes with 15 $\sim$ 20 mm lower MPJPE and achieves accurate and stable hand pose estimation in fast motion and strong light scenes compared with RGB-based methods. Furthermore, EvHandPose demonstrates 3D hand pose estimation at 120 fps or higher.
翻译:活动相机显示3D手的3D手的估测潜力巨大,特别是应对快速运动和低功率高动态范围高动态范围的挑战。然而,由于不同步差异成像机制,设计用于编码手动信息的活动演示,特别是当手不动时(造成动作模糊不清),很难设计用于编码手动信息的活动演示(造成动作模糊),无法充分注意到时间密集的事件流。在本文中,我们提议EvHandPose在事件到Pose模块中以新的手流演示方式展示EvHandPose,在事件到Pa-Pose模块中以新的手流表示准确手到Pose的缩缩缩缩缩缩缩图,以提供准确的手估测和减轻动作模糊问题。为了解决在稀释说明下的问题,我们设计了对Pose-IWE-WWWE(与Ward EWA)的最大化和边框限制和边框的EvHandPOS模块在自我监督的15-HM 和马的快速评估方法下展示了前一款的硬片,在15-MD-MFPFPA 和低的快速评估方法下,在硬影中展示了前的快速和低的图像中展示了以前的活动。</s>