In this paper, we present a method for unconstrained end-to-end head pose estimation. We address the problem of ambiguous rotation labels by introducing the rotation matrix formalism for our ground truth data and propose a continuous 6D rotation matrix representation for efficient and robust direct regression. This way, our method can learn the full rotation appearance which is contrary to previous approaches that restrict the pose prediction to a narrow-angle for satisfactory results. In addition, we propose a geodesic distance-based loss to penalize our network with respect to the SO(3) manifold geometry. Experiments on the public AFLW2000 and BIWI datasets demonstrate that our proposed method significantly outperforms other state-of-the-art methods by up to 20\%. We open-source our training and testing code along with our pre-trained models: https://github.com/thohemp/6DRepNet.
翻译:在本文中,我们提出了一个不受限制的终端到终端头部的估计方法。我们通过对地面真相数据采用轮换矩阵形式主义来解决模糊的轮换标签问题,并提议一个连续的6D轮换矩阵表,用于高效和稳健的直接回归。这样,我们的方法可以了解完全轮换的外观,这与以前将构成预测限制在窄角以取得令人满意的结果的做法相悖。此外,我们提议以大地测量的距离为基础的损失来惩罚我们的网络在SO(3)多重几何学方面。对公众的ALFW2000和BIWI数据集的实验表明,我们所提议的方法大大超过其他最先进的方法,最多20个。我们把我们的培训和测试代码与我们经过培训的模型:https://github.com/themp/6DreepNet一起开源。