Deep learning has had remarkable success in robotic perception, but its data-centric nature suffers when it comes to generalizing to ever-changing environments. By contrast, physics-based optimization generalizes better, but it does not perform as well in complicated tasks due to the lack of high-level semantic information and reliance on manual parametric tuning. To take advantage of these two complementary worlds, we present PyPose: a robotics-oriented, PyTorch-based library that combines deep perceptual models with physics-based optimization techniques, so that users can focus on their novel applications. The principle of PyPose is user-friendly, efficient, and interpretable, with a tidy and well-organized architecture. With an imperative style interface, it can be easily integrated into real-world vision and robotic applications. Besides, it supports parallel computing of any order gradients of Lie groups and Lie algebras and $2^{\text{nd}}$-order optimizers, such as trust region methods. Experiments show that PyPose achieves more than $10\times$ speedup in computation compared to the state-of-the-art libraries. To boost future research, we provide concrete examples for several fields of vision and robotic learning, including SLAM, planning, control, and inertial navigation.
翻译:深层次的学习在机器人感知方面取得了显著的成功,但当它向不断变化的环境推广时,它以数据为中心的性质却会受到影响。相反,物理学优化的优化则比较简单,但由于缺少高级语义信息和依赖手动参数调整,它并没有很好地完成复杂的任务。为了利用这两个互补的世界,我们介绍了PyPose:一个以机器人为主的、以PyTorch为主的图书馆,它把深层次的感知模型与基于物理的优化技术结合起来,使用户能够专注于其新的应用。PyPose的原则是方便用户的、高效的和可解释的,有条理的和完善的建筑。如果存在需要的风格界面,它可以很容易地融入到现实世界的视觉和机器人应用中。此外,它支持平行计算任何测序的Lie Group和Lie algebras 以及 2 ⁇ t{nd{nd{d{d_dánd_formall 优化,例如信任的区域方法。实验显示,PyPose在计算中比州级、SLAMA-listrual-lial-lieval listrual 和SAL-dal-listrual fistrucalsalsalsalsalsalsalsalprodustrucalsalsalsalsalsilproductions ex等的计算速度超过10美元。