In the field, robots often need to operate in unknown and unstructured environments, where accurate sensing and state estimation (SE) becomes a major challenge. Cameras have been used to great success in mapping and planning in such environments, as well as complex but quasi-static tasks such as grasping, but are rarely integrated into the control loop for unstable systems. Learning pixel-to-torque control promises to allow robots to flexibly handle a wider variety of tasks. Although they do not present additional theoretical obstacles, learning pixel-to-torque control for unstable systems that that require precise and high bandwidth control still poses a significant practical challenge, and best practices have not yet been established. To help drive reproducible research on the practical aspects of learning pixel-to-torque control, we propose a platform that can flexibly represent the entire process, from lab to deployment, for learning pixel-to-torque control on a robot with fast, unstable dynamics: the vision-based Furuta pendulum. The platform can be reproduced with either off-the-shelf or custom-built hardware. We expect that this platform will allow researchers to quickly and systematically test different approaches, as well as reproduce and benchmark case studies from other labs. We also present a first case study on this system using DNNs which, to the best of our knowledge, is the first demonstration of learning pixel-to-torque control on an unstable system with update rates faster than 100 Hz. A video synopsis can be found online at https://youtu.be/S2llScfG-8E, and in the supplementary material.
翻译:在外地,机器人往往需要在未知和无结构的环境中操作,而准确的感测和状态估计(SE)则成为一项重大挑战。相机已被使用,在这样的环境中,在绘图和规划方面非常成功,以及在掌握等复杂但准静态的任务方面非常成功,但很少被纳入不稳定系统的控制环圈。学习像素到托克控制会允许机器人灵活地处理更广泛的任务。虽然它们并不产生额外的理论障碍,但学习需要精确和高带宽控制的不稳定系统比像素到图克控制(SE)仍是一个重大的实际挑战,而且尚未确立最佳做法。为了帮助对学习像素到托克控制的实际方面进行可复制的研究,我们提议了一个平台,可以灵活地代表整个过程,从实验室到部署,学习具有快速、不稳定动态的机器人的像素到托克控制:基于愿景的Furuta/turculum。这个平台可以复制到离场的或自定制的系统硬件,现在还没有建立最佳的视频或自定制。我们期望这个平台将让研究人员能够从目前的数据库中快速地测试Snal-ral-ral 复制到现在的系统。我们目前的最佳数据库的模型,我们也可以在数据库中进行一个数据库中进行不同的研究。