基于替代性安全行为的无学习碰撞和托克限制的机器人轨迹 (Learning Collision-free and Torque-limited Robot Trajectories based on Alternative Safe Behaviors)

This paper presents an approach to learn online generation of collision-free and torque-limited trajectories for industrial robots. A neural network, which is trained via reinforcement learning, is periodically invoked to predict future motions. For each robot joint, the network outputs the kinematic state that is desired at the end of the current time interval. Compliance with kinematic joint limits is ensured by the design of the action space. Given the current kinematic state and the network prediction, a trajectory for the current time interval can be computed. The main idea of our paper is to execute the predicted motion only if a collision-free and torque-limited way to continue the trajectory is known. In practice, the predicted motion is expanded by a braking trajectory and simulated using a physics engine. If the simulated trajectory complies with all safety constraints, the predicted motion is carried out. Otherwise, the braking trajectory calculated in the previous decision step serves as an alternative safe behavior. For evaluation, up to three simulated robots are trained to reach as many randomly placed target points as possible. We show that our method reliably prevents collisions with static obstacles and collisions between the robots, while generating motions that respect both torque limits and kinematic joint limits. Experiments with a real robot demonstrate that safe trajectories can be generated in real-time.

翻译：本文展示了一种方法来学习工业机器人在网上生成无碰撞和不受反光限制的轨迹。一个通过强化学习培训的神经网络定期被引用来预测未来运动。对于每个机器人联合, 网络产生当前时间间隔结束时所需的运动状态。动作空间的设计确保了对运动联合限制的遵守。鉴于当前的运动状态和网络预测, 可以计算当前时间间隔的轨迹。我们纸的主要想法是, 只有在已知无碰撞和受托限制的方式可以继续轨迹的情况下, 才能执行预测的运动。实际上, 预测的运动会通过制动轨和模拟物理引擎来扩大。如果模拟轨迹符合当前时间间隔结束时所期望的动性状态, 预测的运动就会被进行。否则, 上一个决定步骤所计算的运动轨迹可以作为一种替代性的安全行为。评估时, 最多可以训练三个模拟的机器人尽可能随机地到达许多目标点。我们显示, 我们的方法可以可靠地防止碰撞, 与静止障碍和模拟机器人之间碰撞, 并且同时以真实的动作限制来显示真实的机体动作限制。。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html