学习受 Kiinematic 联合制约的机器人学习轨迹 (Learning Robot Trajectories subject to Kinematic Joint Constraints)

We present an approach to learn fast and dynamic robot motions without exceeding limits on the position $\theta$, velocity $\dot{\theta}$, acceleration $\ddot{\theta}$ and jerk $\dddot{\theta}$ of each robot joint. Movements are generated by mapping the predictions of a neural network to safely executable joint accelerations. The neural network is invoked periodically and trained via reinforcement learning. Our main contribution is an analytical procedure for calculating safe joint accelerations, which considers the prediction frequency $f_N$ of the neural network. As a result, the frequency $f_N$ can be freely chosen and treated as a hyperparameter. We show that our approach is preferable to penalizing constraint violations as it provides explicit guarantees and does not distort the desired optimization target. In addition, the influence of the selected prediction frequency on the learning performance and on the computing effort is highlighted by various experiments.

翻译：我们提出一种方法来学习快速和动态机器人运动,而不会超过对每个机器人联合体的位置的限制,即$(theta),速度$(dot),速度$(theta),加速$(dddt) 美元(trick $(dddddt) 美元(theta) 美元。运动是通过绘制神经网络的预测,以安全地执行联合加速器而产生的。神经网络通过强化学习定期被援引和培训。我们的主要贡献是计算安全联合加速器的分析程序,该程序考虑到神经网络的预测频率$f_N美元。因此,可以自由选择美元频率,并将其作为超光量计处理。我们表明,我们的方法更可取于惩罚违反限制措施的行为,因为它提供了明确的保证,而且不会扭曲理想的优化目标。此外,选定的预测频率对学习表现和计算努力的影响也得到了各种实验的强调。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【干货书】机器学习特征工程，217页pdf

专知会员服务

128+阅读 · 2021年2月6日

Python计算导论，560页pdf，Introduction to Computing Using Python

专知会员服务

76+阅读 · 2020年5月5日