Industrial robots can solve very complex tasks in controlled environments, but modern applications require robots able to operate in unpredictable surroundings as well. An increasingly popular reactive policy architecture in robotics is Behavior Trees but as with other architectures, programming time still drives cost and limits flexibility. There are two main branches of algorithms to generate policies automatically, automated planning and machine learning, both with their own drawbacks. We propose a method for generating Behavior Trees using a Genetic Programming algorithm and combining the two branches by taking the result of an automated planner and inserting it into the population. Experimental results confirm that the proposed method of combining planning and learning performs well on a variety of robotic assembly problems and outperforms both of the base methods used separately. We also show that this type of high level learning of Behavior Trees can be transferred to a real system without further training.
翻译:工业机器人可以在受控制的环境中解决非常复杂的任务,但现代应用要求机器人能够在不可预测的环境中运作。机器人中日益流行的被动政策架构是行为树,但与其他建筑一样,程序制作时间仍然会推动成本和限制灵活性。有两个主要的算法分支可以自动生成政策,自动规划和机器学习,两者都有自己的缺点。我们建议一种方法,利用基因编程算法生成行为树,并将两个分支结合在一起,方法是采用自动化规划师的结果并将其插入人口。实验结果证实,将规划和学习相结合的拟议方法在各种机器人组装问题上表现良好,并且优于两种分别使用的基础方法。我们还表明,这种高水平的“行为树”学习无需进一步培训就可以转移到一个真正的系统。