Forestry machines are heavy vehicles performing complex manipulation tasks in unstructured production forest environments. Together with the complex dynamics of the on-board hydraulically actuated cranes, the rough forest terrains have posed a particular challenge in forestry automation. In this study, the feasibility of applying reinforcement learning control to forestry crane manipulators is investigated in a simulated environment. Our results show that it is possible to learn successful actuator-space control policies for energy efficient log grasping by invoking a simple curriculum in a deep reinforcement learning setup. Given the pose of the selected logs, our best control policy reaches a grasping success rate of 97%. Including an energy-optimization goal in the reward function, the energy consumption is significantly reduced compared to control policies learned without incentive for energy optimization, while the increase in cycle time is marginal. The energy-optimization effects can be observed in the overall smoother motion and acceleration profiles during crane manipulation.
翻译:林业机械是从事结构化生产森林环境中复杂操作任务的重型车辆。连同机载液压起重机的复杂动态,粗林地形对林业自动化构成特别的挑战。在这项研究中,对对林业起重机操纵机应用强化学习控制的可行性进行了模拟环境调查。我们的结果表明,有可能在深层强化学习设置中采用简单的课程,学习成功的激励器-空间控制政策来捕捉节能原木。鉴于所选的原木,我们的最佳控制政策达到了97%的成功率。在奖励功能中包括能源优化目标,能源消耗大大低于在没有能源优化奖励的情况下所学的控制政策,而周期时间的增加是边际的。在起重机操纵期间,在总体平稳运动和加速情况中可以观察到能源优化效应。