Socially aware robots should be able, among others, to support fluent human-robot collaboration in tasks that require interdependent actions in order to be solved. Towards enhancing mutual performance, collaborative robots should be equipped with adaptation and learning capabilities. However, co-learning can be a time consuming procedure. For this reason, transferring knowledge from an expert could potentially boost the overall team performance. In the present study, transfer learning was integrated in a deep Reinforcement Learning (dRL) agent. In a real-time and real-world set-up, two groups of participants had to collaborate with a cobot under two different conditions of dRL agents; one that was transferring knowledge and one that did not. A probabilistic policy reuse method was used for the transfer learning (TL). The results showed that there was a significant difference between the performance of the two groups; TL halved the time needed for the training of new participants to the task. Moreover, TL also affected the subjective performance of the teams and enhanced the perceived fluency. Finally, in many cases the objective performance metrics did not correlate with the subjective ones providing interesting insights about the design of transparent and explainable cobot behaviour.
翻译:具有社会意识的机器人应该能够支持流畅的人类机器人合作完成需要相互依赖的行动才能解决的任务。为了提高相互性能,协作机器人应当配备适应和学习能力。然而,共同学习可以是一个耗时的程序。为此,从专家转让知识可以提高团队总体业绩。在本研究中,转移学习被纳入深强化学习(dRL)代理。在实时和现实世界设置中,两组参与者不得不在 dRL 代理的两个不同条件下与cobt合作;一个是转让知识,另一个没有。在转移学习中使用一种概率性政策再利用方法(TL)。结果显示,两个小组的业绩有很大差异;TL将培训新参与者所需的时间减半。此外,TL还影响团队的主观性能,提高了人们所感觉到的流畅度。最后,在很多情况下,客观的绩效衡量标准与提供关于透明、解释设计行为的令人感兴趣的主观行为没有关联。