In the quest for autonomous agents learning open-ended repertoires of skills, most works take a Piagetian perspective: learning trajectories are the results of interactions between developmental agents and their physical environment. The Vygotskian perspective, on the other hand, emphasizes the centrality of the socio-cultural environment: higher cognitive functions emerge from transmissions of socio-cultural processes internalized by the agent. This paper argues that both perspectives could be coupled within the learning of autotelic agents to foster their skill acquisition. To this end, we make two contributions: 1) a novel social interaction protocol called Help Me Explore (HME), where autotelic agents can benefit from both individual and socially guided exploration. In social episodes, a social partner suggests goals at the frontier of the learning agent knowledge. In autotelic episodes, agents can either learn to master their own discovered goals or autonomously rehearse failed social goals; 2) GANGSTR, a graph-based autotelic agent for manipulation domains capable of decomposing goals into sequences of intermediate sub-goals. We show that when learning within HME, GANGSTR overcomes its individual learning limits by mastering the most complex configurations (e.g. stacks of 5 blocks) with only few social interventions.
翻译:在寻求自主代理者学习开放的技能库时,大多数工作都采用Piagetian视角:学习轨迹是发展代理者及其物理环境相互作用的结果。Vygotskian视角另一方面强调社会文化环境的中心作用:较高的认知功能产生于该代理者内在的社会文化过程的传播。本文认为,两种观点都可以在学习自动化代理者以培养其技能获取过程中结合。为此,我们做出两项贡献:1) 名为“帮助我探索”(HME)的新颖的社会互动协议,其中自动化代理者可以从个人和社会指导的探索中获益。在社会舞台上,一个社会伙伴在学习代理者知识的前沿提出目标。在自发性案例中,代理者要么学会掌握自己发现的目标,要么自主地排练失败的社会目标;2) GANGSTRA,一个基于图表的自动化代理者,用于操纵能够将目标分解成中间次级目标序列的域。我们显示,在HME内部学习时,GANGSTRATER只通过掌握最复杂的堆式结构来克服其个人学习极限。