Modern Artificial Intelligence (AI) systems excel at diverse tasks, from image classification to strategy games, even outperforming humans in many of these domains. After making astounding progress in language learning in the recent decade, AI systems, however, seem to approach the ceiling that does not reflect important aspects of human communicative capacities. Unlike human learners, communicative AI systems often fail to systematically generalize to new data, suffer from sample inefficiency, fail to capture common-sense semantic knowledge, and do not translate to real-world communicative situations. Cognitive Science offers several insights on how AI could move forward from this point. This paper aims to: (1) suggest that the dominant cognitively-inspired AI directions, based on nativist and symbolic paradigms, lack necessary substantiation and concreteness to guide progress in modern AI, and (2) articulate an alternative, "grounded", perspective on AI advancement, inspired by Embodied, Embedded, Extended, and Enactive Cognition (4E) research. I review results on 4E research lines in Cognitive Science to distinguish the main aspects of naturalistic learning conditions that play causal roles for human language development. I then use this analysis to propose a list of concrete, implementable components for building "grounded" linguistic intelligence. These components include embodying machines in a perception-action cycle, equipping agents with active exploration mechanisms so they can build their own curriculum, allowing agents to gradually develop motor abilities to promote piecemeal language development, and endowing the agents with adaptive feedback from their physical and social environment. I hope that these ideas can direct AI research towards building machines that develop human-like language abilities through their experiences with the world.
翻译:现代人工智能(AI)系统在从图像分类到战略游戏等不同任务方面表现优异,甚至在许多这些领域中表现优异的人。在近十年来语言学习取得了令人惊叹的进展之后,AI系统似乎接近了没有反映人类交流能力重要方面的上限。与人类学习者不同的是,交流的AI系统往往不能系统化地向新数据推广,缺乏效率,无法捕捉常识性流传学知识,也无法转化为真实世界的交流状况。认知科学为AI如何从这个角度前进提供了一些深刻的见解。本文的目的是:(1) 表明,基于授能和象征模式的主导性认知性启发性AI方向没有反映人类交流能力的重要方面。 与人类学习系统相比,交流性人工智能系统系统往往无法系统化为新数据提供“基础”视角,无法捕捉普通的、隐蔽的、扩展的和活跃的直流传学知识。 我审查了4E类研究的研究成果。 认知性科学中的4E类研究项目可以逐步建立自己的语言意识, 从而将自我思维的动力和感化的动力用于自然学的动力分析。