There has been an emerging paradigm shift from the era of "internet AI" to "embodied AI", where AI algorithms and agents no longer learn from datasets of images, videos or text curated primarily from the internet. Instead, they learn through interactions with their environments from an egocentric perception similar to humans. Consequently, there has been substantial growth in the demand for embodied AI simulators to support various embodied AI research tasks. This growing interest in embodied AI is beneficial to the greater pursuit of Artificial General Intelligence (AGI), but there has not been a contemporary and comprehensive survey of this field. This paper aims to provide an encyclopedic survey for the field of embodied AI, from its simulators to its research. By evaluating nine current embodied AI simulators with our proposed seven features, this paper aims to understand the simulators in their provision for use in embodied AI research and their limitations. Lastly, this paper surveys the three main research tasks in embodied AI -- visual exploration, visual navigation and embodied question answering (QA), covering the state-of-the-art approaches, evaluation metrics and datasets. Finally, with the new insights revealed through surveying the field, the paper will provide suggestions for simulator-for-task selections and recommendations for the future directions of the field.
翻译:从“互联网AI”时代到“互联网AI”时代的范式正在出现转变,大赦国际的算法和代理人不再从主要由互联网制作的图像、视频或文本的数据集中学习。相反,他们通过与环境的互动学习,从与人类相似的自我中心观念中学习。因此,对包含AI的模拟器的需求大幅增长,以支持各种体现的AI研究任务。对体现的AI的日益关注有利于进一步追求人工通用情报(AGI),但还没有对这一领域进行当代的全面调查。本文旨在为包含的AI领域提供一个百科全书调查,从其模拟器到其研究。通过评估9个当前体现的AI模拟器及其拟议的7个特征,本文件旨在理解其提供用于包含的AI研究及其局限性的模拟器。最后,本文件对包含的AI的三种主要研究任务 -- -- 视觉探索、视觉导航和包含的问题回答(QA) -- -- 涵盖最新艺术方法、评价指标和数据解析器领域领域,通过实地调查提供新的实地选择方向的建议。最后,为实地选择者提供新的实地选择方向的建议。