We present a retrospective on the state of Embodied AI research. Our analysis focuses on 13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language. We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of-the-art models. We highlight commonalities between top approaches to the challenges and identify potential future directions for Embodied AI research.
翻译:我们回顾了人工智能成形研究的现状,我们的分析侧重于在CVPR的人工智能成形研究研讨会上提出的13项挑战。这些挑战分为三个主题:(1)视觉导航,(2)重新安排,(3)包含愿景和语言。我们讨论了每个主题的主要数据集、挑战的评估指标以及最新模型的绩效。我们强调了应对挑战的最先进方法之间的共同点,并确定了人工智能成形研究的潜在未来方向。