Many approaches to Natural Language Processing (NLP) tasks often treat them as single-step problems, where an agent receives an instruction, executes it, and is evaluated based on the final outcome. However, human language is inherently interactive, as evidenced by the back-and-forth nature of human conversations. In light of this, we posit that human-AI collaboration should also be interactive, with humans monitoring the work of AI agents and providing feedback that the agent can understand and utilize. Further, the AI agent should be able to detect when it needs additional information and proactively ask for help. Enabling this scenario would lead to more natural, efficient, and engaging human-AI collaborations. In this work, we explore these directions using the challenging task defined by the IGLU competition, an interactive grounded language understanding task in a MineCraft-like world. We explore multiple types of help players can give to the AI to guide it and analyze the impact of this help in AI behavior, resulting in performance improvements.
翻译:许多自然语言处理(NLP)任务的方法通常将它们视为单步问题,其中代理接收指令,执行它,并根据最终结果进行评估。但是,人类语言本质上是交互性的,这可以从人类对话的来回交流的性质中得到证明。考虑到这一点,我们认为人工智能协作应该也是交互式的,人类监视人工智能代理的工作,并提供代理可以理解和利用的反馈。此外,AI代理应该能够检测到其需要更多信息的情况,并主动请求帮助。实现这种情况将导致更自然,更高效,更引人入胜的人与AI协作。在这项工作中,我们在MineCraft样式的世界中使用由IGLU竞赛定义的具有交互性的基于实体的语言理解任务。我们探讨了玩家可以给予AI的多种类型的帮助来指导它,并分析了这种帮助对AI行为的影响,从而导致性能改进。