When deployed, AI agents will encounter problems that are beyond their autonomous problem-solving capabilities. Leveraging human assistance can help agents overcome their inherent limitations and robustly cope with unfamiliar situations. We present a general interactive framework that enables an agent to determine and request contextually useful information from an assistant, and to incorporate rich forms of responses into its decision-making process. We demonstrate the practicality of our framework on a simulated human-assisted navigation problem. Aided with an assistance-requesting policy learned by our method, a navigation agent achieves up to a 7x improvement in success rate on tasks that take place in previously unseen environments, compared to fully autonomous behavior. We show that the agent can take advantage of different types of information depending on the context, and analyze the benefits and challenges of learning the assistance-requesting policy when the assistant can recursively decompose tasks into subtasks.
翻译:在部署时,AI代理商将遇到超出其自主解决问题能力范围的问题。利用人力援助可以帮助代理商克服其固有的局限性,并强有力地应对不熟悉的情况。我们提出了一个一般性的互动框架,使代理商能够确定和请求助理提供符合具体情况的信息,并将丰富的应对措施纳入决策过程。我们展示了我们模拟人手导航问题框架的实用性。借助我们方法所学的援助请求政策,导航代理商在以往的不为人知的环境中完成的任务的成功率比完全自主的行为提高了7x倍。我们表明,该代理商可以根据具体情况利用不同类型的信息,分析在助理能够将任务重复到子任务时学习援助请求政策的好处和挑战。