In recent years, the role of artificially intelligent (AI) agents has evolved from being basic tools to socially intelligent agents working alongside humans towards common goals. In such scenarios, the ability to predict future behavior by observing past actions of their human teammates is highly desirable in an AI agent. Goal-oriented human behavior is complex, hierarchical, and unfolds across multiple timescales. Despite this observation, relatively little attention has been paid towards using multi-timescale features to model such behavior. In this paper, we propose an LSTM network architecture that processes behavioral information at multiple timescales to predict future behavior. We demonstrate that our approach for modeling behavior in multiple timescales substantially improves prediction of future behavior compared to methods that do not model behavior at multiple timescales. We evaluate our architecture on data collected in an urban search and rescue scenario simulated in a virtual Minecraft-based testbed, and compare its performance to that of a number of valid baselines as well as other methods that do not process inputs at multiple timescales.
翻译:近年来,人工智能(AI)剂的作用已经从基本工具演变为社会智能剂,与人类一道为共同目标而工作。在这样的情景中,通过观察人类团队伙伴过去的行动来预测未来行为的能力在AI代理中是非常可取的。面向目标的人类行为是复杂的,等级分级的,跨越多个时间尺度。尽管如此,对使用多时间尺度特征来模拟这种行为的注意相对较少。在本文中,我们提议建立一个LSTM网络架构,在多个时间尺度上处理行为信息,以预测未来行为。我们证明,在多个时间尺度上,我们模拟行为的方法大大改进了未来行为的预测,而不是在多个时间尺度上模拟行为的方法。我们评估了在虚拟基于地雷的试验台模拟的城市搜索和救援情景中收集的数据结构,并将其绩效与若干有效基线的绩效以及其他不处理多个时间尺度投入的方法进行比较。