As autonomous agents enter complex environments, it becomes more difficult to adequately model the interactions between the two. Agents must therefore cope with greater ambiguity (e.g., unknown environments, underdefined models, and vague problem definitions). Despite the consequences of ignoring ambiguity, tools for decision making under ambiguity are understudied. The general approach has been to avoid ambiguity (exploit known information) using robust methods. This work contributes ambiguity attitude graph search (AAGS), generalizing robust methods with ambiguity attitudes--the ability to trade-off between seeking and avoiding ambiguity in the problem. AAGS solves online decision making problems with limited budget to learn about their environment. To evaluate this approach AAGS is tasked with path planning in static and dynamic environments. Results demonstrate that appropriate ambiguity attitudes are dependent on the quality of information from the environment. In relatively certain environments, AAGS can readily exploit information with robust policies. Conversely, model complexity reduces the information conveyed by individual samples; this allows the risks taken by optimistic policies to achieve better performance.
翻译:随着自主代理商进入复杂的环境,就更难充分模拟两者之间的相互作用。因此,代理商必须应对更大的模糊性(例如,未知的环境、定义不足的模式和模糊的问题定义)。尽管忽视模糊性的后果,但在模糊性的决策工具方面研究不足。一般的做法是使用稳健的方法避免模糊性(开发已知信息),这项工作有助于模糊性态度图搜索(AGS),推广稳健方法,在寻求和避免问题之间取舍的模棱两可性。AGS解决了在线决策问题,预算有限,了解环境。评估AAGS的任务是在静态和动态环境中进行路径规划。结果表明,适当的模糊性态度取决于环境信息的质量。在相对特定的环境中,AGS可以随时以稳健的政策利用信息。相反,模型的复杂性会减少单个样本提供的信息;这就使得乐观政策所冒的风险,从而取得更好的业绩。</s>