The level of autonomy is increasing in systems spanning multiple domains, but these systems still experience failures. One way to mitigate the risk of failures is to integrate human oversight of the autonomous systems and rely on the human to take control when the autonomy fails. In this work, we formulate a method of collaborative decision making through action suggestions that improves action selection without taking control of the system. Our approach uses each suggestion efficiently by incorporating the implicit information shared through suggestions to modify the agent's belief and achieves better performance with fewer suggestions than naively following the suggested actions. We assume collaborative agents share the same objective and communicate through valid actions. By assuming the suggested action is dependent only on the state, we can incorporate the suggested action as an independent observation of the environment. The assumption of a collaborative environment enables us to use the agent's policy to estimate the distribution over action suggestions. We propose two methods that use suggested actions and demonstrate the approach through simulated experiments. The proposed methodology results in increased performance while also being robust to suboptimal suggestions.
翻译:在涉及多个领域的系统中,自治程度正在不断提高,但这些系统仍然经历失败。减少失败风险的一种方法是整合自主系统的人的监督,在自主失败时依靠人来控制。在这项工作中,我们通过行动建议来制定合作决策方法,改进行动选择,而不受系统控制。我们的方法通过采纳通过修改代理人信仰的建议而分享的隐含信息来高效地使用每一种建议,在建议的行动之后,以比天真的建议更好地实现业绩。我们假定合作者共享同一目标,并通过有效行动进行沟通。假设建议的行动仅取决于国家,我们可以将建议的行动纳入,作为对环境的独立观察。假设合作环境使我们能够利用代理人的政策来估计行动建议的分布情况。我们建议采用两种方法,使用建议的行动并通过模拟试验来展示方法。拟议方法的结果是提高业绩,同时对不完善的建议也十分有力。