Reasoning with declarative knowledge (RDK) and sequential decision-making (SDM) are two key research areas in artificial intelligence. RDK methods reason with declarative domain knowledge, including commonsense knowledge, that is either provided a priori or acquired over time, while SDM methods (probabilistic planning and reinforcement learning) seek to compute action policies that maximize the expected cumulative utility over a time horizon; both classes of methods reason in the presence of uncertainty. Despite the rich literature in these two areas, researchers have not fully explored their complementary strengths. In this paper, we survey algorithms that leverage RDK methods while making sequential decisions under uncertainty. We discuss significant developments, open problems, and directions for future work.
翻译:以宣示性知识(RDK)和顺序决策(SDM)为根据,是人工智能的两个关键研究领域。RDK方法与宣示性领域知识(包括常识知识)为根据,这些知识要么先验提供,要么随时间推移获得,而SDM方法(概率规划和强化学习)力求计算出行动政策,在时间范围内最大限度地实现预期的累积效用;两种方法都存在不确定性。尽管这两个领域有丰富的文献,但研究人员并未充分探索其互补优势。在本文件中,我们调查了利用RDK方法并在不确定的情况下作出先后决策的算法。我们讨论了重大动态、公开问题和今后工作的方向。