The integration of reasoning, learning, and decision-making is key to build more general AI systems. As a step in this direction, we propose a novel neural-logic architecture that can solve both inductive logic programming (ILP) and deep reinforcement learning (RL) problems. Our architecture defines a restricted but expressive continuous space of first-order logic programs by assigning weights to predicates instead of rules. Therefore, it is fully differentiable and can be efficiently trained with gradient descent. Besides, in the deep RL setting with actor-critic algorithms, we propose a novel efficient critic architecture. Compared to state-of-the-art methods on both ILP and RL problems, our proposition achieves excellent performance, while being able to provide a fully interpretable solution and scaling much better, especially during the testing phase.
翻译:整合推理、学习和决策是建立更一般性的AI系统的关键。 作为朝这个方向迈出的一步,我们提出了一个新的神经-逻辑结构,它既能解决感性逻辑编程(ILP)问题,又能解决深层强化学习(RL)问题。我们的结构通过将权重分配给上游而不是规则,界定了一级逻辑程序的有限但明确的持续空间。因此,它完全可以区分,并且可以受到梯度下降的高效培训。此外,在与行为者-批评算法的深层RL环境中,我们提出了一个新的高效的批评结构。相比于ITP和RL问题的最新方法,我们的建议取得了卓越的成绩,同时能够提供完全可解释的解决办法,并推广得更好,特别是在测试阶段。