可区别逻辑机 (Differentiable Logic Machines)

The integration of reasoning, learning, and decision-making is key to build more general AI systems. As a step in this direction, we propose a novel neural-logic architecture that can solve both inductive logic programming (ILP) and deep reinforcement learning (RL) problems. Our architecture defines a restricted but expressive continuous space of first-order logic programs by assigning weights to predicates instead of rules. Therefore, it is fully differentiable and can be efficiently trained with gradient descent. Besides, in the deep RL setting with actor-critic algorithms, we propose a novel efficient critic architecture. Compared to state-of-the-art methods on both ILP and RL problems, our proposition achieves excellent performance, while being able to provide a fully interpretable solution and scaling much better, especially during the testing phase.

翻译：整合推理、学习和决策是建立更一般性的AI系统的关键。作为朝这个方向迈出的一步,我们提出了一个新的神经-逻辑结构,它既能解决感性逻辑编程(ILP)问题,又能解决深层强化学习(RL)问题。我们的结构通过将权重分配给上游而不是规则,界定了一级逻辑程序的有限但明确的持续空间。因此,它完全可以区分,并且可以受到梯度下降的高效培训。此外,在与行为者-批评算法的深层RL环境中,我们提出了一个新的高效的批评结构。相比于ITP和RL问题的最新方法,我们的建议取得了卓越的成绩,同时能够提供完全可解释的解决办法,并推广得更好,特别是在测试阶段。

相关内容

ILP

关注 130

归纳逻辑程序设计（ILP）是机器学习的一个分支，它依赖于逻辑程序作为一种统一的表示语言来表达例子、背景知识和假设。基于一阶逻辑的ILP具有很强的表示形式，为多关系学习和数据挖掘提供了一种很好的方法。International Conference on Inductive Logic Programming系列始于1991年，是学习结构化或半结构化关系数据的首要国际论坛。最初专注于逻辑程序的归纳，多年来，它大大扩展了研究范围，并欢迎在逻辑学习、多关系数据挖掘、统计关系学习、图形和树挖掘等各个方面作出贡献，学习其他（非命题）基于逻辑的知识表示框架，探索统计学习和其他概率方法的交叉点。官网链接：https://ilp2019.org/