Human reasoning can distill principles from observed patterns and generalize them to explain and solve novel problems. The most powerful artificial intelligence systems lack explainability and symbolic reasoning ability, and have therefore not achieved supremacy in domains requiring human understanding, such as science or common sense reasoning. Here we introduce deep distilling, a machine learning method that learns patterns from data using explainable deep learning and then condenses it into concise, executable computer code. The code, which can contain loops, nested logical statements, and useful intermediate variables, is equivalent to the neural network but is generally orders of magnitude more compact and human-comprehensible. On a diverse set of problems involving arithmetic, computer vision, and optimization, we show that deep distilling generates concise code that generalizes out-of-distribution to solve problems orders-of-magnitude larger and more complex than the training data. For problems with a known ground-truth rule set, deep distilling discovers the rule set exactly with scalable guarantees. For problems that are ambiguous or computationally intractable, the distilled rules are similar to existing human-derived algorithms and perform at par or better. Our approach demonstrates that unassisted machine intelligence can build generalizable and intuitive rules explaining patterns in large datasets that would otherwise overwhelm human reasoning.
翻译:人类的推理可以从所观察到的模式中提取原则,将其概括化,以解释和解决新问题。最强大的人工智能系统缺乏解释性和象征性推理能力,因此在需要人类理解的领域,例如科学或常识推理,没有达到至高无上的地位。在这里,我们引入了深层蒸馏,这是一种机器学习方法,用可解释的深层次学习的数据来学习模式,然后将其压缩为简洁、可执行的计算机代码。该代码可以包含循环、嵌套逻辑声明和有用的中间变量,它与神经网络相当,但一般而言,其规模更紧凑,人更能理解。在涉及算术、计算机视觉和优化的多种问题中,我们显示深层蒸馏产生简洁的代码,它能将分配范围外的问题普遍化,解决比培训数据更为广泛和复杂的各种问题。对于已知的地精度规则设置的问题,深层蒸馏中发现规则与可缩缩的保证完全相同。对于模糊或计算难解的问题,精准的规则与现有的人造算算算算法相近似,而精准的规则与现有的人造的算算算算算算算算算法方法相似,在高或更精确或更精确地解释人的推理学中,我们不重的模型中可以超越一般的推解方法。