Vampire has been for a long time the strongest first-order automatic theorem prover, widely used for hammer-style proof automation in ITPs such as Mizar, Isabelle, HOL, and Coq. In this work, we considerably improve the performance of Vampire in hammering over the full Mizar library by enhancing its saturation procedure with efficient neural guidance. In particular, we employ a recently proposed recursive neural network classifying the generated clauses based only on their derivation history. Compared to previous neural methods based on considering the logical content of the clauses, our architecture makes evaluating a single clause much less time consuming. The resulting system shows good learning capability and improves on the state-of-the-art performance on the Mizar library, while proving many theorems that the related ENIGMA system could not prove in a similar hammering evaluation.
翻译:长期以来,吸血鬼一直是最强的第一阶自动定理验证器,在诸如Mizar、Isabelle、HOL和Coq等ITP中广泛用于锤式验证自动化。 在这项工作中,我们通过提高吸血鬼在全Mizar图书馆的饱和程序并辅以有效的神经指导,大大改进了吸血鬼在整座Mizar图书馆上的表现。特别是,我们最近建议使用一个循环神经网络,仅根据其衍生历史对产生的条款进行分类。与以前基于考虑条款逻辑内容的神经方法相比,我们的结构使得对一个单一条款的评价花费的时间要少得多。 由此形成的系统显示了良好的学习能力,并改进了Mizar图书馆的最新表现,同时证明了许多相关的ENIGMA系统无法在类似的敲锤评价中证明的理论。