As a step towards improving the abstract reasoning capability of machines, we aim to solve Raven's Progressive Matrices (RPM) with neural networks, since solving RPM puzzles is highly correlated with human intelligence. Unlike previous methods that use auxiliary annotations or assume hidden rules to produce appropriate feature representation, we only use the ground truth answer of each question for model learning, aiming for an intelligent agent to have a strong learning capability with a small amount of supervision. Based on the RPM problem formulation, the correct answer filled into the missing entry of the third row/column has to best satisfy the same rules shared between the first two rows/columns. Thus we design a simple yet effective Dual-Contrast Network (DCNet) to exploit the inherent structure of RPM puzzles. Specifically, a rule contrast module is designed to compare the latent rules between the filled row/column and the first two rows/columns; a choice contrast module is designed to increase the relative differences between candidate choices. Experimental results on the RAVEN and PGM datasets show that DCNet outperforms the state-of-the-art methods by a large margin of 5.77%. Further experiments on few training samples and model generalization also show the effectiveness of DCNet. Code is available at https://github.com/visiontao/dcnet.
翻译:作为提高机器抽象推理能力的一个步骤,我们的目标是用神经网络解决雷文进步矩阵(RPM)的神经网络问题,因为解决RPM谜题与人类智能高度相关。与以前使用辅助说明或假设隐藏规则来产生适当特征代表的方法不同,我们只使用每个问题的地面真相答案进行示范学习,目的是让智能代理拥有强大的学习能力,并有少量监督。根据RPM问题配制,填补第三行/库伦缺失条目的正确答案必须最好地满足前两行/库伦共享的相同规则。因此,我们设计了一个简单而有效的双轨网(DCNet)来利用RPM谜题的内在结构。具体地说,一个规则对比模块旨在比较填充的行/库/库/库之间潜在的潜在规则;一个选择对比模块旨在增加候选人选择的相对差异。RAPVEN和PGM数据集的实验结果显示,DCNet超越了数行/库的模型/网际模型。在http://com 通用的模型中显示一个大的模型/域网域网域的效能。