We consider the problem of combining machine learning models to perform higher-level cognitive tasks with clear specifications. We propose the novel problem of Visual Discrimination Puzzles (VDP) that requires finding interpretable discriminators that classify images according to a logical specification. Humans can solve these puzzles with ease and they give robust, verifiable, and interpretable discriminators as answers. We propose a compositional neurosymbolic framework that combines a neural network to detect objects and relationships with a symbolic learner that finds interpretable discriminators. We create large classes of VDP datasets involving natural and artificial images and show that our neurosymbolic framework performs favorably compared to several purely neural approaches.
翻译:我们考虑了将机器学习模式结合起来以完成更高层次的认知任务和明确规格的问题。我们提出了视觉歧视拼图(VDP)的新问题,这需要找到可解释的歧视问题,按照逻辑规格对图像进行分类。人类可以轻松地解答这些谜题,并给出了强健、可核查和可解释的歧义作为解答。我们提出了一个组成性神经共振框架,将神经网络与发现可解释歧视对象的象征性学习者检测对象和关系结合起来。我们创造了大量涉及自然和人工图像的VDP数据集,并表明我们的神经共振框架比一些纯粹的神经方法运行得更好。