The detection of semantic relationships between objects represented in an image is one of the fundamental challenges in image interpretation. Neural-Symbolic techniques, such as Logic Tensor Networks (LTNs), allow the combination of semantic knowledge representation and reasoning with the ability to efficiently learn from examples typical of neural networks. We here propose Faster-LTN, an object detector composed of a convolutional backbone and an LTN. To the best of our knowledge, this is the first attempt to combine both frameworks in an end-to-end training setting. This architecture is trained by optimizing a grounded theory which combines labelled examples with prior knowledge, in the form of logical axioms. Experimental comparisons show competitive performance with respect to the traditional Faster R-CNN architecture.
翻译:发现图像所代表的物体之间的语义关系是图像判读的基本挑战之一。神经-交替技术,如逻辑天线网络(LTNs)允许将语义知识的表述和推理与有效学习典型神经网络实例的能力相结合。我们在此提议由革命骨干和LTN组成的天体探测器Apper-LTN。据我们所知,这是在端至端培训环境中将两个框架结合起来的第一次尝试。这一结构通过优化一个基础理论,将标注的事例与先前的知识相结合,以逻辑轴为形式。实验比较显示传统的快速R-CNN结构的竞争性表现。