Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still rely on recognizing object instances individually, without exploiting their relations during learning. This work proposes an object relation module. It processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations. It is lightweight and in-place. It does not require additional supervision and is easy to embed in existing networks. It is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline. It verifies the efficacy of modeling object relations in CNN based detection. It gives rise to the first fully end-to-end object detector.
翻译:虽然多年来人们一直相信,物体之间的建模关系将有助于物体识别,但并没有证据表明这种想法在深层次的学习时代发挥作用,所有最先进的物体探测系统仍然依靠个别识别物体情况,而没有在学习期间利用它们之间的关系。这项工作提议了一个物体关系模块。它通过物体外观特征和几何的相互作用,同时处理一系列物体,从而允许对其关系进行建模。它是轻量级的,就位的。它不需要额外的监督,很容易嵌入现有的网络中。它证明在改进现代物体探测管道中的物体识别和重复清除步骤方面是有效的。它验证了在CNN探测中模拟物体关系的功效。它产生了第一个完全端到端的物体探测器。