走向开放世界物体探测 (Towards Open World Object Detection)

Humans have a natural instinct to identify unknown object instances in their environments. The intrinsic curiosity about these unknown instances aids in learning about them, when the corresponding knowledge is eventually available. This motivates us to propose a novel computer vision problem called: `Open World Object Detection', where a model is tasked to: 1) identify objects that have not been introduced to it as `unknown', without explicit supervision to do so, and 2) incrementally learn these identified unknown categories without forgetting previously learned classes, when the corresponding labels are progressively received. We formulate the problem, introduce a strong evaluation protocol and provide a novel solution, which we call ORE: Open World Object Detector, based on contrastive clustering and energy based unknown identification. Our experimental evaluation and ablation studies analyze the efficacy of ORE in achieving Open World objectives. As an interesting by-product, we find that identifying and characterizing unknown instances helps to reduce confusion in an incremental object detection setting, where we achieve state-of-the-art performance, with no extra methodological effort. We hope that our work will attract further research into this newly identified, yet crucial research direction.

翻译：人类有自然本能来识别其环境中的未知对象实例。这些未知实例的内在好奇心有助于在最终掌握相应的知识时了解这些未知实例。这促使我们提出一个新型的计算机视觉问题,名为“开放世界物体探测”,模型的任务是:(1) 在没有明确监督的情况下,确定尚未被引入“未知”的物体,并且(2) 在逐渐收到相应的标签时,在不忘记先前学到的类别的情况下,逐步了解这些被识别的未知类别。我们提出了问题,引入了强有力的评估协议,并提供了一种新颖的解决办法,我们称之为“开放世界物体探测器”,其基础是对比性集群和未知的能源识别。我们的实验性评估和放大研究分析了ORE在实现开放世界目标方面的功效。作为一个有趣的副产品,我们发现识别和定性未知事件有助于减少增量物体探测环境中的混乱,我们在此环境中取得了最新水平的性能,而没有额外的方法努力。我们希望我们的工作将吸引对这一新发现的关键研究方向进行进一步的研究。