Weakly supervised object localization is a challenging task in which the object of interest should be localized while learning its appearance. State-of-the-art methods recycle the architecture of a standard CNN by using the activation maps of the last layer for localizing the object. While this approach is simple and works relatively well, object localization relies on different features than classification, thus, a specialized localization mechanism is required during training to improve performance. In this paper, we propose a convolutional, multi-scale spatial localization network that provides accurate localization for the object of interest. Experimental results on CUB-200-2011 and ImageNet datasets show that our proposed approach provides competitive performance for weakly supervised localization.
翻译:微弱监督对象本地化是一项具有挑战性的任务,在这项工作中,在了解对象外观的同时,该对象应当本地化。最先进的方法利用最后一层的启动图对标准CNN架构进行循环利用,使对象本地化。虽然这一方法简单,效果相对较好,但对象本地化取决于与分类不同的特征,因此,在培训期间需要有一个专门的本地化机制来提高绩效。在本文件中,我们提议建立一个动态的多规模空间本地化网络,为对象提供准确本地化。 CUB-200-2011和图像网络数据集的实验结果显示,我们拟议的方法为监管不力的本地化提供了竞争性绩效。