Although modern object detectors rely heavily on a significant amount of training data, humans can easily detect novel objects using a few training examples. The mechanism of the human visual system is to interpret spatial relationships among various objects and this process enables us to exploit contextual information by considering the co-occurrence of objects. Thus, we propose a spatial reasoning framework that detects novel objects with only a few training examples in a context. We infer geometric relatedness between novel and base RoIs (Region-of-Interests) to enhance the feature representation of novel categories using an object detector well trained on base categories. We employ a graph convolutional network as the RoIs and their relatedness are defined as nodes and edges, respectively. Furthermore, we present spatial data augmentation to overcome the few-shot environment where all objects and bounding boxes in an image are resized randomly. Using the PASCAL VOC and MS COCO datasets, we demonstrate that the proposed method significantly outperforms the state-of-the-art methods and verify its efficacy through extensive ablation studies.
翻译:尽管现代天体探测器在很大程度上依赖大量培训数据,但人类可以很容易地利用几个培训实例探测新物体。人类视觉系统的机制是解释各种天体之间的空间关系,这一过程使我们能够通过考虑天体的共同发生来利用背景信息。因此,我们提议了一个空间推理框架来探测新天体,在一定范围内只举几个培训实例。我们推论小天体和基本天体(利益区域)之间的几何关联性,以便利用在基本类别上受过良好训练的天体探测器,加强新星类别的特征表现。我们使用图象演动网络,因为RoIs及其关联性被分别定义为节点和边缘。此外,我们提出空间数据增强,以克服图像中所有天体和捆绑框都随机调整大小的微小环境。我们用PASCAL VOC和MS COCO数据集来证明拟议的方法大大超越了在基准类别上受过良好训练的物体探测器。我们采用的方法,并通过广泛的通缩缩图研究来验证其功效。