Many emerging applications of intelligent robots need to explore and understand new environments, where it is desirable to detect objects of novel classes on the fly with minimum online efforts. This is an object detection on demand (ODOD) task. It is challenging, because it is impossible to annotate a large number of data on the fly, and the embedded systems are usually unable to perform back-propagation which is essential for training. Most existing few-shot detection methods are confronted here as they need extra training. We propose a novel morphable detector (MD), that simply "morphs" some of its changeable parameters online estimated from the few samples, so as to detect novel classes without any extra training. The MD has two sets of parameters, one for the feature embedding and the other for class representation (called "prototypes"). Each class is associated with a hidden prototype to be learned by integrating the visual and semantic embeddings. The learning of the MD is based on the alternate learning of the feature embedding and the prototypes in an EM-like approach which allows the recovery of an unknown prototype from a few samples of a novel class. Once an MD is learned, it is able to use a few samples of a novel class to directly compute its prototype to fulfill the online morphing process. We have shown the superiority of the MD in Pascal, COCO and FSOD datasets. The code is available https://github.com/Zhaoxiangyun/Morphable-Detector.
翻译:智能机器人的许多新兴应用都需要探索和理解新的环境, 在那里, 最好用最小的在线努力来探测在飞行上的新课程的物体。 这是根据需求检测物体的任务( ODOD ) 。 这是具有挑战性的任务, 因为无法在飞行上批注大量数据, 而嵌入系统通常无法进行对培训至关重要的反向分析。 大部分现有的微小探测方法都在这里遇到, 因为它们需要额外的培训。 我们建议了一个新颖的可变探测器( MD ), 只需要从少数样本中测出一些可以在线变异的参数, 就可以在网上检测到一些可以从少数样本中测出的新课程。 MD 有两套参数, 一个用于嵌入功能,另一个用于课堂演示( 称为“ 程序类型 ” ) 。 每个课程都与一个隐藏的原型相联起来, 要通过将视觉和语系嵌入的CO 。 学习MDMD是基于对特性和类似EM- 的方法的替代学习的特性嵌入和原型, 使得能够从少数的PO/ 类样本中回收一个未知的原型, 。 一旦学会将演示, 将演示, 我们将演示到演示,, 将演示, 将它演示到正在演示到正在演示到 。