The generic object detection (GOD) task has been successfully tackled by recent deep neural networks, trained by an avalanche of annotated training samples from some common classes. However, it is still non-trivial to generalize these object detectors to the novel long-tailed object classes, which have only few labeled training samples. To this end, the Few-Shot Object Detection (FSOD) has been topical recently, as it mimics the humans' ability of learning to learn, and intelligently transfers the learned generic object knowledge from the common heavy-tailed, to the novel long-tailed object classes. Especially, the research in this emerging field has been flourishing in recent years with various benchmarks, backbones, and methodologies proposed. To review these FSOD works, there are several insightful FSOD survey articles [58, 59, 74, 78] that systematically study and compare them as the groups of fine-tuning/transfer learning, and meta-learning methods. In contrast, we review the existing FSOD algorithms from a new perspective under a new taxonomy based on their contributions, i.e., data-oriented, model-oriented, and algorithm-oriented. Thus, a comprehensive survey with performance comparison is conducted on recent achievements of FSOD. Furthermore, we also analyze the technical challenges, the merits and demerits of these methods, and envision the future directions of FSOD. Specifically, we give an overview of FSOD, including the problem definition, common datasets, and evaluation protocols. The taxonomy is then proposed that groups FSOD methods into three types. Following this taxonomy, we provide a systematic review of the advances in FSOD. Finally, further discussions on performance, challenges, and future directions are presented.
翻译:通用物体检测(GOD)任务已经成功地被最近的深度神经网络解决,这些网络被训练来处理来自一些通用类别的大量带注释的训练样本。然而,将这些物体检测器推广到只有少量标记训练样本的新型长尾物体类别仍然是困难的。为此,Few-Shot目标检测(FSOD)近年来非常热门,因为它模拟了人类学习的能力,并将从通用的重尾部到新型的长尾物体类别中学到的通用物体知识智能传输。尤其是,近年来在这一新兴领域的研究已经蓬勃发展,提出了各种基准、骨干网络和方法。为了回顾这些FSOD作品,有几篇有见地的FSOD综述文章[58,59,74,78]系统地研究和比较它们,如微调/迁移学习和元学习方法。相反,我们从一种新的基于FSOD的贡献的分类方法的新角度重新审查现有的FSOD算法,即,以数据为导向、模型为导向和算法为导向的方法。因此,我们对最近FSOD的成就进行了全面的综述和性能比较。此外,我们还分析了这些方法的技术挑战、优缺点,并设想了FSOD的未来方向。具体而言,我们概述了FSOD,包括问题定义、常见数据集和评估协议。然后,我们提出了一个将FSOD方法分为三种类型的分类方法。在遵循这种分类方法的基础上,我们对FSOD的进展进行了系统的综述。最后,我们对性能、挑战和未来方向进行了进一步的讨论。