微小热视频对象探测 (Few-Shot Video Object Detection)

We introduce Few-Shot Video Object Detection (FSVOD) with three important contributions: 1) a large-scale video dataset FSVOD-500 comprising of 500 classes with class-balanced videos in each category for few-shot learning; 2) a novel Tube Proposal Network (TPN) to generate high-quality video tube proposals to aggregate feature representation for the target video object; 3) a strategically improved Temporal Matching Network (TMN+) to match representative query tube features and supports with better discriminative ability. Our TPN and TMN+ are jointly and end-to-end trained. Extensive experiments demonstrate that our method produces significantly better detection results on two few-shot video object detection datasets compared to image-based methods and other naive video-based extensions. Codes and datasets will be released at https://github.com/fanq15/FewX.

翻译：我们引入了几小片视频物体探测(FSVOD),有三个重要贡献:(1) 大型视频数据集FSVOD-500,由500个班组成,每类中各有500个带级平衡的视频,进行几发学习;(2) 新型Tube建议网络(TPN),以产生高质量的视频管建议,以汇总目标视频物体的特征;(3) 战略上改进的时空匹配网络(TMN+),以匹配有代表性的查询管特征,并以更好的歧视能力提供支持。我们的主题方案网络和TMN+受到联合和端至端培训。广泛的实验表明,与基于图像的方法和其他天真的视频扩展相比,我们的方法在两个几发视频物体探测数据集上产生更好的检测结果。代码和数据集将在https://github.com/fanq15/FewX上发布。

相关内容

小样本学习

关注 0

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【三星AI-CVPR2020】增量小样本目标检测，Incremental Few-Shot Object Detection

专知会员服务

69+阅读 · 2020年3月11日