Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions

A significant gap remains between today's visual pattern recognition models and human-level visual cognition, especially when it comes to few-shot learning and compositional reasoning about novel concepts. We introduce Bongard-HOI, a new visual reasoning benchmark that focuses on compositional learning of human-object interactions (HOIs) from natural images. It is inspired by two desirable characteristics of the classical Bongard problems (BPs): 1) few-shot concept learning, and 2) context-dependent reasoning. We carefully curate the few-shot instances with hard negatives, where positive and negative images disagree only on action labels, making mere recognition of object categories insufficient to complete our benchmark. We also design multiple test sets to systematically study the generalization of visual learning models, where we vary the overlap of the HOI concepts between the training and test sets of few-shot instances, from partial to no overlap. Bongard-HOI presents a substantial challenge to today's visual recognition models. The state-of-the-art HOI detection model achieves only 62% accuracy on few-shot binary prediction, while even amateur human testers on MTurk reach 91% accuracy. With the Bongard-HOI benchmark, we hope to further advance research efforts in visual reasoning, especially in holistic perception-reasoning systems and better representation learning.
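As a rough illustration of the hard-negative idea described in the abstract, the sketch below assembles one few-shot instance from a toy annotation list. The annotation format `(image_id, action, object)`, the function name `build_instance`, the support-set size `k`, and the toy data are all assumptions made for this example; they are not taken from the benchmark's actual data pipeline.

```python
import random


def build_instance(annotations, concept, k=2, seed=0):
    """Assemble one few-shot instance for a (action, object) concept.

    `annotations` is a list of (image_id, action, object) tuples.
    This format is hypothetical and chosen only for illustration.
    """
    action, obj = concept
    rng = random.Random(seed)

    # Positive images depict the target action on the target object.
    positives = sorted({img for img, a, o in annotations if (a, o) == (action, obj)})

    # Hard negatives share the object category but show a different action,
    # so recognizing the object alone cannot separate the two sets.
    # Images that also contain the positive concept are excluded.
    negatives = sorted(
        {img for img, a, o in annotations if o == obj and a != action} - set(positives)
    )

    if len(positives) < k + 1 or len(negatives) < k + 1:
        raise ValueError("not enough images to build an instance for this concept")

    pos = rng.sample(positives, k + 1)
    neg = rng.sample(negatives, k + 1)
    return {
        "concept": concept,
        "positive_support": pos[:k],
        "negative_support": neg[:k],
        # One held-out query per side, labeled 1 (positive) or 0 (negative).
        "queries": [(pos[k], 1), (neg[k], 0)],
    }


# Toy annotations (made up for this example).
toy = [
    ("img1", "ride", "bicycle"), ("img2", "ride", "bicycle"), ("img3", "ride", "bicycle"),
    ("img4", "wash", "bicycle"), ("img5", "repair", "bicycle"), ("img6", "carry", "bicycle"),
    ("img7", "ride", "horse"),
]
instance = build_instance(toy, ("ride", "bicycle"), k=2)
print(instance)
```

In this toy setup every image in the instance contains a bicycle, so a model has to attend to the action to label the queries; `img7` is never selected because it shares the action but not the object.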


Related content

Few-shot learning

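Few-shot learning (FSL) addresses the problem of improving neural network performance when only a small amount of data is available. A commonly used family of methods in FSL is meta-learning. Like ordinary neural network training, meta-learning has a training phase and a testing phase, but these are called meta-training and meta-testing, respectively.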
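A minimal sketch of how meta-training and meta-testing are commonly organized into episodes, assuming the usual N-way K-shot setup with disjoint class splits. The helper `sample_episode`, the toy class names, and the split sizes are hypothetical and serve only to illustrate the idea.

```python
import random


def sample_episode(data_by_class, classes, n_way, k_shot, n_query, rng):
    """Sample one N-way K-shot episode: a support set and a query set."""
    chosen = rng.sample(classes, n_way)
    support, query = [], []
    for label, cls in enumerate(chosen):
        # Draw disjoint support and query examples for this class.
        items = rng.sample(data_by_class[cls], k_shot + n_query)
        support += [(x, label) for x in items[:k_shot]]
        query += [(x, label) for x in items[k_shot:]]
    return support, query


# Toy dataset (made up): 6 classes with 5 example ids each.
data_by_class = {
    f"class_{i}": [f"class_{i}_img_{j}" for j in range(5)] for i in range(6)
}
classes = sorted(data_by_class)

# Meta-training and meta-testing use disjoint sets of classes.
meta_train_classes = classes[:4]
meta_test_classes = classes[4:]

rng = random.Random(0)
train_support, train_query = sample_episode(
    data_by_class, meta_train_classes, n_way=3, k_shot=2, n_query=1, rng=rng
)
test_support, test_query = sample_episode(
    data_by_class, meta_test_classes, n_way=2, k_shot=2, n_query=1, rng=rng
)
print(len(train_support), len(train_query), len(test_support), len(test_query))
```

During meta-training a model would be updated over many such episodes; during meta-testing it adapts to the support set of each unseen-class episode and is scored on the query set.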
