与人类判断相矛盾的图像,用于强力视觉事件分类 (Ambiguous Images With Human Judgments for Robust Visual Event Classification)

Contemporary vision benchmarks predominantly consider tasks on which humans can achieve near-perfect performance. However, humans are frequently presented with visual data that they cannot classify with 100% certainty, and models trained on standard vision benchmarks achieve low performance when evaluated on this data. To address this issue, we introduce a procedure for creating datasets of ambiguous images and use it to produce SQUID-E ("Squidy"), a collection of noisy images extracted from videos. All images are annotated with ground truth values and a test set is annotated with human uncertainty judgments. We use this dataset to characterize human uncertainty in vision tasks and evaluate existing visual event classification models. Experimental results suggest that existing vision models are not sufficiently equipped to provide meaningful outputs for ambiguous images and that datasets of this nature can be used to assess and improve such models through model training and direct evaluation of model calibration. These findings motivate large-scale ambiguous dataset creation and further research focusing on noisy visual data.

翻译：当代愿景基准主要考虑人类能够取得近乎完美业绩的任务。然而,人类经常得到他们无法百分之百确定分类的视觉数据,而接受过标准愿景基准培训的模型在对这些数据进行评估时表现低。为了解决这一问题,我们引入了创建模糊图像数据集的程序,并使用该数据集制作从视频中提取的噪音图像集(SQUID-E ) (“Squidy ” ) 。所有图像都附有地面真实值附加说明,测试集附有人类不确定性判断。我们使用这一数据集来描述人类在愿景任务中的不确定性,并评估现有的视觉事件分类模型。实验结果表明,现有的视觉模型不具备足够能力,无法为模糊图像提供有意义的产出,而且这种性质的数据集可以通过模型培训和直接评估模型校准来评估和改进这些模型。这些发现鼓励了大规模模糊数据集的创建和进一步研究,重点是杂乱的视觉数据。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日