打破常识:WOOPS 合成和合成图像的视觉和语言基准</s> (Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images)

Weird, unusual, and uncanny images pique the curiosity of observers because they challenge commonsense. For example, an image released during the 2022 world cup depicts the famous soccer stars Lionel Messi and Cristiano Ronaldo playing chess, which playfully violates our expectation that their competition should occur on the football field. Humans can easily recognize and interpret these unconventional images, but can AI models do the same? We introduce WHOOPS!, a new dataset and benchmark for visual commonsense. The dataset is comprised of purposefully commonsense-defying images created by designers using publicly-available image generation tools like Midjourney. We consider several tasks posed over the dataset. In addition to image captioning, cross-modal matching, and visual question answering, we introduce a difficult explanation generation task, where models must identify and explain why a given image is unusual. Our results show that state-of-the-art models such as GPT3 and BLIP2 still lag behind human performance on WHOOPS!. We hope our dataset will inspire the development of AI models with stronger visual commonsense reasoning abilities. Data, models and code are available at the project website: whoops-benchmark.github.io

翻译：人类可以很容易地识别和解释这些非常规的图像,但大赦国际模型也可以这样做吗?我们介绍WHOOPS!一个新的数据集和视觉常识基准。该数据集由2022年世界杯期间发行的图象组成,由设计者利用Midjourney等公开获得的图像生成,我们考虑在数据集上布置的几项任务。除了图像说明、跨模式匹配和视觉问题回答外,我们还引入了难以解释的生成任务,模型必须确定和解释某一图像为何不常见。我们的结果显示,GPT3和BLIP2等最新模型仍然落后于WHOOPS的人类性能!我们希望我们的数据集将激励开发AI模型,使其具有更强的视觉常识推理能力。数据模型和代码将在WHOOPS上运行。数据、模型和代码在网站中提供:谁可以使用?数据、模型和项目:谁可以使用?</s>

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

专知会员服务

28+阅读 · 2022年12月26日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日