Zero-Shot In-Distribution Detection in Multi-Object Settings Using Vision-Language Foundation Models - 专知论文

April 10, 2023

Zero-Shot In-Distribution Detection in Multi-Object Settings Using Vision-Language Foundation Models

Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa

Removing out-of-distribution (OOD) images from noisy images scraped from the Internet is an important preprocessing step for constructing datasets, and it can be addressed by zero-shot OOD detection with vision-language foundation models (CLIP). The existing zero-shot OOD detection setting does not consider the realistic case where an image contains both in-distribution (ID) objects and OOD objects. However, it is important to identify such images as ID images when collecting images of rare classes or ethically inappropriate classes that must not be missed. In this paper, we propose a novel problem setting called in-distribution (ID) detection, where we identify images containing ID objects as ID images, even if they also contain OOD objects, and images lacking ID objects as OOD images. To solve this problem, we present a new approach, Global-Local Maximum Concept Matching (GL-MCM), based on both global and local visual-text alignments of CLIP features, which can identify any image containing ID objects as an ID image. Extensive experiments demonstrate that GL-MCM outperforms comparison methods on both multi-object datasets and single-object ImageNet benchmarks.
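
The abstract does not state the scoring rule, so the following is only a rough sketch of how a global-local concept-matching score might be computed. It assumes the score is the sum of two maximum-softmax terms: one from the similarity between a global image feature and the ID class text features, and one from the similarities between per-region (local) image features and the same text features. The function name `gl_mcm_score`, the `temperature` parameter, and the toy feature vectors are hypothetical; in practice the features would come from a CLIP image and text encoder, which is not loaded here.

```python
import numpy as np


def _softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def _l2_normalize(x, axis=-1, eps=1e-12):
    # Scale vectors to unit length so dot products are cosine similarities.
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def gl_mcm_score(global_feat, local_feats, text_feats, temperature=1.0):
    """Assumed global-local score: higher means more likely to contain an ID object.

    global_feat: (D,)   one feature for the whole image
    local_feats: (R, D) one feature per image region
    text_feats:  (K, D) one feature per ID class prompt
    """
    g = _l2_normalize(np.asarray(global_feat, dtype=float))
    loc = _l2_normalize(np.asarray(local_feats, dtype=float))
    txt = _l2_normalize(np.asarray(text_feats, dtype=float))

    # Global term: softmax over classes, then take the best class.
    global_probs = _softmax(g @ txt.T / temperature)            # shape (K,)
    # Local term: softmax over classes per region, then best (region, class).
    local_probs = _softmax(loc @ txt.T / temperature, axis=-1)  # shape (R, K)
    return float(global_probs.max() + local_probs.max())


# Toy setup (hypothetical): three ID classes along axes 0-2, OOD content along axis 3.
text_feats = np.eye(4)[:3]
ood_dir = np.eye(4)[3]
id_dir = np.eye(4)[0]

# Image A: globally dominated by OOD content, but one region shows an ID object.
score_mixed = gl_mcm_score(ood_dir, np.stack([ood_dir, id_dir, ood_dir]),
                           text_feats, temperature=0.1)
# Image B: no ID object anywhere.
score_ood = gl_mcm_score(ood_dir, np.stack([ood_dir, ood_dir, ood_dir]),
                         text_feats, temperature=0.1)
print(score_mixed > score_ood)
```

In this toy case the global term alone cannot separate the two images, since both global features point in the OOD direction; the local term is what lifts the mixed image's score. An image would then be kept as ID when its score exceeds a threshold chosen on held-out data.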
