V3Det: Vast Vocabulary Visual Detection Dataset - 专知 Papers

Object detection · Categories · Visual detection · Datasets · Annotation (programming)

April 7, 2023

V3Det: Vast Vocabulary Visual Detection Dataset

Jiaqi Wang,Pan Zhang,Tao Chu,Yuhang Cao,Yujie Zhou,Tong Wu,Bin Wang,Conghui He,Dahua Lin

From arXiv: the dataset with 13,029 categories will be publicly available by June 2023.

Recent advances in detecting arbitrary objects in the real world are trained and evaluated on object detection datasets with a relatively restricted vocabulary. To facilitate the development of more general visual object detection, we propose V3Det, a vast vocabulary visual detection dataset with precisely annotated bounding boxes on massive images. V3Det has several appealing properties: 1) Vast Vocabulary: It contains bounding boxes of objects from 13,029 categories on real-world images, which is 10 times larger than the existing large vocabulary object detection dataset, e.g., LVIS. 2) Hierarchical Category Organization: The vast vocabulary of V3Det is organized by a hierarchical category tree which annotates the inclusion relationship among categories, encouraging the exploration of category relationships in vast and open vocabulary object detection. 3) Rich Annotations: V3Det comprises precisely annotated objects in 245k images and professional descriptions of each category written by human experts and a powerful chatbot. By offering a vast exploration space, V3Det enables extensive benchmarks on both vast and open vocabulary object detection, leading to new observations, practices, and insights for future research. It has the potential to serve as a cornerstone dataset for developing more general visual perception systems.
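
The hierarchical category organization described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions: the category names, the `PARENT` mapping, and the helper functions `ancestors` and `is_included` are hypothetical and are not taken from the V3Det release, whose actual file format is not described here. The sketch stores the tree as a child-to-parent mapping and answers the "inclusion relationship" question by walking up toward the root.

```python
from typing import Dict, List, Optional

# Hypothetical toy hierarchy (NOT the real V3Det tree): child -> parent.
# A parent of None marks a root category.
PARENT: Dict[str, Optional[str]] = {
    "animal": None,
    "dog": "animal",
    "husky": "dog",
    "cat": "animal",
}


def ancestors(category: str, parent: Dict[str, Optional[str]] = PARENT) -> List[str]:
    """Return the ancestors of `category`, from its direct parent up to the root."""
    chain: List[str] = []
    node = parent[category]
    while node is not None:
        chain.append(node)
        node = parent[node]
    return chain


def is_included(child: str, ancestor: str,
                parent: Dict[str, Optional[str]] = PARENT) -> bool:
    """True if `child` is `ancestor` itself or one of its descendants."""
    return child == ancestor or ancestor in ancestors(child, parent)


if __name__ == "__main__":
    print(ancestors("husky"))
    print(is_included("husky", "animal"))
    print(is_included("cat", "dog"))
```

A child-to-parent mapping keeps the lookup simple when each category has exactly one parent; whether that holds for the real dataset is an assumption of this sketch.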
