打开图像 V5 文本注解和另一个遮罩文本斑点 (Open Images V5 Text Annotation and Yet Another Mask Text Spotter) - 专知论文

会员服务 ·

0

掩码 · MoDELS · Performer · CASES · state-of-the-art ·

2021 年 6 月 23 日

Open Images V5 Text Annotation and Yet Another Mask Text Spotter

翻译：打开图像 V5 文本注解和另一个遮罩文本斑点

Ilya Krylov,Sergei Nosov,Vladislav Sovrasov

A large scale human-labeled dataset plays an important role in creating high quality deep learning models. In this paper we present text annotation for Open Images V5 dataset. To our knowledge it is the largest among publicly available manually created text annotations. Having this annotation we trained a simple Mask-RCNN-based network, referred as Yet Another Mask Text Spotter (YAMTS), which achieves competitive performance or even outperforms current state-of-the-art approaches in some cases on ICDAR2013, ICDAR2015 and Total-Text datasets. Code for text spotting model available online at: https://github.com/openvinotoolkit/training_extensions. The model can be exported to OpenVINO-format and run on Intel CPUs.

翻译：大型人类标签数据集在创建高品质深层学习模型方面发挥了重要作用。在本文中, 我们为开放图像V5数据集提供文本说明。据我们所知, 这是公开手动创建的文本说明中最大的。有了这种说明, 我们训练了一个简单的Mask- RCNN 网络, 称为“ 另类面具文本显示器 ” (YAMTS), 取得竞争性的性能, 甚至在某些情况中, ICDAR2013、 ICDAR2015 和 Total-Text 数据集的当前最先进方法超前。文本识别模型的代码可以在以下网址上查到: https://github.com/ openvinotoolkit/ training_ extensions。该模型可以导出到 OpenVINO- 格式, 并在 Intel CPUs 上运行。

1

相关内容

电子科大最新《基于深度神经网络的关系提取》综述论文，20页pdf

电子科大最新《基于深度神经网络的关系提取》综述论文，20页pdf

专知会员服务

40+阅读 · 2021年1月8日

最新《文本深度学习模型压缩》综述论文，21页pdf

最新《文本深度学习模型压缩》综述论文，21页pdf

专知会员服务

26+阅读 · 2020年8月19日

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

专知会员服务

43+阅读 · 2020年8月18日

最新《知识蒸馏》2020综述论文，20页pdf，悉尼大学

最新《知识蒸馏》2020综述论文，20页pdf，悉尼大学

专知会员服务

158+阅读 · 2020年6月14日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【Google】无监督机器翻译，Unsupervised Machine Translation

【Google】无监督机器翻译，Unsupervised Machine Translation

专知会员服务

36+阅读 · 2020年3月3日

【目标检测 | 2019最新综述】基于深度学习的目标检测综述，附30页PDF， A Survey of Deep Learning-based Object Detection（From Fast R-CNN to NAS-FPN）

【目标检测 | 2019最新综述】基于深度学习的目标检测综述，附30页PDF， A Survey of Deep Learning-based Object Detection（From Fast R-CNN to NAS-FPN）

专知会员服务

56+阅读 · 2019年11月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

159+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

已删除

将门创投

5+阅读 · 2018年6月7日

Ranking Models in Unlabeled New Environments

Arxiv

0+阅读 · 2021年8月23日

Is it Time to Replace CNNs with Transformers for Medical Images?

Arxiv

0+阅读 · 2021年8月20日

Knowledge-based Review Generation by Coherence Enhanced Text Planning

Arxiv

7+阅读 · 2021年5月9日

SwapText: Image Based Texts Transfer in Scenes

SwapText: Image Based Texts Transfer in Scenes

Arxiv

4+阅读 · 2020年3月18日

Low-Resource Response Generation with Template Prior

Arxiv

4+阅读 · 2019年9月26日

Text Summarization with Pretrained Encoders

Arxiv

5+阅读 · 2019年8月22日

Graph Convolutional Networks for Text Classification

Arxiv

31+阅读 · 2018年11月13日

Efficient semantic image segmentation with superpixel pooling

Arxiv

6+阅读 · 2018年6月7日

Arxiv

7+阅读 · 2018年1月24日

Detecting Curve Text in the Wild: New Dataset and New Solution

Arxiv

4+阅读 · 2017年12月6日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

电子科大最新《基于深度神经网络的关系提取》综述论文，20页pdf

电子科大最新《基于深度神经网络的关系提取》综述论文，20页pdf

专知会员服务

40+阅读 · 2021年1月8日

最新《文本深度学习模型压缩》综述论文，21页pdf

最新《文本深度学习模型压缩》综述论文，21页pdf

专知会员服务

26+阅读 · 2020年8月19日

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

专知会员服务

43+阅读 · 2020年8月18日

最新《知识蒸馏》2020综述论文，20页pdf，悉尼大学

最新《知识蒸馏》2020综述论文，20页pdf，悉尼大学

专知会员服务

158+阅读 · 2020年6月14日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【Google】无监督机器翻译，Unsupervised Machine Translation

【Google】无监督机器翻译，Unsupervised Machine Translation

专知会员服务

36+阅读 · 2020年3月3日

【目标检测 | 2019最新综述】基于深度学习的目标检测综述，附30页PDF， A Survey of Deep Learning-based Object Detection（From Fast R-CNN to NAS-FPN）

【目标检测 | 2019最新综述】基于深度学习的目标检测综述，附30页PDF， A Survey of Deep Learning-based Object Detection（From Fast R-CNN to NAS-FPN）

专知会员服务

56+阅读 · 2019年11月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

159+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】逆强化学习中的部分可识别性与模型设定错误

投大模型岗？50道大型语言模型（LLM）面试问题汇总

深度学习的多视角三维重建技术综述

【ICML2025】扩散模型中参数高效微调的零样本适应

相关资讯

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

已删除

将门创投

5+阅读 · 2018年6月7日

相关论文

Ranking Models in Unlabeled New Environments

Arxiv

0+阅读 · 2021年8月23日

Is it Time to Replace CNNs with Transformers for Medical Images?

Arxiv

0+阅读 · 2021年8月20日

Knowledge-based Review Generation by Coherence Enhanced Text Planning

Arxiv

7+阅读 · 2021年5月9日

SwapText: Image Based Texts Transfer in Scenes

SwapText: Image Based Texts Transfer in Scenes

Arxiv

4+阅读 · 2020年3月18日

Low-Resource Response Generation with Template Prior

Arxiv

4+阅读 · 2019年9月26日

Text Summarization with Pretrained Encoders

Arxiv

5+阅读 · 2019年8月22日

Graph Convolutional Networks for Text Classification

Arxiv

31+阅读 · 2018年11月13日

Efficient semantic image segmentation with superpixel pooling

Arxiv

6+阅读 · 2018年6月7日

Arxiv

7+阅读 · 2018年1月24日

Detecting Curve Text in the Wild: New Dataset and New Solution

Arxiv

4+阅读 · 2017年12月6日

微信扫码咨询专知VIP会员