Scene text image super-resolution (STISR) has been regarded as an important pre-processing task for text recognition from low-resolution scene text images. Most recent approaches use the recognizer's feedback as clues to guide super-resolution. However, directly using the recognition clue has two problems: 1) Compatibility. It is in the form of a probability distribution, which has an obvious modal gap with STISR, a pixel-level task. 2) Inaccuracy. It usually contains wrong information, which misleads the main task and degrades super-resolution performance. In this paper, we present a novel method, C3-STISR, that jointly exploits the recognizer's feedback, visual, and linguistic information as clues to guide super-resolution. Here, the visual clue comes from the images of the texts predicted by the recognizer, which are informative and more compatible with the STISR task, while the linguistic clue is generated by a pre-trained character-level language model, which is able to correct the predicted texts. We design effective extraction and fusion mechanisms for the triple cross-modal clues to generate comprehensive and unified guidance for super-resolution. Extensive experiments on TextZoom show that C3-STISR outperforms the SOTA methods in fidelity and recognition performance. Code is available at https://github.com/zhaominyiz/C3-STISR.
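To make the triple-clue idea concrete, below is a minimal sketch of how recognizer feedback (a per-step probability distribution), a visual clue (an image rendered from the predicted text), and a linguistic clue (hidden states from a character-level language model) could be projected into a shared space and fused into a single guidance map for the super-resolution backbone. All module names, tensor shapes, and the mean-pooling/broadcast fusion are illustrative assumptions, not the authors' implementation; see the linked repository for the actual method.

```python
import torch
import torch.nn as nn

class TripleClueFusion(nn.Module):
    """Hypothetical fusion of recognizer-feedback, visual, and linguistic
    clues into one guidance tensor (a sketch, not the paper's code)."""

    def __init__(self, num_classes=37, d=64, height=16, width=64, lm_dim=768):
        super().__init__()
        self.height, self.width = height, width
        # Recognizer feedback: per-step class probability distributions.
        self.feedback_proj = nn.Linear(num_classes, d)
        # Visual clue: a grayscale image rendered from the predicted text.
        self.visual_enc = nn.Conv2d(1, d, kernel_size=3, padding=1)
        # Linguistic clue: hidden states of a character-level language model
        # (lm_dim is an assumed hidden size).
        self.lm_proj = nn.Linear(lm_dim, d)
        # 1x1 convolution to unify the three clues into one guidance map.
        self.fuse = nn.Conv2d(3 * d, d, kernel_size=1)

    def forward(self, probs, rendered_text, lm_hidden):
        # probs: (B, T, num_classes), rendered_text: (B, 1, H, W),
        # lm_hidden: (B, T, lm_dim)
        f = self.feedback_proj(probs).mean(dim=1)        # (B, d)
        f = f[:, :, None, None].expand(-1, -1, self.height, self.width)
        v = self.visual_enc(rendered_text)               # (B, d, H, W)
        l = self.lm_proj(lm_hidden).mean(dim=1)          # (B, d)
        l = l[:, :, None, None].expand(-1, -1, self.height, self.width)
        # Concatenate along channels and compress to a unified guidance map.
        return self.fuse(torch.cat([f, v, l], dim=1))    # (B, d, H, W)

# Usage sketch: the resulting guidance tensor would then condition the
# super-resolution network, e.g. by concatenation with image features.
fusion = TripleClueFusion()
guidance = fusion(torch.softmax(torch.randn(2, 26, 37), dim=-1),
                  torch.randn(2, 1, 16, 64),
                  torch.randn(2, 26, 768))
print(guidance.shape)  # torch.Size([2, 64, 16, 64])
```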