Broack News: 使报纸能够接触到印刷品 -- -- 受损害者 (Broken News: Making Newspapers Accessible to Print-Impaired) - 专知论文

会员服务 ·

0

OCR · 损失函数（机器学习） · Analysis · 可约的 · 泛函 ·

2022 年 6 月 21 日

Broken News: Making Newspapers Accessible to Print-Impaired

翻译：Broack News: 使报纸能够接触到印刷品 -- -- 受损害者

Vishal Agarwal,Tanuja Ganu,Saikat Guha

from arxiv, Published at Accessibility, Vision, and Autonomy Meet, CVPR 2022 Workshop

Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources. In this paper, we present our approach for digitization of print newspaper into an accessible file format such as HTML. We use an ensemble of instance segmentation and detection framework for newspaper layout analysis and then OCR to recognize text elements such as headline and article text. Additionally, we propose EdgeMask loss function for Mask-RCNN framework to improve segmentation mask boundary and hence accuracy of downstream OCR task. Empirically, we show that our proposed loss function reduces the Word Error Rate (WER) of news article text by 32.5 %.

翻译：每日获取新闻内容对于印刷缺陷的人来说,包括盲人和低视率的人来说,仍是一个巨大的挑战,因为印刷内容不透明,并且受到在线来源的阻碍。在本文中,我们提出我们的方法,将印刷报纸数字化为一种无障碍的文件格式,例如HTML。我们使用一个实例分解和检测框架来进行报纸布局分析,然后由OCR来识别头条和文章文本等文本要素。此外,我们提议为Mask-RCNN框架提供EdgeMask损失功能,以改善分解蒙面的界限,从而改进下游OCR任务的准确性。我们经常地表明,我们拟议的损失功能将新闻文章文本的文字错误率降低32.5%。

0

相关内容

OCR

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

反应堆用不锈钢氦泡肿胀微观机理的正电子湮没谱学研究

国家自然科学基金

0+阅读 · 2014年12月31日

ADS 次临界堆液态金属铅铋流动与强化传热机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

大气压直流微等离子体辅助液相合成和液相修饰纳米粒子研究

国家自然科学基金

0+阅读 · 2012年12月31日

非中心对称配合物的铁电、压电与介电

国家自然科学基金

0+阅读 · 2012年12月31日

Narf影响细胞衰老的分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Towards Enabling Next Generation Societal Virtual Reality Applications for Virtual Human Teleportation

Arxiv

0+阅读 · 2022年8月9日

Robust Dialogue State Tracking with Weak Supervision and Sparse Data

Robust Dialogue State Tracking with Weak Supervision and Sparse Data

Arxiv

0+阅读 · 2022年8月9日

ChiTransformer:Towards Reliable Stereo from Cues

Arxiv

0+阅读 · 2022年8月9日

A Computational Exploration of Emerging Methods of Variable Importance Estimation

Arxiv

0+阅读 · 2022年8月5日

Graph Enhanced Representation Learning for News Recommendation

Arxiv

24+阅读 · 2020年3月31日

VIP会员

文章信息

相关主题

损失函数（机器学习）

相关VIP内容

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

相关论文

Towards Enabling Next Generation Societal Virtual Reality Applications for Virtual Human Teleportation

Arxiv

0+阅读 · 2022年8月9日

Robust Dialogue State Tracking with Weak Supervision and Sparse Data

Robust Dialogue State Tracking with Weak Supervision and Sparse Data

Arxiv

0+阅读 · 2022年8月9日

ChiTransformer:Towards Reliable Stereo from Cues

Arxiv

0+阅读 · 2022年8月9日

A Computational Exploration of Emerging Methods of Variable Importance Estimation

Arxiv

0+阅读 · 2022年8月5日

Graph Enhanced Representation Learning for News Recommendation

Arxiv

24+阅读 · 2020年3月31日

相关基金

反应堆用不锈钢氦泡肿胀微观机理的正电子湮没谱学研究

国家自然科学基金

0+阅读 · 2014年12月31日

ADS 次临界堆液态金属铅铋流动与强化传热机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

大气压直流微等离子体辅助液相合成和液相修饰纳米粒子研究

国家自然科学基金

0+阅读 · 2012年12月31日

非中心对称配合物的铁电、压电与介电

国家自然科学基金

0+阅读 · 2012年12月31日

Narf影响细胞衰老的分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员