图纸: 打印- 损坏的自动图表解析 (ChartParser: Automatic Chart Parsing for Print-Impaired)

Infographics are often an integral component of scientific documents for reporting qualitative or quantitative findings as they make it much simpler to comprehend the underlying complex information. However, their interpretation continues to be a challenge for the blind, low-vision, and other print-impaired (BLV) individuals. In this paper, we propose ChartParser, a fully automated pipeline that leverages deep learning, OCR, and image processing techniques to extract all figures from a research paper, classify them into various chart categories (bar chart, line chart, etc.) and obtain relevant information from them, specifically bar charts (including horizontal, vertical, stacked horizontal and stacked vertical charts) which already have several exciting challenges. Finally, we present the retrieved content in a tabular format that is screen-reader friendly and accessible to the BLV users. We present a thorough evaluation of our approach by applying our pipeline to sample real-world annotated bar charts from research papers.

翻译：地图往往是报告定性或定量调查结果的科学文件的一个组成部分,因为这些文件使得理解基本复杂信息容易得多,但是,对地图的解释仍然是盲人、低视力者和其他印刷障碍者(BLV)的一项挑战。在本文中,我们提出“图纸”,这是一个完全自动化的管道,利用深层学习、OCR和图像处理技术从研究文件中提取所有数字,将其分为不同的图表类别(图、线图等),并从中获取相关信息,特别是已经存在若干令人兴奋的挑战的条形图(包括横向、纵向、叠叠叠的横向和叠叠叠的垂直图表)。最后,我们以表格形式介绍检索到的内容,便于屏幕阅读,便于BLV用户查阅。我们通过将我们的管道用于实际样本,从研究论文中提取附加注释的条形图,对我们的方法进行了彻底的评估。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日