Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Online and offline handwritten Chinese text recognition (HCTR) has been studied for decades. Early methods adopted oversegmentation-based strategies but suffered from low speed, insufficient accuracy, and the high cost of character segmentation annotations. Recently, segmentation-free methods based on connectionist temporal classification (CTC) and attention mechanisms have dominated the field of HCTR. However, people actually read text character by character, especially for ideograms such as Chinese. This raises the question: are segmentation-free strategies really the best solution to HCTR? To explore this issue, we propose a new segmentation-based method for recognizing handwritten Chinese text that is implemented using a simple yet efficient fully convolutional network. A novel weakly supervised learning method is proposed to enable the network to be trained using only transcript annotations; thus, the expensive character segmentation annotations required by previous segmentation-based methods can be avoided. Because fully convolutional networks lack context modeling, we propose a contextual regularization method that integrates contextual information into the network during the training stage, which further improves recognition performance. Extensive experiments conducted on four widely used benchmarks, namely CASIA-HWDB, CASIA-OLHWDB, ICDAR2013, and SCUT-HCCDoc, show that our method significantly surpasses existing methods on both online and offline HCTR, and exhibits a considerably higher inference speed than CTC/attention-based approaches.
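
For readers unfamiliar with the segmentation-free baseline that the abstract contrasts against, the following is a minimal sketch of greedy CTC decoding. It is not the method proposed in this paper; the function name, the integer label indices, and the choice of index 0 for the blank label are assumptions made purely for illustration. The point it shows is that the decoded transcript carries no character positions, which is the information a segmentation-based recognizer would additionally provide.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (illustrative choice)


def ctc_greedy_decode(frame_labels, blank=BLANK):
    """Greedy CTC decoding over per-frame argmax labels.

    Step 1: merge runs of identical consecutive labels.
    Step 2: drop the blank label.
    The result is a label sequence with no per-character boundaries.
    """
    collapsed = [label for label, _ in groupby(frame_labels)]
    return [label for label in collapsed if label != blank]


# Hypothetical per-frame predictions for a short text line.
frames = [0, 3, 3, 0, 0, 3, 7, 7, 0]
decoded = ctc_greedy_decode(frames)
print(decoded)
```

The blank between the two runs of label 3 is what lets a repeated character survive the merge step; without it, the two occurrences would collapse into one.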

