Encoder-decoder models have recently made great progress on handwritten mathematical expression recognition. However, accurately assigning attention to image features remains a challenge for existing methods. Moreover, these encoder-decoder models usually adopt RNN-based decoders, which makes them inefficient at processing long $\LaTeX{}$ sequences. In this paper, a transformer-based decoder is employed to replace the RNN-based ones, which makes the whole model architecture very concise. Furthermore, a novel training strategy is introduced to fully exploit the potential of the transformer in bidirectional language modeling. Compared with several methods that do not use data augmentation, experiments demonstrate that our model improves the ExpRate of the current state-of-the-art methods on CROHME 2014 by 2.23%; on CROHME 2016 and CROHME 2019, it improves the ExpRate by 1.92% and 2.28%, respectively.
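As a rough illustration of the bidirectional training strategy (our own hedged reconstruction; the notation below is assumed and not taken from the paper), each target $\LaTeX{}$ token sequence $y = (y_1, \dots, y_T)$ can be paired with its reversed copy, yielding two directed targets for a single shared decoder:
\[
\overrightarrow{y} = (\langle\text{sos}\rangle, y_1, \dots, y_T, \langle\text{eos}\rangle),
\qquad
\overleftarrow{y} = (\langle\text{eos}\rangle, y_T, \dots, y_1, \langle\text{sos}\rangle),
\]
and the training objective may simply average the autoregressive negative log-likelihood over both directions given the image features $\mathbf{x}$:
\[
\mathcal{L} = -\tfrac{1}{2}\Bigl(\log p(\overrightarrow{y} \mid \mathbf{x}) + \log p(\overleftarrow{y} \mid \mathbf{x})\Bigr).
\]
Under this sketch, the same transformer decoder serves as both a left-to-right and a right-to-left language model, so no second decoder or extra parameters are required.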