用于校正语法错误的多层进化编码器-Decoder神经网络 (A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction)

We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural approaches on this task as well as strong statistical machine translation-based systems with neural and task-specific features trained on the same data. Our analysis shows the superiority of convolutional neural networks over recurrent neural networks such as long short-term memory (LSTM) networks in capturing the local context via attention, and thereby improving the coverage in correcting grammatical errors. By ensembling multiple models, and incorporating an N-gram language model and edit features via rescoring, our novel method becomes the first neural approach to outperform the current state-of-the-art statistical machine translation-based approach, both in terms of grammaticality and fluency.

翻译：我们利用多层进化编码器-解码器神经网络,改进了文本中的语法、正文和合用差错的自动校正。网络的初始化是嵌入,这些嵌入使用字符N克信息,以更好地适应这项任务。在对通用基准测试数据集(ConNLLL-2014和JFLEG)进行评估时,我们的模型大大优于所有先前关于这项任务的神经方法,以及强大的基于统计的机器翻译系统,这些系统具有同一数据方面的神经和任务特点。我们的分析显示,在通过注意力捕捉本地环境从而改进校正语法错误覆盖面方面,共进神经网络优于长期短期内存(LSTM)网络等经常性神经网络。通过聚合多个模型,并纳入N-gram语言模型和通过重新校正功能,我们的新方法成为第一个超越当前基于统计机的状态翻译方法的神经方法,在语法和流频度方面都是如此。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【AAAI 2019】双曲异构信息网络嵌入，Hyperbolic Heterogeneous Information Network Embedding

专知会员服务

60+阅读 · 2020年6月28日

【综述论文】A Survey on Dynamic Network Embedding，动态网络嵌入综述论文

专知会员服务

101+阅读 · 2020年6月16日

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

专知会员服务

110+阅读 · 2020年2月22日