Machine Translation Quality Estimation (QE) is the task of predicting the quality of machine translations without relying on any reference. Recently, the predictor-estimator framework has achieved promising QE performance by training the predictor as a feature extractor, which leverages extra parallel corpora that carry no QE labels. However, we argue that there are gaps between the predictor and the estimator in both data quality and training objectives, which prevent QE models from benefiting more directly from large parallel corpora. We propose a novel framework called DirectQE that provides direct pretraining for QE tasks. In DirectQE, a generator is trained to produce pseudo data that is closer to real QE data, and a detector is pretrained on these data with novel objectives that are akin to the QE task. Experiments on widely used benchmarks show that DirectQE outperforms existing methods without using any pretrained models such as BERT. We also give extensive analyses showing how fixing the two gaps contributes to our improvements.
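To make the generator-detector idea concrete, below is a minimal, hypothetical sketch of the pseudo-data construction step it implies: a generator proposes alternative tokens for some positions in a reference translation, replaced tokens are labeled BAD, and a detector would then be pretrained to recover these word-level labels, mirroring the QE task. The function names (`make_pseudo_qe_example`, `toy_generator`), the replacement probability, and the synonym-table generator are illustrative assumptions only; the paper's actual generator is a model trained on parallel corpora, not a lookup table.

```python
import random

def make_pseudo_qe_example(target_tokens, generator_sample, replace_prob=0.15):
    """Corrupt a reference translation into a pseudo MT output with OK/BAD labels.

    A random subset of positions is rewritten by the generator; tokens the
    generator actually changes are labeled BAD, all others OK. The resulting
    (pseudo translation, labels) pair serves as pretraining data for a detector.
    """
    pseudo, labels = [], []
    for tok in target_tokens:
        if random.random() < replace_prob:
            new_tok = generator_sample(tok)  # generator proposes a plausible alternative
            pseudo.append(new_tok)
            # If the generator happens to reproduce the original token, it is still OK.
            labels.append("OK" if new_tok == tok else "BAD")
        else:
            pseudo.append(tok)
            labels.append("OK")
    return pseudo, labels

# Toy stand-in for the trained generator: sample from a small synonym table.
SYNONYMS = {"big": ["large", "huge"], "fast": ["quick", "rapid"]}

def toy_generator(tok):
    return random.choice(SYNONYMS.get(tok, [tok]))

if __name__ == "__main__":
    reference = "the big dog runs fast".split()
    pseudo, labels = make_pseudo_qe_example(reference, toy_generator, replace_prob=0.5)
    print(list(zip(pseudo, labels)))
```

Because the corruptions come from a generator rather than random noise, the pseudo translations resemble real MT errors more closely, which is the data-quality gap the abstract refers to; pretraining the detector to spot them is the objective-level fix.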