With recent developments in the field of Natural Language Processing, there has been a rise in the use of different architectures for Neural Machine Translation. Transformer architectures achieve state-of-the-art accuracy, but they are computationally expensive to train, and not everyone has access to setups with high-end GPUs and other such resources. We train our models on low computational resources and investigate the results. As expected, transformers outperformed the other architectures, but there were some surprising results: transformers with more encoder and decoder layers took longer to train yet achieved lower BLEU scores. The LSTM performed well in our experiments and took comparatively less time to train than the transformers, making it suitable for situations with time constraints.