While recent advances in deep learning led to significant improvements in machine translation, neural machine translation is often still not able to continuously adapt to the environment. For humans, as well as for machine translation, bilingual dictionaries are a promising knowledge source to continuously integrate new knowledge. However, their exploitation poses several challenges: The system needs to be able to perform one-shot learning as well as model the morphology of source and target language. In this work, we proposed an evaluation framework to assess the ability of neural machine translation to continuously learn new phrases. We integrate one-shot learning methods for neural machine translation with different word representations and show that it is important to address both in order to successfully make use of bilingual dictionaries. By addressing both challenges we are able to improve the ability to translate new, rare words and phrases from 30% to up to 70%. The correct lemma is even generated by more than 90%.
翻译:虽然最近深层学习的进展导致机器翻译方面的重大改进,神经机翻译往往仍然无法持续适应环境。对于人类以及机器翻译来说,双语词典是持续整合新知识的有希望的知识来源。然而,其利用带来了若干挑战:系统需要能够进行一线学习,并模拟源和目标语言的形态学。在这项工作中,我们提出了一个评价框架,以评估神经机翻译不断学习新词的能力。我们把神经机翻译的一线学习方法与不同的文字表述结合起来,并表明为了成功使用双语词典,两者都很重要。通过应对这两个挑战,我们能够提高翻译新、稀有和短语的能力,从30%到70%不等。正确的利玛甚至由90%以上的人产生。