We present a literature and empirical survey that critically assesses the state of the art in character-level modeling for machine translation (MT). Despite evidence in the literature that character-level systems are comparable with subword systems, they are virtually never used in competitive setups in WMT competitions. We empirically show that even with recent modeling innovations in character-level natural language processing, character-level MT systems still struggle to match their subword-based counterparts. Character-level MT systems show neither better domain robustness, nor better morphological generalization, despite being often so motivated. However, we are able to show robustness towards source side noise and that translation quality does not degrade with increasing beam size at decoding time.
翻译:我们提出了一个文献和实证调查,对机器翻译品格模型(MT)的先进程度进行了严格的评估。尽管文献中有证据表明,品格级系统与子词系统具有可比性,但实际上从未用于WMT竞赛中的竞争性设置。我们的经验显示,即使最近对品格级自然语言处理进行模拟创新,品格级MT系统仍然难以与基于子词的对等系统相匹配。 品格级MT系统既未显示出更好的域性坚固性,也未显示出更好的形态概括性,尽管这种系统往往具有这样的动机。 但我们能够显示对源边噪音的稳健性,而且翻译质量不会随着在解码时不断增大的光束大小而下降。