We investigate the behavior of maps learned by machine translation methods. The maps translate words by projecting between word embedding spaces of different languages. We locally approximate these maps using linear maps, and find that they vary across the word embedding space. This demonstrates that the underlying maps are non-linear. Importantly, we show that the locally linear maps vary by an amount that is tightly correlated with the distance between the neighborhoods on which they are trained. Our results can be used to test non-linear methods, and to drive the design of more accurate maps for word translation.
翻译:我们调查了通过机器翻译方法学习的地图的行为。 地图通过在不同语言的字嵌入空间之间投影来翻译文字。 我们用线性地图在本地比较这些地图, 发现这些地图在嵌入空间的字上各不相同。 这说明基础地图是非线性地图。 重要的是, 我们显示本地线性地图的大小与所培训的邻里之间的距离密切相关。 我们的结果可以用来测试非线性方法, 并驱动更准确的文字翻译地图的设计。