Although emotions are universal concepts, transferring the different shades of emotion from one language to another may not always be straightforward for human translators, let alone for machine translation systems. Moreover, the cognitive states are established by verbal explanations of experience which is shaped by both the verbal and cultural contexts. There are a number of verbal contexts where expression of emotions constitutes the pivotal component of the message. This is particularly true for User-Generated Content (UGC) which can be in the form of a review of a product or a service, a tweet, or a social media post. Recently, it has become common practice for multilingual websites such as Twitter to provide an automatic translation of UGC to reach out to their linguistically diverse users. In such scenarios, the process of translating the user's emotion is entirely automatic with no human intervention, neither for post-editing nor for accuracy checking. In this research, we assess whether automatic translation tools can be a successful real-life utility in transferring emotion in user-generated multilingual data such as tweets. We show that there are linguistic phenomena specific of Twitter data that pose a challenge in translation of emotions in different languages. We summarise these challenges in a list of linguistic features and show how frequent these features are in different language pairs. We also assess the capacity of commonly used methods for evaluating the performance of an MT system with respect to the preservation of emotion in the source text.


翻译:虽然情绪是普遍性的概念,但将不同情绪的阴影从一种语言转移到另一种语言,对翻译者来说可能并不总是直截了当,更不要说机器翻译系统。此外,认知状态是由口头和文化背景所塑造的经验的口头解释而建立的。有一些口头背景,情感的表达构成信息的关键组成部分。对于用户感化内容(UGC)来说尤其如此,这种内容可以采取审查产品或服务、推文或社交媒体文章的形式。最近,Twitter等多语种网站的常见做法是提供UGC的自动翻译,以便接触其语言多样性的用户。在这种情况下,翻译用户情感的过程是完全自动的,没有人的干预,既不用于编辑后或准确性检查。在这项研究中,我们评估自动翻译工具能否在以用户感化多语种数据(如推特)传递情感方面成功地发挥真实的效用。我们显示,在翻译不同语言的情感翻译中,有特定的语言现象构成挑战。我们用语言特性来评估这些常态语言特征,并用这些常态语言特征来评估。

0
下载
关闭预览

相关内容

Twitter(推特)是一个社交网络及微博客服务的网站。它利用无线网络,有线网络,通信技术,进行即时通讯,是微博客的典型应用。
专知会员服务
40+阅读 · 2020年9月6日
Keras François Chollet 《Deep Learning with Python 》, 386页pdf
专知会员服务
154+阅读 · 2019年10月12日
机器学习入门的经验与建议
专知会员服务
94+阅读 · 2019年10月10日
最新BERT相关论文清单,BERT-related Papers
专知会员服务
53+阅读 · 2019年9月29日
已删除
将门创投
3+阅读 · 2019年1月15日
Arxiv
7+阅读 · 2018年1月30日
Arxiv
5+阅读 · 2018年1月30日
VIP会员
相关资讯
已删除
将门创投
3+阅读 · 2019年1月15日
Top
微信扫码咨询专知VIP会员