People associate affective meanings to words - "death" is scary and sad while "party" is connotated with surprise and joy. This raises the question if the association is purely a product of the learned affective imports inherent to semantic meanings, or is also an effect of other features of words, e.g., morphological and phonological patterns. We approach this question with an annotation-based analysis leveraging nonsense words. Specifically, we conduct a best-worst scaling crowdsourcing study in which participants assign intensity scores for joy, sadness, anger, disgust, fear, and surprise to 272 non-sense words and, for comparison of the results to previous work, to 68 real words. Based on this resource, we develop character-level and phonology-based intensity regressors. We evaluate them on both nonsense words and real words (making use of the NRC emotion intensity lexicon of 7493 words), across six emotion categories. The analysis of our data reveals that some phonetic patterns show clear differences between emotion intensities. For instance, s as a first phoneme contributes to joy, sh to surprise, p as last phoneme more to disgust than to anger and fear. In the modelling experiments, a regressor trained on real words from the NRC emotion intensity lexicon shows a higher performance (r = 0.17) than regressors that aim at learning the emotion connotation purely from nonsense words. We conclude that humans do associate affective meaning to words based on surface patterns, but also based on similarities to existing words ("juy" to "joy", or "flike" to "like").
翻译:将“ 死亡” 和“ 死亡” 等字联系起来, 让人感到恐惧和悲伤, 而“ 党” 则令人惊讶和喜悦。 这就提出了这样一个问题: 协会是否纯粹是语义含义所固有的、 或同时也是其他词特征的影响, 例如形态学和声调模式。 我们用一个以注解为基础的分析, 利用无稽的词来处理这个问题。 具体地说, 我们用一个以注解为基础的分析, 来处理这个问题。 我们用六种情感类别来进行一个最差劲的扩大众包包罗式研究, 参与者在欢乐、 悲伤、 愤怒、 厌恶、 恐惧和惊喜中指定强度的强度分数。 例如, 272个非感知性词汇, 比较结果与先前工作相比, 和68个真实的词汇。 基于此资源, 我们开发了个性层次和声调的调级和声调的强度反向。 我们用一个无稽的词来评价它们( 使用NRC 情绪强度 ) 和真实的语系, 结束一个真实的 。