We describe EmoBank, a corpus of 10k English sentences balanced across multiple genres, which we annotated with dimensional emotion metadata in the Valence-Arousal-Dominance (VAD) representation format. EmoBank stands out through its bi-perspectival and bi-representational design: on the one hand, we distinguish between the writer's and the reader's emotions; on the other hand, a subset of the corpus complements the dimensional VAD annotations with categorical ones based on Basic Emotions. We find evidence for the superiority of the reader's perspective in terms of inter-annotator agreement (IAA) and rating intensity, and achieve close-to-human performance when mapping between dimensional and categorical formats.