Telling stories is an integral part of human communication which can evoke emotions and influence the affective states of the audience. Automatically modelling emotional trajectories in stories has thus attracted considerable scholarly interest. However, as most existing works have been limited to unsupervised dictionary-based approaches, there is no labelled benchmark for this task. We address this gap by introducing continuous valence and arousal annotations for an existing dataset of children's stories annotated with discrete emotion categories. We collect additional annotations for this data and map the originally categorical labels to the valence and arousal space. Leveraging recent advances in Natural Language Processing, we propose a set of novel Transformer-based methods for predicting valence and arousal signals over the course of written stories. We explore several strategies for fine-tuning a pretrained ELECTRA model and study the benefits of considering a sentence's context when inferring its emotionality. Moreover, we experiment with additional LSTM and Transformer layers. The best configuration achieves a Concordance Correlation Coefficient (CCC) of .7338 for valence and .6302 for arousal on the test set, demonstrating the suitability of our proposed approach. Our code and additional annotations are made available at https://github.com/lc0197/emotion_modelling_stories.
翻译:讲述故事是人类交流的一个组成部分,可以唤起情感,影响观众的情感状态。自动模拟故事中的情感轨迹,因此吸引了相当的学术兴趣。然而,由于大多数现有作品都局限于不受监督的字典方法,因此没有关于这项任务的标记基准。我们通过为儿童故事的现有数据集引入持续的价值和令人振奋的注释来弥补这一差距,并配有离散的情感类别。我们为这些数据收集更多的说明,并绘制原始的直线标签以显示价值和振奋空间。利用自然语言处理的最新进展,我们提出了一套新型的基于变异器的方法,用于预测价值和在书面故事过程中的令人振奋的信号。我们探索了几项战略,以微调一个未经培训的 ELECTRA 模型,并研究在判断其情感特性时考虑判决背景的好处。此外,我们实验了更多的LSTM和变异体层次。最佳配置实现了调调调调调调调调调(CCC) 7338 以预测价值和振奋性信号,我们在书面故事过程中提出了一套测试规则。02 我们的调调调调校准和图示。