Portraying emotion and trustworthiness is known to increase the appeal of video content. However, the causal relationship between these signals and online user engagement is not well understood. This limited understanding is partly due to a scarcity in emotionally annotated data and the varied modalities which express user engagement online. In this contribution, we utilise a large dataset of YouTube review videos which includes ca. 600 hours of dimensional arousal, valence and trustworthiness annotations. We investigate features extracted from these signals against various user engagement indicators including views, like/dislike ratio, as well as the sentiment of comments. In doing so, we identify the positive and negative influences which single features have, as well as interpretable patterns in each dimension which relate to user engagement. Our results demonstrate that smaller boundary ranges and fluctuations for arousal lead to an increase in user engagement. Furthermore, the extracted time-series features reveal significant (p<0.05) correlations for each dimension, such as, count below signal mean (arousal), number of peaks (valence), and absolute energy (trustworthiness). From this, an effective combination of features is outlined for approaches aiming to automatically predict several user engagement indicators. In a user engagement prediction paradigm we compare all features against semi-automatic (cross-task), and automatic (task-specific) feature selection methods. These selected feature sets appear to outperform the usage of all features, e.g., using all features achieves 1.55 likes per day (Lp/d) mean absolute error from valence; this improves through semi-automatic and automatic selection to 1.33 and 1.23 Lp/d, respectively (data mean 9.72 Lp/d with a std. 28.75 Lp/d).
翻译:已知的触摸情感和信任度可以增加视频内容的吸引力。然而,这些信号和在线用户参与之间的因果关系并没有很好地理解。这种有限的理解部分是由于情感上附加说明的数据缺乏,以及表达用户在线参与的不同模式。在这个贡献中,我们使用一个大型的YouTube审查视频数据集,其中包括600小时的维度振动、价值和信任性说明。我们根据各种用户参与指标,包括观点、类似/不同比率以及评论的绝对感,对从这些信号中提取的特征进行了调查。我们这样做,我们确定了单一特征具有的正负影响,以及每个与用户参与有关的层面的可解释模式。我们的结果表明,振动的边界范围和波动导致用户参与的增加。此外,提取的时间序列特征显示了每个层面的重大关联性(p<0.05),例如,低于信号平均值、峰值(valence)数量(valent)/绝对能量(信任性)。从此过程中,我们有效地结合了旨在自动预测若干用户参与程度的方法,例如:9.preal-creal-creport speal exisional exisional exignal exisional as as asion settain settain settain settild.