Affect modeling is viewed, traditionally, as the process of mapping measurable affect manifestations from multiple modalities of user input to affect labels. That mapping is usually inferred through end-to-end (manifestation-to-affect) machine learning processes. What if, instead, one trains general, subject-invariant representations that consider affect information and then uses such representations to model affect? In this paper we assume that affect labels form an integral part, and not just the training signal, of an affect representation and we explore how the recent paradigm of contrastive learning can be employed to discover general high-level affect-infused representations for the purpose of modeling affect. We introduce three different supervised contrastive learning approaches for training representations that consider affect information. In this initial study we test the proposed methods for arousal prediction in the RECOLA dataset based on user information from multiple modalities. Results demonstrate the representation capacity of contrastive learning and its efficiency in boosting the accuracy of affect models. Beyond their evidenced higher performance compared to end-to-end arousal classification, the resulting representations are general-purpose and subject-agnostic, as training is guided though general affect information available in any multimodal corpus.
翻译:传统上,影响建模被视为影响建模的过程,因为可计量的绘图过程会影响多种用户输入模式的表现形式,从而影响标签。这种绘图通常通过端到端(manifestand-tofect-Affect)的机器学习过程推断出来。如果用一个普通的、主题差异性表述方法来训练考虑影响信息然后使用这种表述方法来影响模型呢?在本文件中,我们假设影响标签是一个影响代表性的组成部分,而不仅仅是培训信号,我们探索如何利用最近的对比性学习模式来发现一般高层次影响应用的演示,以便影响建模。我们为考虑影响信息的培训展示采用了三种不同的监督对比性学习方法。在这项初步研究中,我们测试RECOLA数据集中基于多种模式的用户信息而提出的振动预测方法。结果显示对比性学习的体现能力及其提高影响模型准确性的效率。除了证明与端到端的分类相比,由此产生的表述是通用的和主题的,因为培训是指导任何模式中的一般信息。