In the field of multimodal sentiment analysis (MSA), a few studies have leveraged the inherent modality correlation information stored in samples for self-supervised learning. However, they feed the training pairs in a random order without consideration of difficulty. Without human annotation, the generated training pairs of self-supervised learning often contain noise. If noisy or hard pairs are used for training at the easy stage, the model might be stuck in bad local optimum. In this paper, we inject curriculum learning into weakly supervised modality correlation learning. The weakly supervised correlation learning leverages the label information to generate scores for negative pairs to learn a more discriminative embedding space, where negative pairs are defined as two unimodal embeddings from different samples. To assist the correlation learning, we feed the training pairs to the model according to difficulty by the proposed curriculum learning, which consists of elaborately designed scoring and feeding functions. The scoring function computes the difficulty of pairs using pre-trained and current correlation predictors, where the pairs with large losses are defined as hard pairs. Notably, the hardest pairs are discarded in our algorithm, which are assumed as noisy pairs. Moreover, the feeding function takes the difference of correlation losses as feedback to determine the feeding actions (`stay', `step back', or `step forward'). The proposed method reaches state-of-the-art performance on MSA.
翻译:在多式联运情绪分析(MSA)领域,一些研究利用了在自我监督学习的样本中储存的内在模式相关性信息;然而,它们以随机顺序向培训对对方提供不考虑困难的学习;没有人注解,产生的自监督学习培训对方往往含有噪音;如果在容易的阶段将吵闹或硬对方用于培训,模型可能陷于当地最差的状态;在本文件中,我们将课程学习引入监管不力的模式相关性学习;监管不力的关联学习利用标签信息为负对方生成分数,以学习一个更加歧视性的嵌入空间,将负对方定义为来自不同样本的两种单式嵌入空间;为协助相关学习,我们根据拟议课程学习的难度,将培训对对对方纳入模式,其中包括精心设计的评分和喂养功能;评分功能通过预先培训和当前相关性预测器计算对对方的困难,将损失大的对方确定为硬对方。 值得注意的是,最硬的对方在算法中被丢弃了“最硬的对方”,“最硬的对方”的对方被定义为“从不同的算算法中选择,“制为“反馈”后方,从而决定了“制为“制式”的后向前向前方的制式的状态”的制。