Considering a conversation thread, stance classification aims to identify the opinion (e.g. agree or disagree) of replies towards a given target. The target of the stance is expected to be an essential component in this task, being one of the main factors that make it different from sentiment analysis. However, a recent study shows that a target-oblivious model outperforms target-aware models, suggesting that targets are not useful when predicting stance. This paper re-examines this phenomenon for rumour stance classification (RSC) on social media, where a target is a rumour story implied by the source tweet in the conversation. We propose adversarial attacks in the test data, aiming to assess the models robustness and evaluate the role of the data in the models performance. Results show that state-of-the-art models, including approaches that use the entire conversation thread, overly relying on superficial signals. Our hypothesis is that the naturally high occurrence of target-independent direct replies in RSC (e.g. "this is fake" or just "fake") results in the impressive performance of target-oblivious models, highlighting the risk of target instances being treated as noise during training.
翻译:在对话串中,立场分类旨在确定回复对给定目标的意见(例如同意或不同意)。立场的目标预计是此任务的一个关键组成部分,是使其不同于情感分析的主要因素之一。然而,最近的一项研究显示,一个忽略目标的模型优于目标感知模型,表明目标在预测立场时并不实用。本文重新审视了社交媒体上的谣言立场分类(RSC)这一现象,其中目标是由会话中源推文暗示的谣言故事。我们在测试数据中提出了对抗性攻击,旨在评估模型的鲁棒性并评估数据在模型性能中的作用。结果表明,包括使用整个对话串的方法在内的最先进的模型过度依赖表面信号。我们的假设是,在RSC中,与目标无关的直接回复(例如“这是假的”或只是“假的”)的自然高发生率导致了忽略目标的模型的出色性能,凸显了在训练过程中目标实例被视为噪声的风险。