Deceptive news posts shared in online communities can be detected with NLP models, and much recent research has focused on the development of such models. In this work, we use characteristics of online communities and authors -- the context of how and where content is posted -- to explain the performance of a neural network deception detection model and identify sub-populations who are disproportionately affected by model accuracy or failure. We examine who is posting the content, and where the content is posted to. We find that while author characteristics are better predictors of deceptive content than community characteristics, both characteristics are strongly correlated with model performance. Traditional performance metrics such as F1 score may fail to capture poor model performance on isolated sub-populations such as specific authors, and as such, more nuanced evaluation of deception detection models is critical.
翻译:在线社区共享的欺骗性新闻文章可以通过NLP模式探测出来,最近许多研究都侧重于开发这些模式。在这项工作中,我们使用在线社区和作者的特点 -- -- 如何和在哪里张贴内容的背景 -- -- 来解释神经网络欺骗性检测模型的性能,并查明受模型准确性或失败影响过大的人口群体。我们检查谁在张贴内容,内容张贴到哪里。我们发现作者特征比社区特征更好地预测欺骗性内容,但两者都与模型性能密切相关。F1得分等传统性能衡量标准可能无法捕捉孤立的亚群体,如具体作者的不良模式性能,因此,对欺骗性检测模型进行更细致的评估至关重要。