Although deep neural networks have been widely employed and proven effective in sentiment analysis tasks, it remains challenging for model developers to assess their models for erroneous predictions that might exist prior to deployment. Once deployed, emergent errors can be hard to identify in prediction run-time and impossible to trace back to their sources. To address such gaps, in this paper we propose an error detection framework for sentiment analysis based on explainable features. We perform global-level feature validation with human-in-the-loop assessment, followed by an integration of global and local-level feature contribution analysis. Experimental results show that, given limited human-in-the-loop intervention, our method is able to identify erroneous model predictions on unseen data with high precision.
翻译:尽管在情感分析任务中,深层神经网络已被广泛使用,并证明是有效的,但模型开发者评估其可能部署前存在的错误预测模型仍然具有挑战性。一旦部署,突发错误在预测运行时可能很难确定,无法追溯到其来源。为了弥补这些差距,本文件建议根据可解释的特征,为情绪分析建立一个错误检测框架。我们通过对人与地分析进行全球层面的特征验证,然后将全球和地方层面的特征贡献分析结合起来。实验结果显示,由于对流动中的人的干预有限,我们的方法能够非常精确地识别对不可见数据的错误模型预测。