Ensemble-based debiasing methods have been shown effective in mitigating the reliance of classifiers on specific dataset bias, by exploiting the output of a bias-only model to adjust the learning target. In this paper, we focus on the bias-only model in these ensemble-based methods, which plays an important role but has not gained much attention in the existing literature. Theoretically, we prove that the debiasing performance can be damaged by inaccurate uncertainty estimations of the bias-only model. Empirically, we show that existing bias-only models fall short in producing accurate uncertainty estimations. Motivated by these findings, we propose to conduct calibration on the bias-only model, thus achieving a three-stage ensemble-based debiasing framework, including bias modeling, model calibrating, and debiasing. Experimental results on NLI and fact verification tasks show that our proposed three-stage debiasing framework consistently outperforms the traditional two-stage one in out-of-distribution accuracy.
翻译:在本文中,我们侧重于这些基于共性方法中唯一偏差的模式,这些模式起着重要作用,但在现有文献中没有引起多大注意。理论上,我们证明,对偏差模式的不准确的不确定性估计可能损害偏差性业绩。我们经常地表明,现有的偏差模式在得出准确的不确定性估计方面做得不够。我们受这些发现的影响,建议对偏差模式进行校准,从而实现基于三个阶段的共论偏差框架,包括偏差模型、模型校准和偏差。关于国家学习和事实核查的实验结果显示,我们提议的三阶段偏差框架始终超越传统的分配外精确性两阶段框架。