Label smoothing is a regularization technique widely used in supervised learning to improve the generalization of models on various tasks, such as image classification and machine translation. However, the effectiveness of label smoothing in multi-hop question answering (MHQA) has yet to be well studied. In this paper, we systematically analyze the role of label smoothing on various modules of MHQA and propose F1 smoothing, a novel label smoothing technique specifically designed for machine reading comprehension (MRC) tasks. We evaluate our method on the HotpotQA dataset and demonstrate its superiority over several strong baselines, including models that utilize complex attention mechanisms. Our results suggest that label smoothing can be effective in MHQA, but the choice of smoothing strategy can significantly affect performance.
翻译:标签滑动是一种正规化技术,在监督学习中广泛使用,目的是改进图像分类和机器翻译等各种任务模型的概括化,然而,多点答题(MHQA)中标签滑动的有效性还有待研究,在本文件中,我们系统地分析标签滑动作用在MHQA各模块中的作用,并提议F1滑动,这是一种新颖的标签滑动技术,专门为机器阅读任务设计。我们评估了我们在热点卡数据集上的方法,并展示了它优于若干强力基线,包括使用复杂关注机制的模型。我们的结果表明,标签滑动在MHQA中可能有效,但选择滑动战略会大大影响业绩。