Pre-trained language models have achieved noticeable performance on the intent detection task. However, due to assigning an identical weight to each sample, they suffer from the overfitting of simple samples and the failure to learn complex samples well. To handle this problem, we propose a density-based dynamic curriculum learning model. Our model defines the sample's difficulty level according to their eigenvectors' density. In this way, we exploit the overall distribution of all samples' eigenvectors simultaneously. Then we apply a dynamic curriculum learning strategy, which pays distinct attention to samples of various difficulty levels and alters the proportion of samples during the training process. Through the above operation, simple samples are well-trained, and complex samples are enhanced. Experiments on three open datasets verify that the proposed density-based algorithm can distinguish simple and complex samples significantly. Besides, our model obtains obvious improvement over the strong baselines.
翻译:预先培训的语言模型在意图检测任务上取得了显著的成绩,然而,由于给每个样本都赋予了相同的重量,它们因简单样本的过重和未能很好地学习复杂的样本而受到损害。为了解决这个问题,我们建议了一个基于密度的动态课程学习模型。我们的模型根据样本的精密密度来界定样本的难度水平。这样,我们同时利用所有样本的精液的分布。然后,我们采用动态课程学习战略,对不同难度的样本给予不同的关注,并在培训过程中改变样本的比例。通过上述操作,简单样本经过良好培训,复杂的样本得到加强。对三个开放数据集的实验证实,拟议的基于密度的算法可以显著地区分简单和复杂的样本。此外,我们的模型在强大的基线上取得了明显的改进。