This work explores the effect of noisy sample selection in active learning strategies. We show on both synthetic problems and real-life use-cases that knowledge of the sample noise can significantly improve the performance of active learning strategies. Building on prior work, we propose a robust sampler, Incremental Weighted K-Means that brings significant improvement on the synthetic tasks but only a marginal uplift on real-life ones. We hope that the questions raised in this paper are of interest to the community and could open new paths for active learning research.
翻译:这项工作探索了在积极学习战略中进行吵闹抽样选择的影响。我们从合成问题和实际使用情况中发现,对抽样噪音的了解可以大大改善积极学习战略的绩效。我们在过去的工作的基础上,建议建立一个强大的取样器,即递增加权K-Means,使合成任务大有改进,但实际生活中的任务却只有微小提升。我们希望本文件提出的问题对社区有利,能够开辟积极学习研究的新途径。