While deep learning (DL) is data-hungry and usually relies on extensive labeled data to deliver good performance, Active Learning (AL) reduces labeling costs by selecting a small proportion of samples from unlabeled data for labeling and training. Therefore, Deep Active Learning (DAL) has risen as a feasible solution for maximizing model performance under a limited labeling cost/budget in recent years. Although abundant methods of DAL have been developed and various literature reviews conducted, the performance evaluation of DAL methods under fair comparison settings is not yet available. Our work intends to fill this gap. In this work, We construct a DAL toolkit, DeepAL+, by re-implementing 19 highly-cited DAL methods. We survey and categorize DAL-related works and construct comparative experiments across frequently used datasets and DAL algorithms. Additionally, we explore some factors (e.g., batch size, number of epochs in the training process) that influence the efficacy of DAL, which provides better references for researchers to design their DAL experiments or carry out DAL-related applications.
翻译:虽然深层次学习(DL)是缺乏数据的,通常依靠大量标签数据来提供良好的业绩,但积极学习(AL)通过从未贴标签的数据中选择一小部分样本来进行标签和培训,降低了标签成本,因此,深层次积极学习(DAL)近年来在有限的标签成本/预算下,作为最大限度地发挥模型性能的一个可行解决办法,虽然开发了多种DAL方法并进行了各种文献审查,但在公平比较情况下,尚不具备对DAL方法的绩效评估。我们的工作打算填补这一空白。在这项工作中,我们通过重新采用19种高层次的DAL方法,建立了一个DAL工具包,DeepAL+。我们调查和分类了与DAL相关的工作,并在经常使用的数据集和DAL算法中建立了比较性实验。此外,我们探索了影响DAL效率的一些因素(例如批量尺寸、培训过程中的Peochs数目),这些因素为研究人员设计其DAL实验或开展DAL相关应用提供了更好的参考。