Active learning is an important technology for automated machine learning systems. In contrast to Neural Architecture Search (NAS) which aims at automating neural network architecture design, active learning aims at automating training data selection. It is especially critical for training a long-tailed task, in which positive samples are sparsely distributed. Active learning alleviates the expensive data annotation issue through incrementally training models powered with efficient data selection. Instead of annotating all unlabeled samples, it iteratively selects and annotates the most valuable samples. Active learning has been popular in image classification, but has not been fully explored in object detection. Most of current approaches on object detection are evaluated with different settings, making it difficult to fairly compare their performance. To facilitate the research in this field, this paper contributes an active learning benchmark framework named as ALBench for evaluating active learning in object detection. Developed on an automatic deep model training system, this ALBench framework is easy-to-use, compatible with different active learning algorithms, and ensures the same training and testing protocols. We hope this automated benchmark system help researchers to easily reproduce literature's performance and have objective comparisons with prior arts. The code will be release through Github.
翻译:主动学习是自动机器学习系统的一个重要技术。 与旨在将神经网络结构结构设计自动化的神经结构搜索(NAS)相比,积极学习的目的是将培训数据选择自动化。 积极学习对于培训长尾任务尤为重要,因为积极抽样分布很少。 积极学习通过渐进式培训模型,通过高效数据选择来缓解昂贵的数据注释问题。 它不是对所有未贴标签的样本进行注解,而是迭接式选择,点注最有价值的样本。 积极学习在图像分类方面很受欢迎,但在物体探测方面没有得到充分探索。 目前对物体探测的方法大多是用不同设置来评估的,因此很难公平地比较其性能。 为了便利这一领域的研究,本文提供了名为ALBench的积极学习基准框架,用于评价在物体探测中的积极学习。 这个框架在自动深度培训系统上开发,很容易使用,与不同的积极学习算法相兼容,并确保同样的培训和测试协议。 我们希望这个自动基准系统有助于研究人员容易复制文献的性能,并且通过以前的艺术进行客观比较。