We provide a stochastic strategy for adapting well-known acquisition functions to allow batch active learning. In deep active learning, labels are often acquired in batches for efficiency. However, many acquisition functions are designed for single-sample acquisition and fail when naively used to construct batches. In contrast, state-of-the-art batch acquisition functions are costly to compute. We show how to extend single-sample acquisition functions to the batch setting. Instead of acquiring the top-K points from the pool set, we account for the fact that acquisition scores are expected to change as new points are acquired. This motivates simple stochastic acquisition strategies using score-based or rank-based distributions. Our strategies outperform the standard top-K acquisition with virtually no computational overhead and can be used as a drop-in replacement. In fact, they are even competitive with much more expensive methods despite their linear computational complexity. We conclude that there is no reason to use top-K batch acquisition in practice.
翻译:我们为调整众所周知的购置功能以允许批量积极学习提供了一种随机化战略,在深入积极学习中,标签往往以批量方式获得,以提高效率。然而,许多购置功能是为单一抽样获取而设计的,在天真地用来制造批量时失败。相比之下,最先进的批量获取功能的计算成本很高。我们展示了如何将单个抽样获取功能扩大到批量设置。我们没有从集合组中获取最高K点,而是考虑到随着新点的获取,预期获取分将发生变化。这促使采用基于分数或按级分配的简单随机收购策略。我们的策略超过了标准的顶级采购,几乎没有计算间接费用,可以用作空置替换。事实上,尽管其线性计算复杂,它们甚至具有更昂贵的竞争力。我们的结论是,在实践中没有理由使用最高K批量获取。