In this paper, we study a sequential decision-making problem, called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal to maximize the sum of responses. This problem has wide applications to real-world discovery problems, for example drug discovery with the help of machine learning models. ASD algorithms face the well-known exploration-exploitation dilemma. The algorithm needs to choose points that yield information to improve model estimates but it also needs to exploit the model. We rigorously formulate the problem and propose a general information-directed sampling (IDS) algorithm. We provide theoretical guarantees for the performance of IDS in linear, graph and low-rank models. The benefits of IDS are shown in both simulation experiments and real-data experiments for discovering chemical reaction conditions.
翻译:在本文中,我们研究一个顺序决策问题,称为“探索的适应性抽样(ASD) ” 。从大型无标签数据集开始,ASD的算法适应性地标出各个点,目的是最大限度地增加反应的总和。这个问题在现实世界发现问题中广泛应用,例如在机器学习模型的帮助下发现药物。ASD的算法面临着众所周知的探索-开发困境。算法需要选择能够产生信息的点来改进模型估计,但也需要利用模型。我们严格地拟订问题,并提出一般的信息导向抽样算法(IDS) 。我们为IDS在线性、图形和低级别模型中的性能提供了理论保证。IDS的好处在模拟实验和发现化学反应条件的实际数据实验中都显示出来。