The problem of how to genetically modify cells in order to maximize a certain cellular phenotype has taken center stage in drug development over the last few years (with, for example, genetically edited CAR-T, CAR-NK, and CAR-NKT cells entering cancer clinical trials). Exhausting the search space for all possible genetic edits (perturbations) or combinations thereof is infeasible due to cost and experimental limitations. This work provides a theoretically sound framework for iteratively exploring the space of perturbations in pooled batches in order to maximize a target phenotype under an experimental budget. Inspired by this application domain, we study the problem of batch query bandit optimization and introduce the Optimistic Arm Elimination ($\mathrm{OAE}$) principle designed to find an almost optimal arm under different functional relationships between the queries (arms) and the outputs (rewards). We analyze the convergence properties of $\mathrm{OAE}$ by relating it to the Eluder dimension of the algorithm's function class and validate that $\mathrm{OAE}$ outperforms other strategies in finding optimal actions in experiments on simulated problems, public datasets well-studied in bandit contexts, and in genetic perturbation datasets when the regression model is a deep neural network. OAE also outperforms the benchmark algorithms in 3 of 4 datasets in the GeneDisco experimental planning challenge.
翻译:如何基因修改细胞以便最大限度地增加某种细胞苯菌类型的问题在过去几年药物开发中占据了中心位置(例如,遗传编辑的CAR-T、CAR-NK、CAR-NK和CAR-NKT细胞进入癌症临床试验);由于成本和实验限制,无法利用所有可能的基因编辑(干涉)或其组合的搜索空间。这项工作为迭接探索集合批次中扰动空间提供了一个理论上健全的框架,以便在实验预算下最大限度地增加目标型苯菌类。受这个应用域的启发,我们研究批次查询波状优化问题,并引入最佳 Arm消除(mathrm{OAE}$)原则,目的是在查询(武器)和产出(上层)之间的不同功能关系下找到几乎最佳的臂。我们分析了美元模型{OA}的趋同特性,将它与算法类的 Eluder 维度维度联系起来,并验证 $mathrem{OAAE$在深度的磁度实验中,在模拟的磁带基数据系统中,在模拟的实验中,在模拟模型中,在模拟基底基数中,数据系统中的模型分析其他数据系统中的模型中,也在其他数据系统中的模型中发现其他数据系统。