Antibodies are canonically Y-shaped multimeric proteins capable of highly specific molecular recognition. The CDRH3 region located at the tip of variable chains of an antibody dominates antigen-binding specificity. Therefore, it is a priority to design optimal antigen-specific CDRH3 regions to develop therapeutic antibodies to combat harmful pathogens. However, the combinatorial nature of CDRH3 sequence space makes it impossible to search for an optimal binding sequence exhaustively and efficiently, especially not experimentally. Here, we present AntBO: a Combinatorial Bayesian Optimisation framework enabling efficient in silico design of the CDRH3 region. Ideally, antibodies should bind to their target antigen and be free from any harmful outcomes. Therefore, we introduce the CDRH3 trust region that restricts the search to sequences with feasible developability scores. To benchmark AntBO, we use the Absolut! software suite as a black-box oracle because it can score the target specificity and affinity of designed antibodies in silico in an unconstrained fashion. The results across 188 antigens demonstrate the benefit of AntBO in designing CDRH3 regions with diverse biophysical properties. In under 200 protein designs, AntBO can suggest antibody sequences that outperform the best binding sequence drawn from 6.9 million experimentally obtained CDRH3s and a commonly used genetic algorithm baseline. Additionally, AntBO finds very-high affinity CDRH3 sequences in only 38 protein designs whilst requiring no domain knowledge. We conclude AntBO brings automated antibody design methods closer to what is practically viable for in vitro experimentation.
翻译:共RH3 区域位于抗体可变链端端端的CDRH3 区域,其抗体约束特性占主导地位。因此,优先设计最佳抗原特有的CDRH3 区域,开发治疗性抗体,以防治有害的病原体。然而,CDRH3 序列空间的组合性质使得无法详尽而有效地寻找一个最佳的、约束性序列,特别是实验性。在这里,我们介绍了安特博:一个能高效地设计CDRH3 区域硅设计效率的混合Bayesian优化框架。理想的是,抗体应粘合其目标抗原,不受任何有害结果的影响。因此,我们引入CDRH3 信任区域,将搜索限制于具有可行开发性分数的序列。我们用Absolout! 软件套件作为黑盒,因为它能分辨目标特性,而且能以不受约束的方式实现防腐蚀的防腐蚀性BO。在188 抗精准的防腐蚀性3 域域域中,要求更接近其目标抗菌性设计型BO 更接近的序列,在设计 CDB3 级的CDBR3 序列中,在设计中, CDBRBR3 最精确的BADRBR3 中只能定序中只能测测测到最精确的BR3 。