We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial. This type of adaptive clinical trial, often referred to as adaptive enrichment design, has been thoroughly studied in biostatistics with a focus on a limited number of subgroups (typically two) which make up (sub)populations, and a small number of interim analysis points. In this paper, we aim to relax classical restrictions on such designs and investigate how to incorporate ideas from the recent machine learning literature on adaptive and online experimentation to make trials more flexible and efficient. We find that the unique characteristics of the subpopulation selection problem -- most importantly that (i) one is usually interested in finding subpopulations with any treatment benefit (and not necessarily the single subgroup with largest effect) given a limited budget and that (ii) effectiveness only has to be demonstrated across the subpopulation on average -- give rise to interesting challenges and new desiderata when designing algorithmic solutions. Building on these findings, we propose AdaGGI and AdaGCPI, two meta-algorithms for subpopulation construction, which focus on identifying good subgroups and good composite subpopulations, respectively. We empirically investigate their performance across a range of simulation scenarios and derive insights into their (dis)advantages across different settings.
翻译:我们研究了在确认临床试验期间从特定治疗中受益的病人亚群的适应性识别问题。这种适应性临床试验,通常称为适应性浓缩设计,已经在生物统计学中进行了彻底研究,重点是构成(子)人口数目有限的少数分组(通常为2个分组),以及少量临时分析点。在本文件中,我们力求放松对此类设计的传统限制,并调查如何纳入最近关于适应性和在线实验的机器学习文献中的想法,以使试验更加灵活和高效。我们发现,亚人口选择问题的独特性 -- -- 最重要的是,(一) 通常有兴趣找到具有任何治疗益处的亚群(不一定是效果最大的单一分组),预算有限,而且(二) 效力仅能平均地显示在整个亚群群群中的有效性 -- -- 产生有趣的挑战,在设计算法解决办法时又出现新的偏斜。我们根据这些调查结果,建议AdaGGI和AdaGCPI, 两种亚人口结构构造的元性分类,其重点是确定良好的分组和良好的复合性预测,分别针对不同背景的模拟和综合性。