精确的顺序单臂试验设计与二元终点的提前终止 (Exact sequential single-arm trial design with curtailment for binary endpoint)

Due to ethical and economical reasons, sequential single-arm trial designs are used for assessing the therapeutic efficacy of new treatments in phase II trials. Simon's 2-stage design and Lan-DeMets' $\alpha$-spending function method with O'Brien-Fleming type are widely recognized as the traditional methods for futility stopping and efficacy stopping, respectively. These methods have two practical problems, which are the difficulty of interpretation for stopping under staggered entry and the inflation of error rate due to a small-sample trial. In this research, we propose the exact sequential design making the threshold value for efficacy fixed, and compare with traditional designs in sample size. Since the maximum sample size and average sample number of the proposed design are generally smaller than those of traditional designs containing fixed design, the proposed design is expected to be enrolled fewer subjects. In addition, we evaluate several kinds of point estimators and confidence intervals at the end of trials in the proposed design. If one is concerned with bias, the bias-adjusted estimator may be better. As for a confidence interval, the mid-p approach will be a good choice.

翻译：由于伦理和经济原因，顺序单臂试验设计被用于评估新治疗方法在二期试验中的疗效。西蒙二阶段设计和Lan-DeMets 的α-消费功能方法（带有O'Brien-Fleming 类型）被广泛认为是传统方法，可以分别用于无用性停止和疗效停止。这些方法有两个实际问题，一个是在分阶段进入条件下进行停止判定的解释困难，还有一种是在小样本试验中存在误差率膨胀的问题。在这项研究中，我们提出了一种精确的依据设定疗效阈值的顺序设计，并在样本大小方面与传统设计进行了比较。由于所提出的设计的最大样本量和平均样本数通常小于那些包含固定设计的传统设计，因此预计所提出的设计将招募较少的受试者。此外，在所提出的设计结束时，我们评估了几种点估计量和置信区间。如果关注偏差，那么偏差调整估计器可能更好。至于置信区间，mid-p 方法将是一个好选择。