Constructing an optimal adaptive treatment strategy becomes complex when there are a large number of potential tailoring variables. In such scenarios, many of these extraneous variables may contribute little or no benefit to an adaptive strategy while increasing implementation costs and putting an undue burden on patients. Although existing methods allow selection of the informative prognostic factors, statistical inference is complicated by the data-driven selection process. To remedy this deficiency, we adapt the Universal Post-Selection Inference procedure to the semiparametric Robust Q-learning method and the unique challenges encountered in such multistage decision methods. In the process, we also identify a uniform improvement to confidence intervals constructed in this post-selection inference framework. Under certain rate assumptions, we provide theoretical results that demonstrate the validity of confidence regions and tests constructed from our proposed procedure. The performance of our method is compared to the Selective Inference framework through simulation studies, demonstrating the strengths of our procedure and its applicability to multiple selection mechanisms.
翻译:当存在大量潜在的定制变量时,建立最佳适应性治疗战略就变得复杂。在这种情况下,许多这些外在变量可能很少或根本没有对适应性战略带来好处,同时增加执行成本和给病人带来过重的负担。虽然现有方法允许选择信息性预测因素,但数据驱动选择过程使统计推论复杂化。为了弥补这一缺陷,我们调整通用选择后推论程序,使之适应半参数性硬质复习方法以及在这种多阶段决策方法中遇到的独特挑战。在这个过程中,我们还确定了在选择后推论框架中构建的信任间隔的统一改进办法。在某些费率假设下,我们提供了理论结果,表明信任区的有效性和根据我们拟议程序构建的测试。我们方法的绩效通过模拟研究与选择性推论框架进行比较,展示我们程序的优点及其对多种选择机制的适用性。