Testing whether a variable of interest affects the outcome is one of the most fundamental problems in statistics. To tackle this problem, the conditional randomization test (CRT) is a design-based method that is widely used to test the independence of a variable of interest (X) with an outcome (Y) holding some controls (Z) fixed. The CRT relies solely on the random iid sampling of (X,Z) to produce exact finite-sample p-values that are constructed using any test statistic. We propose a new method, the adaptive randomization test (AdapRT), that similarly tackles the independence problem but allows the data to be sequentially sampled. Like the CRT, the AdapRT relies solely on knowing the (adaptive) sampling distribution of (X,Z). In this paper, we additionally show the significant power increase by adaptively sampling in two illustrative settings. We first showcase the AdapRT in a particular multi-arm bandit problem known as the normal-mean model. Under this setting, we theoretically characterize the powers of both the iid sampling scheme and the AdapRT and empirically find that the AdapRT can uniformly outperform the typical uniform iid sampling scheme that pulls all arms with equal probability. We also surprisingly find that the AdapRT can be more powerful than even the oracle iid sampling scheme when the signal is relatively strong. We believe that the proposed adaptive procedure is successful mainly because it stabilizes arms that may initially look like "fake" signal. We additionally showcase the AdapRT to a popular factorial survey design setting known as conjoint analysis and find similar results through both simulations and application. Lastly, we also provide a power analysis pipeline for practitioners to diagnosis the effectiveness of their proposed adaptive procedures and apply the pipeline to the two aforementioned settings.
翻译:为解决这一问题,有条件随机测试(CRT)是一种基于设计的方法,它被广泛用于测试一个利益变量(X)的独立性(X),结果(Y)保持某种控制(Z)固定。CRT完全依靠随机的iid抽样(X,Z)来生成精确的定点抽样(Sample p-value),这是使用任何测试统计来构建的。我们提出了一种新的方法,即适应性随机测试(AdapRT),它同样解决独立问题,但允许对数据进行连续抽查。与CRT一样,AdapRT完全依靠了解一个利益变量(X,Z)的独立性,结果(Y)保持某种控制(Z)固定。在本文中,我们进一步展示了在两个说明性环境中通过随机随机抽样(X,Z)来生成精确的能量。我们首先将AdapRT展示给一个特殊的多波段问题,即通常的模型。在这个背景下,我们从理论上界定了iid取样方案和AdapRT的信号的两种能力,并且从实验性角度发现,AdapRT的精确性分析,因为我们所认识的精确的精确性分析方法能够将所有的精确地显示,我们所认识的精确的概率都能够使Adravilismid 。