$P$-values that are derived from continuously distributed test statistics are typically uniformly distributed on $(0,1)$ under least favorable parameter configurations (LFCs) in the null hypothesis. Conservativeness of a $p$-value $P$ (meaning that $P$ is under the null hypothesis stochastically larger than a random variable which is uniformly distributed on $(0,1)$) can occur if the test statistic from which $P$ is derived is discrete, or if the true parameter value under the null is not an LFC. To deal with both of these sources of conservativeness, we present two approaches utilizing randomized $p$-values, namely single-stage and two-stage randomization. We illustrate their effectiveness for testing a composite null hypothesis under a binomial model. We also give an example of how the proposed $p$-values can be used to test a composite null in group testing designs. Similar to previous findings, we find that the proposed randomized $p$-values are less conservative compared to non-randomized $p$-values under the null hypothesis, but that they are stochastically not smaller under the alternative. The problem of establishing the validity of randomized $p$-values is not trivial and has received attention in previous literature. We show that our proposed randomized $p$-values are valid under various discrete statistical models which are such that the distribution of the corresponding test statistic belongs to an exponential family. The behaviour of the power function for the tests based on the proposed randomized $p$-values as a function of the sample size is also investigated. Simulations and a real data analysis are used to compare the different considered $p$-values.
翻译:从连续分发的测试统计数据中得出的美元价值,通常均以美元(0,1美元)在无效假设中最不有利的参数配置(LFCs)下以美元(0,1美元)统一分配。美元价值(P$)的保守性比以美元(0,1美元)统一分配的随机变量大得多(指美元值比以美元(0,1美元)统一分配的随机变量大得多),如果得出美元值的测试统计数据是离散的,或者如果无效数据下的真实参数值不是LFC。为了处理这两种保守性来源,我们提出了两种方法,即使用随机化美元(美元)的汇率值(LFCs),即单阶段和两阶段随机化的美元值。我们展示了在二进制模型下测试一个复合的无效假设值(美元)是否有效。我们还举了一个例子,说明在组测试设计中如何使用美元价值来测试一个复合的变量。与先前的随机值值相比,我们认为拟议随机值的美元值值与非调整的美元值值是保守的。在无效假设下,但根据无效的Simpal假设,我们提出的统计值的数值的数值是用来评估的。我们之前的随机值的数值是用来判断的。我们之前的随机值的数值的。我们提出的一种随机值的数值的数值,而使用的。我们所要显示的数值是用来显示的数值。</s>