Two-sample tests evaluate whether two samples are realizations of the same distribution (the null hypothesis) or two different distributions (the alternative hypothesis). We consider a new setting for this problem where sample features are easily measured whereas sample labels are unknown and costly to obtain. Accordingly, we devise a three-stage framework in service of performing an effective two-sample test with only a small number of sample label queries: first, a classifier is trained with samples uniformly labeled to model the posterior probabilities of the labels; second, a novel query scheme dubbed \emph{bimodal query} is used to query labels of samples from both classes, and last, the classical Friedman-Rafsky (FR) two-sample test is performed on the queried samples. Theoretical analysis and extensive experiments performed on several datasets demonstrate that the proposed test controls the Type I error and has decreased Type II error relative to uniform querying and certainty-based querying. Source code for our algorithms and experimental results is available at \url{https://github.com/wayne0908/Label-Efficient-Two-Sample}.
翻译:两次抽样测试评估两个样本是同一分布(无效假设)的实现还是两种不同分布(替代假设)的实现。我们考虑为这一问题设置一个新的环境,即可以很容易地测量样品特征,而样品标签又不为人所知,而且获取成本很高。因此,我们设计了一个三阶段框架,用于进行有效的两层抽样测试,只有少量的抽样标签查询:首先,对一个分类员进行培训,样品统一标签标签,以模拟标签的后方概率;其次,使用一个称为emph{bimodal查询}的新奇的查询办法,以查询两个类别样本的标签,最后,对被查询的样品进行经典的Friedman-Rafsky(FR)两层抽样测试。在几个数据集上进行的理论分析和广泛实验表明,拟议的测试控制了第一类错误,减少了二类错误,与统一的查询和基于确定性的查询相比。我们的算法和实验结果的源代码可在以下网站查阅:<url{https://github.com/way09/ple0908/LAffeal-Ambel_Effalicial_E.