活动连续两个抽样测试 (Active Sequential Two-Sample Testing) - 专知论文

会员服务 ·

0

样本 · 标注 · 统计量 · 分类模型 · 似然 ·

2023 年 2 月 2 日

Active Sequential Two-Sample Testing

翻译：活动连续两个抽样测试

Weizhi Li,Karthikeyan Natesan Ramamurthy,Prad Kadambi,Pouria Saidi,Gautam Dasarathy,Visar Berisha

Two-sample testing tests whether the distributions generating two samples are identical. We pose the two-sample testing problem in a new scenario where the sample measurements (or sample features) are inexpensive to access, but their group memberships (or labels) are costly. We devise the first \emph{active sequential two-sample testing framework} that not only sequentially but also \emph{actively queries} sample labels to address the problem. Our test statistic is a likelihood ratio where one likelihood is found by maximization over all class priors, and the other is given by a classification model. The classification model is adaptively updated and then used to guide an active query scheme called bimodal query to label sample features in the regions with high dependency between the feature variables and the label variables. The theoretical contributions in the paper include proof that our framework produces an \emph{anytime-valid} $p$-value; and, under reachable conditions and a mild assumption, the framework asymptotically generates a minimum normalized log-likelihood ratio statistic that a passive query scheme can only achieve when the feature variable and the label variable have the highest dependence. Lastly, we provide a \emph{query-switching (QS)} algorithm to decide when to switch from passive query to active query and adapt bimodal query to increase the testing power of our test. Extensive experiments justify our theoretical contributions and the effectiveness of QS.

翻译：以两个模样测试产生两个样本的分布是否完全相同。我们在一个新的假设中提出两个模样测试问题, 在新的假设中, 样本测量( 或样本特征) 价格低廉, 但其组成员( 或标签) 费用高。我们设计了第一个 emph{ 活性顺序顺序双样样测试框架}, 不仅按顺序进行测试, 而且还按顺序进行 emph{ 活性询问} 样本标签, 以解决问题。我们的测试统计是一个可能性比, 一种可能性是通过对所有类前题的最大化发现, 而另一种可能性则由分类模型提供。分类模型是适应性更新的, 然后用来指导一个名为双调查询的动态查询方案, 在特性变量和标签变量变量变量变量之间高度依赖的区域, 本文的理论贡献包括证明我们的框架产生了一个 emph{ 时间- valid} $p- 价值 ; 在可实现的条件和温和假设下, 框架产生一个最小的标准化的逻辑比值比值比值比值比值比值比值比值比值比值比值比值比值比值比值比值比值比值。当我们进行被动的试性查询方案在特性变数测试时, 我们的变数和变数调算算算算算算算算算算算算算法的系统只能的系统只能在的系统只能只能只能在最后才算算算算算算算算算算算算算算算算算算算得得得得得最高时, Q。

0

相关内容

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

基于蛋白组学技术筛选卵巢癌早期诊断生物学标记物的前瞻性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

基于WorldView-3和OP-ELM的矿化蚀变提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

高超声速再入滑翔飞行器多约束全局滑模精确制导研究

国家自然科学基金

5+阅读 · 2013年12月31日

石墨烯-无机粒子协同增强PVDF电介质的构筑、电性能及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Dectin-1受体识别的酵母葡聚糖酶解片段的链结构及构效关系的研究

国家自然科学基金

0+阅读 · 2013年12月31日

免疫促凋亡分子用于治疗HBsAg阳性肝细胞癌的研究

国家自然科学基金

0+阅读 · 2012年12月31日

二元混合溶剂中聚异丙基丙烯酰胺与溶剂分子的相互作用

国家自然科学基金

0+阅读 · 2009年12月31日

SDF-1/CXCR4信号通路的干预及调节关节软骨退变的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Detecting Interference in A/B Testing with Increasing Allocation

Arxiv

0+阅读 · 2023年3月24日

Semantic Prompt for Few-Shot Image Recognition

Arxiv

0+阅读 · 2023年3月24日

The SPDE approach for spatio-temporal datasets with advection and diffusion

Arxiv

0+阅读 · 2023年3月24日

Bivariate Distribution Regression with Application to Insurance Data

Arxiv

0+阅读 · 2023年3月24日

Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test

Arxiv

0+阅读 · 2023年3月24日

Enhancement of theColor Image Compression Using a New Algorithm based on Discrete Hermite Wavelet Transform

Arxiv

0+阅读 · 2023年3月23日

Learning a Practical SDR-to-HDRTV Up-conversion using New Dataset and Degradation Models

Arxiv

0+阅读 · 2023年3月23日

Evaluating the Role of Target Arguments in Rumour Stance Classification

Arxiv

0+阅读 · 2023年3月22日

Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification

Arxiv

0+阅读 · 2023年3月22日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

赋能真实世界：基于大语言模型的产业智能体技术、实践与评测综述

军事行动中人工智能系统目标交战的附带损伤评估模型 | 最新文献

【普林斯顿博士论文】面向人本机器人学的安全与学习博弈论融合

美陆军协会（AUSA）2025 年会公布的美国十大武器与防务产品创新

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Detecting Interference in A/B Testing with Increasing Allocation

Arxiv

0+阅读 · 2023年3月24日

Semantic Prompt for Few-Shot Image Recognition

Arxiv

0+阅读 · 2023年3月24日

The SPDE approach for spatio-temporal datasets with advection and diffusion

Arxiv

0+阅读 · 2023年3月24日

Bivariate Distribution Regression with Application to Insurance Data

Arxiv

0+阅读 · 2023年3月24日

Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test

Arxiv

0+阅读 · 2023年3月24日

Enhancement of theColor Image Compression Using a New Algorithm based on Discrete Hermite Wavelet Transform

Arxiv

0+阅读 · 2023年3月23日

Learning a Practical SDR-to-HDRTV Up-conversion using New Dataset and Degradation Models

Arxiv

0+阅读 · 2023年3月23日

Evaluating the Role of Target Arguments in Rumour Stance Classification

Arxiv

0+阅读 · 2023年3月22日

Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification

Arxiv

0+阅读 · 2023年3月22日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

基于蛋白组学技术筛选卵巢癌早期诊断生物学标记物的前瞻性研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

基于WorldView-3和OP-ELM的矿化蚀变提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

高超声速再入滑翔飞行器多约束全局滑模精确制导研究

国家自然科学基金

5+阅读 · 2013年12月31日

石墨烯-无机粒子协同增强PVDF电介质的构筑、电性能及机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Dectin-1受体识别的酵母葡聚糖酶解片段的链结构及构效关系的研究

国家自然科学基金

0+阅读 · 2013年12月31日

免疫促凋亡分子用于治疗HBsAg阳性肝细胞癌的研究

国家自然科学基金

0+阅读 · 2012年12月31日

二元混合溶剂中聚异丙基丙烯酰胺与溶剂分子的相互作用

国家自然科学基金

0+阅读 · 2009年12月31日

SDF-1/CXCR4信号通路的干预及调节关节软骨退变的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员