顺序内内心独立测试 (Sequential Kernelized Independence Testing) - 专知论文

会员服务 ·

0

相互独立的 · 核化 · 统计量 · 准则 · 情景 ·

2022 年 12 月 14 日

Sequential Kernelized Independence Testing

翻译：顺序内内心独立测试

Aleksandr Podkopaev,Patrick Blöbaum,Shiva Prasad Kasiviswanathan,Aaditya Ramdas

Independence testing is a fundamental and classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the complexity of a problem at hand instead of setting sample size in advance. Ideally, such procedures should (a) allow stopping earlier on easy tasks (and later on harder tasks), hence making better use of available resources, and (b) continuously monitor the data and efficiently incorporate statistical evidence after collecting new data, while controlling the false alarm rate. It is well known that classical batch tests are not tailored for streaming data settings, since valid inference after data peeking requires correcting for multiple testing, but such corrections generally result in low power. In this paper, we design sequential kernelized independence tests (SKITs) that overcome such shortcomings based on the principle of testing by betting. We exemplify our broad framework using bets inspired by kernelized dependence measures such as the Hilbert-Schmidt independence criterion (HSIC) and the constrained-covariance criterion (COCO). Importantly, we also generalize the framework to non-i.i.d. time-varying settings, for which there exist no batch tests. We demonstrate the power of our approaches on both simulated and real data.

翻译：独立测试是一个基本和典型的统计问题,在收集数据之前,当确定抽样规模时,在批量设置中对独立测试进行了广泛研究;然而,执业者往往倾向于采用适应所处理问题复杂性的程序,而不是事先确定抽样规模;理想的做法是,这种程序应当(a) 允许更早停止容易的任务(以及以后更难的任务),从而更好地利用现有资源,以及(b) 不断监测数据,并在收集新数据后有效纳入统计证据,同时控制虚假警报率;众所周知,典型的批量测试不是针对数据流设置的,因为数据偷看后的有效推断需要纠正多次测试,但这类纠正通常导致低功率;在本文件中,我们设计了基于测试原则克服这些缺陷的连续封闭式独立测试(SKITs),我们通过赌注来展示我们的广泛框架,我们利用了诸如Hilbert-Schmidt独立标准(HISIC)和制约性差度标准(CO)等内嵌式依赖性措施的启发,同时控制误差度标准;众所周知,因为数据浏览后的有效推断需要纠正多次测试,但通常导致低功率。

0

相关内容

相互独立的

相互独立的

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

采用pinball loss的MEE算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

ADAMTS-4/5和Aggrecan基因改造的间充质干细胞和软骨细胞在软骨组织工程上的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多元异构信息融合的个性化推荐系统研究

国家自然科学基金

6+阅读 · 2012年12月31日

肠浒苔（Enteromorpha intestinalis）多糖的结构和抗肿瘤活性随季节与地域变化规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

三维流形上的Heegaard分解及其在纽结理论中应用

国家自然科学基金

0+阅读 · 2011年12月31日

组合导航系统中基于混沌、小波和神经网络的信息融合方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

CFRP构件与Al合金构件胶接界面特性与失效机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

Model-based Clustering with Missing Not At Random Data

Arxiv

0+阅读 · 2023年2月15日

Large-scale Multiple Testing: Fundamental Limits of False Discovery Rate Control and Compound Oracle

Arxiv

0+阅读 · 2023年2月14日

A Projection Approach to Local Regression and Clustering with Variable-Dimension Covariates

Arxiv

0+阅读 · 2023年2月14日

Precise Asymptotic Analysis of Deep Random Feature Models

Arxiv

0+阅读 · 2023年2月13日

On Testing and Comparing Fair classifiers under Data Bias

Arxiv

0+阅读 · 2023年2月12日

Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning

Arxiv

0+阅读 · 2023年2月12日

Approximation and Structured Prediction with Sparse Wasserstein Barycenters

Arxiv

0+阅读 · 2023年2月10日

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Arxiv

12+阅读 · 2021年6月9日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Model-based Clustering with Missing Not At Random Data

Arxiv

0+阅读 · 2023年2月15日

Large-scale Multiple Testing: Fundamental Limits of False Discovery Rate Control and Compound Oracle

Arxiv

0+阅读 · 2023年2月14日

A Projection Approach to Local Regression and Clustering with Variable-Dimension Covariates

Arxiv

0+阅读 · 2023年2月14日

Precise Asymptotic Analysis of Deep Random Feature Models

Arxiv

0+阅读 · 2023年2月13日

On Testing and Comparing Fair classifiers under Data Bias

Arxiv

0+阅读 · 2023年2月12日

Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning

Arxiv

0+阅读 · 2023年2月12日

Approximation and Structured Prediction with Sparse Wasserstein Barycenters

Arxiv

0+阅读 · 2023年2月10日

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Arxiv

12+阅读 · 2021年6月9日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

相关基金

采用pinball loss的MEE算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

ADAMTS-4/5和Aggrecan基因改造的间充质干细胞和软骨细胞在软骨组织工程上的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多元异构信息融合的个性化推荐系统研究

国家自然科学基金

6+阅读 · 2012年12月31日

肠浒苔（Enteromorpha intestinalis）多糖的结构和抗肿瘤活性随季节与地域变化规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

三维流形上的Heegaard分解及其在纽结理论中应用

国家自然科学基金

0+阅读 · 2011年12月31日

组合导航系统中基于混沌、小波和神经网络的信息融合方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

CFRP构件与Al合金构件胶接界面特性与失效机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员