Sparse Uniformity Testing - 专知 (Zhuanzhi) Papers


February 16, 2022

Sparse Uniformity Testing


Bhaswar B. Bhattacharya, Rajarshi Mukherjee

From arXiv, 33 pages, 1 figure

In this paper we consider the uniformity testing problem for high-dimensional discrete distributions (multinomials) under sparse alternatives. More precisely, we derive sharp detection thresholds for testing, based on $n$ samples, whether a discrete distribution supported on $d$ elements differs from the uniform distribution only in $s$ (out of the $d$) coordinates and is $\varepsilon$-far (in total variation distance) from uniformity. Our results reveal various interesting phase transitions which depend on the interplay of the sample size $n$ and the signal strength $\varepsilon$ with the dimension $d$ and the sparsity level $s$. For instance, if the sample size is less than a threshold (which depends on $d$ and $s$), then all tests are asymptotically powerless, irrespective of the magnitude of the signal strength. On the other hand, if the sample size is above the threshold, then the detection boundary undergoes a further phase transition depending on the signal strength. Here, a $\chi^2$-type test attains the detection boundary in the dense regime, whereas in the sparse regime a Bonferroni correction of two maximum-type tests and a version of the Higher Criticism test is optimal up to sharp constants. These results combined provide a complete description of the phase diagram for the sparse uniformity testing problem across all regimes of the parameters $n$, $d$, and $s$. One of the challenges in dealing with multinomials is that the parameters are always constrained to lie in the simplex. This results in the aforementioned two-layered phase transition, a new phenomenon which does not arise in classical high-dimensional sparse testing problems.
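To make the two kinds of tests named in the abstract concrete, here is a minimal sketch in Python. It is not the authors' procedure: the exact statistics, thresholds, and the Higher Criticism component from the paper are not reproduced. The sketch assumes a Pearson $\chi^2$ statistic for the dense regime and, for the sparse regime, two maximum-type statistics (the largest and the smallest cell count) combined by a Bonferroni correction. All function names, the Monte Carlo calibration, and the parameter values are illustrative assumptions.

```python
import numpy as np


def chi2_stat(counts):
    """Pearson chi-square statistic against the uniform distribution."""
    counts = np.asarray(counts, dtype=float)
    d = counts.shape[-1]
    expected = counts.sum(axis=-1, keepdims=True) / d
    return ((counts - expected) ** 2 / expected).sum(axis=-1)


def mc_pvalue(observed, null_values):
    """Monte Carlo p-value: share of null draws at least as extreme."""
    return (1 + np.sum(null_values >= observed)) / (1 + len(null_values))


def uniformity_tests(counts, alpha=0.05, reps=2000, seed=0):
    """Chi-square test plus a Bonferroni-corrected pair of max-type tests.

    Assumption: null distributions are simulated rather than taken from
    the asymptotic theory in the paper.
    """
    rng = np.random.default_rng(seed)
    counts = np.asarray(counts)
    n, d = int(counts.sum()), counts.shape[0]

    # Simulate cell counts under the uniform null: shape (reps, d).
    null = rng.multinomial(n, np.full(d, 1.0 / d), size=reps)

    p_chi2 = mc_pvalue(chi2_stat(counts), chi2_stat(null))
    # Two one-sided max-type tests: unusually large or unusually small cells.
    p_max = mc_pvalue(counts.max(), null.max(axis=1))
    p_min = mc_pvalue(-counts.min(), -null.min(axis=1))

    return {
        "p_chi2": float(p_chi2),
        "p_max": float(p_max),
        "p_min": float(p_min),
        "reject_chi2": bool(p_chi2 <= alpha),
        # Bonferroni: each of the two max-type tests runs at level alpha / 2.
        "reject_bonferroni": bool(min(p_max, p_min) <= alpha / 2),
    }


if __name__ == "__main__":
    # Hypothetical sparse alternative: s = 2 perturbed coordinates out of d.
    d, n, delta = 200, 20000, 0.004
    p = np.full(d, 1.0 / d)
    p[0] += delta  # one coordinate above uniform
    p[1] -= delta  # one coordinate below; total variation distance = delta
    sample = np.random.default_rng(1).multinomial(n, p)
    print(uniformity_tests(sample))
```

Simulating the null keeps the sketch valid when $n/d$ is small, where the usual $\chi^2_{d-1}$ approximation can be poor. The simulation cost grows with `reps` and $d$, so this is meant for illustration rather than large-scale use.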

