Chi-Square 远程关系测试 (The Chi-Square Test of Distance Correlation) - 专知论文

会员服务 ·

0

相关系数 · 估计/估计量 · 随机变量 · 统计量 · 样本 ·

2021 年 5 月 14 日

The Chi-Square Test of Distance Correlation

翻译：Chi-Square 远程关系测试

Cencheng Shen,Sambit Panda,Joshua T. Vogelstein

from arxiv, 21 pages, 4 figures, 1 table

Distance correlation has gained much recent attention in the data science community: the sample statistic is straightforward to compute and asymptotically equals zero if and only if independence, making it an ideal choice to discover any type of dependency structure given sufficient sample size. One major bottleneck is the testing process: because the null distribution of distance correlation depends on the underlying random variables and metric choice, it typically requires a permutation test to estimate the null and compute the p-value, which is very costly for large amount of data. To overcome the difficulty, in this paper we propose a chi-square test for distance correlation. Method-wise, the chi-square test is non-parametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel. The test exhibits a similar testing power as the standard permutation test, and can be utilized for K-sample and partial testing. Theory-wise, we show that the underlying chi-square distribution well approximates and dominates the limiting null distribution in upper tail, prove the chi-square test can be valid and universally consistent for testing independence, and establish a testing power inequality with respect to the permutation test.

翻译：数据科学界最近非常关注远程相关关系:抽样统计直截了当,只有在独立的情况下,才能计算零,且无瞬间等于零,使发现任何类型的依赖结构具有足够样本大小的理想选择成为理想的选择。一个主要瓶颈是测试过程:由于距离相关性的无效分布取决于潜在的随机变量和量度选择,因此通常需要一次变换测试来估计纯值和计算 p值,这对大量数据来说成本很高。为了克服这一困难,我们在本文件中提议对距离相关关系进行奇平方测试。从方法上看,奇平方测试是非对准的,非常快速的,并且适用于偏差修正的距离相关关系,使用任何强的负型指标或特性内核。测试显示类似于标准调测试的测试力,可用于K-sample和部分测试。从理论上看,我们表明,基正方分布非常接近并控制着限制上尾部无线分布的测试。从上尾部的测算,证明奇平方测试是有效的,并且符合每个测试的独立性。

0

相关内容

相关系数

【机器学习术语宝典】机器学习中英文术语表

【机器学习术语宝典】机器学习中英文术语表

专知会员服务

61+阅读 · 2020年7月12日

【剑桥大学】图网络的主邻域聚合，Principal Neighbourhood Aggregation for Graph Nets

【剑桥大学】图网络的主邻域聚合，Principal Neighbourhood Aggregation for Graph Nets

专知会员服务

42+阅读 · 2020年4月22日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

专知会员服务

85+阅读 · 2020年2月16日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

论文浅尝 | Global Relation Embedding for Relation Extraction

论文浅尝 | Global Relation Embedding for Relation Extraction

开放知识图谱

12+阅读 · 2019年3月3日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Distributed Adaptive Huber Regression

Distributed Adaptive Huber Regression

Arxiv

0+阅读 · 2021年7月6日

A Theory of the Distortion-Perception Tradeoff in Wasserstein Space

Arxiv

0+阅读 · 2021年7月6日

Least Squares Normalized Cross Correlation

Arxiv

0+阅读 · 2021年7月5日

On The Distribution of Penultimate Activations of Classification Networks

On The Distribution of Penultimate Activations of Classification Networks

Arxiv

0+阅读 · 2021年7月5日

Bayesian two-interval test

Arxiv

0+阅读 · 2021年7月2日

Become a better you: correlation between the change of research direction and the change of scientific performance

Arxiv

0+阅读 · 2021年7月2日

Structure Learning from Related Data Sets with a Hierarchical Bayesian Score

Arxiv

0+阅读 · 2021年7月2日

Asymptotic Analysis of Statistical Estimators related to MultiGraphex Processes under Misspecification

Arxiv

0+阅读 · 2021年7月2日

Two edge-count tests and relevance analysis in k high-dimensional samples

Arxiv

0+阅读 · 2021年7月1日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【机器学习术语宝典】机器学习中英文术语表

【机器学习术语宝典】机器学习中英文术语表

专知会员服务

61+阅读 · 2020年7月12日

【剑桥大学】图网络的主邻域聚合，Principal Neighbourhood Aggregation for Graph Nets

【剑桥大学】图网络的主邻域聚合，Principal Neighbourhood Aggregation for Graph Nets

专知会员服务

42+阅读 · 2020年4月22日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

专知会员服务

85+阅读 · 2020年2月16日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】多目标奖励与偏好优化：理论与算法

《无形的防御者？将定向能武器集成到反无人机框架的机遇与挑战》报告

自主化海军：海上无人系统与未来海战

迈向智能体系统规模化的科学

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

论文浅尝 | Global Relation Embedding for Relation Extraction

论文浅尝 | Global Relation Embedding for Relation Extraction

开放知识图谱

12+阅读 · 2019年3月3日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Distributed Adaptive Huber Regression

Distributed Adaptive Huber Regression

Arxiv

0+阅读 · 2021年7月6日

A Theory of the Distortion-Perception Tradeoff in Wasserstein Space

Arxiv

0+阅读 · 2021年7月6日

Least Squares Normalized Cross Correlation

Arxiv

0+阅读 · 2021年7月5日

On The Distribution of Penultimate Activations of Classification Networks

On The Distribution of Penultimate Activations of Classification Networks

Arxiv

0+阅读 · 2021年7月5日

Bayesian two-interval test

Arxiv

0+阅读 · 2021年7月2日

Become a better you: correlation between the change of research direction and the change of scientific performance

Arxiv

0+阅读 · 2021年7月2日

Structure Learning from Related Data Sets with a Hierarchical Bayesian Score

Arxiv

0+阅读 · 2021年7月2日

Asymptotic Analysis of Statistical Estimators related to MultiGraphex Processes under Misspecification

Arxiv

0+阅读 · 2021年7月2日

Two edge-count tests and relevance analysis in k high-dimensional samples

Arxiv

0+阅读 · 2021年7月1日

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Arxiv

3+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员