Distributed Bootstrap for Simultaneous Inference Under High Dimensionality - Zhuanzhi (专知) Papers

Bootstrap · Inference · Extensibility · Performer · Statistic

June 14, 2022

Distributed Bootstrap for Simultaneous Inference Under High Dimensionality

Yang Yu, Shih-Kang Chao, Guang Cheng

From arXiv. To appear in JMLR. arXiv admin note: text overlap with arXiv:2002.08443

We propose a distributed bootstrap method for simultaneous inference on high-dimensional massive data that are stored and processed on many machines. The method produces an $\ell_\infty$-norm confidence region based on a communication-efficient de-biased lasso, and we propose an efficient cross-validation approach to tune the method at every iteration. We theoretically prove a lower bound on the number of communication rounds $\tau_{\min}$ that warrants statistical accuracy and efficiency. Furthermore, $\tau_{\min}$ increases only logarithmically with the number of workers and the intrinsic dimensionality, while remaining nearly invariant to the nominal dimensionality. We test our theory with extensive simulation studies and a variable screening task on a semi-synthetic dataset based on the US Airline On-Time Performance dataset. The code to reproduce the numerical results is available at GitHub: https://github.com/skchao74/Distributed-bootstrap.
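To make the idea of an $\ell_\infty$-norm confidence region concrete, here is a minimal single-machine sketch. It is not the authors' distributed algorithm: it assumes the target is a plain mean vector rather than a de-biased lasso estimate, it uses a Gaussian multiplier bootstrap as a stand-in, and the function name `simultaneous_band_for_mean` and its parameters are hypothetical.

```python
import numpy as np


def simultaneous_band_for_mean(x, alpha=0.05, n_boot=2000, seed=0):
    """Illustrative l_inf-norm confidence band for the mean of the rows of x.

    Assumption: single machine, mean estimation only. This is NOT the
    communication-efficient de-biased lasso procedure from the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = x.shape
    mean = x.mean(axis=0)
    centered = x - mean

    # Gaussian multiplier bootstrap: each draw perturbs the centered rows
    # with i.i.d. N(0, 1) weights and rescales by sqrt(n).
    weights = rng.standard_normal((n_boot, n))
    boot = weights @ centered / np.sqrt(n)  # shape (n_boot, d)

    # The l_inf norm of each draw gives one bootstrap max-statistic.
    max_stats = np.abs(boot).max(axis=1)

    # One shared critical value makes the band valid for all coordinates jointly.
    critical = np.quantile(max_stats, 1 - alpha)
    half_width = critical / np.sqrt(n)
    return mean - half_width, mean + half_width


if __name__ == "__main__":
    data = np.random.default_rng(1).standard_normal((500, 50))
    lower, upper = simultaneous_band_for_mean(data)
    print("half-width shared by all coordinates:", (upper - lower)[0] / 2)
```

Because the critical value comes from the maximum over all coordinates, the band has the same half-width in every coordinate and is wider than a per-coordinate interval at the same level. That extra width is the cost of simultaneous coverage.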
