平行排序分割的优化回合和样本化复杂度 (Optimal Round and Sample-Size Complexity for Partitioning in Parallel Sorting) - 专知论文

会员服务 ·

0

样本 · 优化器 · 划分 · 输入分布 · state-of-the-art ·

2022 年 7 月 17 日

Optimal Round and Sample-Size Complexity for Partitioning in Parallel Sorting

翻译：平行排序分割的优化回合和样本化复杂度

Wentao Yang,Vipul Harsh,Edgar Solomonik

from arxiv, 21 pages

State-of-the-art parallel sorting algorithms for distributed-memory architectures are based on computing a balanced partitioning via sampling and histogramming. By finding samples that partition the sorted keys into evenly-sized chunks, these algorithms minimize the number of communication rounds required. Histogramming (computing positions of samples) guides sampling, enabling a decrease in the overall number of samples collected. We derive lower and upper bounds on the number of sampling/histogramming rounds required to compute a balanced partitioning. We improve on prior results to demonstrate that when using $p$ processors, $O(\log^* p)$ rounds with $O(p/\log^* p)$ samples per round suffice. We match that with a lower bound that shows that any algorithm with $O(p)$ samples per round requires at least $\Omega(\log^* p)$ rounds. Additionally, we prove the $\Omega(p \log p)$ samples lower bound for one round, thus proving that existing one round algorithms: sample sort, AMS sort and HSS have optimal sample size complexity. To derive the lower bound, we propose a hard randomized input distribution and apply classical results from the distribution theory of runs.

翻译：通过取样和直方图绘制,对分布式模拟结构进行最先进的平行排序算法,其基础是通过取样和直方图绘制来计算平衡的分隔法。通过找到将分类键分割成平均大小块的样本,这些算法最大限度地减少了所需的通信轮数。直方图( 计算样品的位置) 引导取样, 使所采集样品的总数减少。我们从计算平衡分区所需的取样/ 希方位数中得出下方和上方的界限。我们改进了先前的结果,以证明在使用美元处理器时, $O( log) p( $ p) 每轮用$O( p/\ log) p( p) 的样本将分解成平均大小。我们比较了下限的算法, 显示每轮样本中含有$O( p) 的算法至少需要$\ Omega (\ log) p) 。此外, 我们证明了美元( p\log p) 的样本数量要小于一回合, 证明现有的一次圆算法: 样本的样本排序、 AMS 和 HSS 最精确的配置的样本配置的模型分析结果要由我们提出。

0

相关内容

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

107+阅读 · 2021年10月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

猪瘟病毒非结构蛋白对猪巨噬细胞Toll样受体介导天然免疫应答的影响及机制

国家自然科学基金

0+阅读 · 2014年12月31日

Clifford分析中的算子有界性研究及其在高阶边值问题中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

通过消除分子链中的缺陷结构提高PMMA树脂的热稳定性和透光性

国家自然科学基金

0+阅读 · 2012年12月31日

基于EMD的复杂几何模型处理方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

活动星系核中的硅酸盐尘埃

国家自然科学基金

0+阅读 · 2011年12月31日

粤西海域CTW（Coastal Trapped Wave）特征分析与数值模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

Arxiv

0+阅读 · 2022年9月12日

A Differentiable Loss Function for Learning Heuristics in A*

Arxiv

0+阅读 · 2022年9月12日

On predictive inference for intractable models via approximate Bayesian computation

Arxiv

0+阅读 · 2022年9月12日

Bilevel Optimization for Feature Selection in the Data-Driven Newsvendor Problem

Arxiv

0+阅读 · 2022年9月12日

Centroids Matching: an efficient Continual Learning approach operating in the embedding space

Arxiv

0+阅读 · 2022年9月10日

Parallelizing Explicit and Implicit Extrapolation Methods for Ordinary Differential Equations

Arxiv

0+阅读 · 2022年9月10日

ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference

Arxiv

0+阅读 · 2022年9月9日

Universal Solutions of Feedforward ReLU Networks for Interpolations

Arxiv

0+阅读 · 2022年9月9日

What can be sampled locally?

Arxiv

0+阅读 · 2022年9月8日

Inapproximability of a Pair of Forms Defining a Partial Boolean Function

Arxiv

0+阅读 · 2022年9月8日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

107+阅读 · 2021年10月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

Arxiv

0+阅读 · 2022年9月12日

A Differentiable Loss Function for Learning Heuristics in A*

Arxiv

0+阅读 · 2022年9月12日

On predictive inference for intractable models via approximate Bayesian computation

Arxiv

0+阅读 · 2022年9月12日

Bilevel Optimization for Feature Selection in the Data-Driven Newsvendor Problem

Arxiv

0+阅读 · 2022年9月12日

Centroids Matching: an efficient Continual Learning approach operating in the embedding space

Arxiv

0+阅读 · 2022年9月10日

Parallelizing Explicit and Implicit Extrapolation Methods for Ordinary Differential Equations

Arxiv

0+阅读 · 2022年9月10日

ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference

Arxiv

0+阅读 · 2022年9月9日

Universal Solutions of Feedforward ReLU Networks for Interpolations

Arxiv

0+阅读 · 2022年9月9日

What can be sampled locally?

Arxiv

0+阅读 · 2022年9月8日

Inapproximability of a Pair of Forms Defining a Partial Boolean Function

Arxiv

0+阅读 · 2022年9月8日

相关基金

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

猪瘟病毒非结构蛋白对猪巨噬细胞Toll样受体介导天然免疫应答的影响及机制

国家自然科学基金

0+阅读 · 2014年12月31日

Clifford分析中的算子有界性研究及其在高阶边值问题中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

通过消除分子链中的缺陷结构提高PMMA树脂的热稳定性和透光性

国家自然科学基金

0+阅读 · 2012年12月31日

基于EMD的复杂几何模型处理方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

活动星系核中的硅酸盐尘埃

国家自然科学基金

0+阅读 · 2011年12月31日

粤西海域CTW（Coastal Trapped Wave）特征分析与数值模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员