In High Energy Physics, facilities that provide High Performance Computing environments offer an opportunity to efficiently perform the statistical inference required for the analysis of data from the Large Hadron Collider, but they can pose problems for orchestration and efficient scheduling. The compute architectures at these facilities do not easily support the Python compute model, and configuring and scheduling batch jobs for physics often requires expertise in multiple job scheduling services. The combination of the pure-Python libraries pyhf and funcX reduces a common problem in HEP analyses, performing statistical inference with binned models, from a task that would traditionally take multiple hours and bespoke scheduling to an on-demand (fitting) "function as a service" that can execute scalably across workers in just a few minutes, offering reduced time to insight and inference. We demonstrate the execution of a scalable workflow that uses funcX to simultaneously fit 125 signal hypotheses from a published ATLAS search for new physics using pyhf, with a wall time of under 3 minutes. We additionally show performance comparisons for other physics analyses with openly published probability models and argue for a blueprint of fitting-as-a-service systems at HPC centers.
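The fan-out pattern described above — mapping one independent fit per signal hypothesis onto a pool of workers — can be sketched in plain Python. This is a minimal illustration using `concurrent.futures` as a local stand-in for funcX's remote executor; the `fit_hypothesis` function, the signal-point dictionaries, and the dummy CLs values are illustrative assumptions, not the paper's actual code (a real service would call `pyhf.infer.hypotest` on a workspace patched with each signal hypothesis).

```python
from concurrent.futures import ThreadPoolExecutor


def fit_hypothesis(signal_point):
    """Stand-in for a pyhf hypothesis test on one signal point.

    In the real fitting service this body would build the patched
    pyhf workspace for the given signal hypothesis and run
    pyhf.infer.hypotest on it; here a dummy CLs value keeps the
    sketch self-contained and dependency-free.
    """
    mass = signal_point["mass"]
    dummy_cls = 1.0 / (1.0 + mass)  # placeholder, not a real CLs
    return signal_point["name"], dummy_cls


# One entry per signal hypothesis in the scan (illustrative values).
signal_points = [{"name": f"sig_{m}", "mass": m} for m in (100, 200, 300)]

# Fan the independent fits out across workers and gather the results.
# With funcX, pool.map would instead be submissions to a remote endpoint.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = dict(pool.map(fit_hypothesis, signal_points))

print(results)
```

Because each hypothesis test is independent, the wall time of the whole scan approaches the time of the single slowest fit once enough workers are available, which is the property the abstract's 125-point, under-3-minute result relies on.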