With the increasing number of Machine and Deep Learning applications in High Energy Physics, easy access to dedicated infrastructure represents a requirement for fast and efficient R&D. This work explores different types of cloud services to train a Generative Adversarial Network (GAN) in a parallel environment, using the TensorFlow data-parallel strategy. More specifically, we parallelize the training process on multiple GPUs and Google Tensor Processing Units (TPUs) and we compare two approaches: the TensorFlow built-in training logic and a custom training loop, optimised for finer control over the data elements assigned to each GPU worker or TPU core. The quality of the generated data is compared to Monte Carlo simulation. A linear speed-up of the training process is obtained, while most of the physics performance is retained. Additionally, we benchmark these approaches at scale, across multiple GPU nodes, by deploying the training process on different public cloud providers and assessing overall efficiency and cost-effectiveness. The combination of data science, cloud deployment options and the associated economics allows workloads to burst heterogeneously across providers, exploiting the full potential of cloud-based services.
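To make the second approach concrete, the sketch below shows what a custom data-parallel training loop for a GAN can look like with TensorFlow's `tf.distribute` API. It is only an illustration under assumptions: `MirroredStrategy` stands in for the multi-GPU (or, with `TPUStrategy`, the TPU) setup, and the models, dataset and hyperparameters are small placeholders, not the architecture or detector data used in this work.

```python
# Hedged sketch of a custom tf.distribute training loop for a GAN.
# Assumptions: MirroredStrategy (one replica per visible GPU); placeholder
# generator/discriminator and random data instead of the paper's actual setup.
import tensorflow as tf

strategy = tf.distribute.MirroredStrategy()
GLOBAL_BATCH = 64 * strategy.num_replicas_in_sync
LATENT_DIM = 100

with strategy.scope():
    # Placeholder models standing in for the GAN trained in the paper.
    generator = tf.keras.Sequential([
        tf.keras.layers.Dense(256, activation="relu", input_shape=(LATENT_DIM,)),
        tf.keras.layers.Dense(28 * 28, activation="tanh"),
    ])
    discriminator = tf.keras.Sequential([
        tf.keras.layers.Dense(256, activation="relu", input_shape=(28 * 28,)),
        tf.keras.layers.Dense(1),
    ])
    g_opt = tf.keras.optimizers.Adam(1e-4)
    d_opt = tf.keras.optimizers.Adam(1e-4)
    bce = tf.keras.losses.BinaryCrossentropy(
        from_logits=True, reduction=tf.keras.losses.Reduction.NONE)

def train_step(real_images):
    # Per-replica step: per-example losses are averaged over the *global*
    # batch size so gradients aggregate correctly across replicas.
    noise = tf.random.normal([tf.shape(real_images)[0], LATENT_DIM])
    with tf.GradientTape() as g_tape, tf.GradientTape() as d_tape:
        fake_images = generator(noise, training=True)
        real_logits = discriminator(real_images, training=True)
        fake_logits = discriminator(fake_images, training=True)
        d_loss = tf.nn.compute_average_loss(
            bce(tf.ones_like(real_logits), real_logits)
            + bce(tf.zeros_like(fake_logits), fake_logits),
            global_batch_size=GLOBAL_BATCH)
        g_loss = tf.nn.compute_average_loss(
            bce(tf.ones_like(fake_logits), fake_logits),
            global_batch_size=GLOBAL_BATCH)
    g_opt.apply_gradients(zip(g_tape.gradient(g_loss, generator.trainable_variables),
                              generator.trainable_variables))
    d_opt.apply_gradients(zip(d_tape.gradient(d_loss, discriminator.trainable_variables),
                              discriminator.trainable_variables))
    return g_loss, d_loss

@tf.function
def distributed_step(dist_inputs):
    # Custom loop core: strategy.run executes train_step on every replica,
    # then the per-replica losses are reduced to scalars for logging.
    g_loss, d_loss = strategy.run(train_step, args=(dist_inputs,))
    return (strategy.reduce(tf.distribute.ReduceOp.SUM, g_loss, axis=None),
            strategy.reduce(tf.distribute.ReduceOp.SUM, d_loss, axis=None))

# Placeholder dataset; the real detector data used in the paper is not reproduced here.
dataset = tf.data.Dataset.from_tensor_slices(
    tf.random.uniform([1024, 28 * 28])).batch(GLOBAL_BATCH)
dist_dataset = strategy.experimental_distribute_dataset(dataset)

for epoch in range(2):
    for batch in dist_dataset:
        g_loss, d_loss = distributed_step(batch)
```

The "built-in logic" alternative mentioned above would instead compile the same models with `model.compile()` inside `strategy.scope()` and call `model.fit()`, trading the finer per-replica control of the custom loop for simplicity.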