设计无数据培训方面的损失,使波尔兹曼分发的流流正常化 (Designing losses for data-free training of normalizing flows on Boltzmann distributions) - 专知论文

会员服务 ·

0

规范化的 · 损失 · 优化器 · MoDELS · 训练数据 ·

2023 年 1 月 13 日

Designing losses for data-free training of normalizing flows on Boltzmann distributions

翻译：设计无数据培训方面的损失,使波尔兹曼分发的流流正常化

Loris Felardos,Jérôme Hénin,Guillaume Charpiat

Generating a Boltzmann distribution in high dimension has recently been achieved with Normalizing Flows, which enable fast and exact computation of the generated density, and thus unbiased estimation of expectations. However, current implementations rely on accurate training data, which typically comes from computationally expensive simulations. There is therefore a clear incentive to train models with incomplete or no data by relying solely on the target density, which can be obtained from a physical energy model (up to a constant factor). For that purpose, we analyze the properties of standard losses based on Kullback-Leibler divergences. We showcase their limitations, in particular a strong propensity for mode collapse during optimization on high-dimensional distributions. We then propose strategies to alleviate these issues, most importantly a new loss function well-grounded in theory and with suitable optimization properties. Using as a benchmark the generation of 3D molecular configurations, we show on several tasks that, for the first time, imperfect pre-trained models can be further optimized in the absence of training data.

翻译：最近,通过正常化流程,产生了波尔兹曼高维分布,从而能够快速准确地计算生成的密度,从而对预期进行公正的估计。然而,目前的实施依赖精确的培训数据,这些数据通常来自计算成本的模拟。因此,有明显的动机,通过仅仅依靠目标密度来培训不完全或没有数据的模型,而目标密度可以从物理能量模型(直至一个不变系数)中获得。为此,我们根据库尔回背-利伯利尔差异分析标准损失的特性。我们展示了它们的局限性,特别是高维分布优化期间模式崩溃的强烈倾向。我们随后提出了缓解这些问题的战略,最重要的是,在理论和适当优化性能方面,一个新的损失功能。我们用3D分子配置的生成作为基准,我们展示了几项任务,即:在缺乏培训数据的情况下,不完善的预先培训模型第一次可以进一步优化。

0

相关内容

规范化的

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

59+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

72+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

50+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

122+阅读 · 2020年11月20日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

87+阅读 · 2020年2月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

168+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

77+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

微凸点中电迁移与Sn晶粒取向相互作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

1+阅读 · 2014年12月31日

CuFe2O4的形貌和尺寸可控合成及催化性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

Riemann-Hilbert方法及若干相关问题的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于C-PolInSAR和PolInSAR的森林垂直结构参数反演

国家自然科学基金

0+阅读 · 2009年12月31日

人泡沫病毒Tas蛋白与细胞Pirh2蛋白的相互作用及功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

Resolving quantitative MRI model degeneracy with machine learning via training data distribution design

Resolving quantitative MRI model degeneracy with machine learning via training data distribution design

Arxiv

0+阅读 · 2023年3月9日

Reward Informed Dreamer for Task Generalization in Reinforcement Learning

Arxiv

0+阅读 · 2023年3月9日

The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

Arxiv

0+阅读 · 2023年3月8日

The Influence Function of Graphical Lasso Estimators

Arxiv

0+阅读 · 2023年3月8日

Unimodal Distributions for Ordinal Regression

Arxiv

0+阅读 · 2023年3月8日

The Parallelism Tradeoff: Limitations of Log-Precision Transformers

Arxiv

0+阅读 · 2023年3月7日

A comparative study on different neural network architectures to model inelasticity

Arxiv

0+阅读 · 2023年3月6日

Distributed Graph Neural Network Training: A Survey

Arxiv

15+阅读 · 2022年11月1日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Generative Adversarial Autoencoder Networks

Arxiv

10+阅读 · 2018年3月23日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

59+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

72+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

50+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

122+阅读 · 2020年11月20日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

87+阅读 · 2020年2月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

168+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

77+阅读 · 2019年10月9日

热门VIP内容

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Resolving quantitative MRI model degeneracy with machine learning via training data distribution design

Resolving quantitative MRI model degeneracy with machine learning via training data distribution design

Arxiv

0+阅读 · 2023年3月9日

Reward Informed Dreamer for Task Generalization in Reinforcement Learning

Arxiv

0+阅读 · 2023年3月9日

The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

Arxiv

0+阅读 · 2023年3月8日

The Influence Function of Graphical Lasso Estimators

Arxiv

0+阅读 · 2023年3月8日

Unimodal Distributions for Ordinal Regression

Arxiv

0+阅读 · 2023年3月8日

The Parallelism Tradeoff: Limitations of Log-Precision Transformers

Arxiv

0+阅读 · 2023年3月7日

A comparative study on different neural network architectures to model inelasticity

Arxiv

0+阅读 · 2023年3月6日

Distributed Graph Neural Network Training: A Survey

Arxiv

15+阅读 · 2022年11月1日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Generative Adversarial Autoencoder Networks

Arxiv

10+阅读 · 2018年3月23日

相关基金

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

微凸点中电迁移与Sn晶粒取向相互作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

1+阅读 · 2014年12月31日

CuFe2O4的形貌和尺寸可控合成及催化性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

Riemann-Hilbert方法及若干相关问题的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于C-PolInSAR和PolInSAR的森林垂直结构参数反演

国家自然科学基金

0+阅读 · 2009年12月31日

人泡沫病毒Tas蛋白与细胞Pirh2蛋白的相互作用及功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员