统计系统自自动递退神经网络 (Hierarchical autoregressive neural networks for statistical systems)

It was recently proposed that neural networks could be used to approximate many-dimensional probability distributions that appear e.g. in lattice field theories or statistical mechanics. Subsequently they can be used as variational approximators to asses extensive properties of statistical systems, like free energy, and also as neural samplers used in Monte Carlo simulations. The practical application of this approach is unfortunately limited by its unfavorable scaling both of the numerical cost required for training, and the memory requirements with the system size. This is due to the fact that the original proposition involved a neural network of width which scaled with the total number of degrees of freedom, e.g. $L^2$ in case of a two dimensional $L\times L$ lattice. In this work we propose a hierarchical association of physical degrees of freedom, for instance spins, to neurons which replaces it with the scaling with the linear extent $L$ of the system. We demonstrate our approach on the two-dimensional Ising model by simulating lattices of various sizes up to $128 \times 128$ spins, with time benchmarks reaching lattices of size $512 \times 512$. We observe that our proposal improves the quality of neural network training, i.e. the approximated probability distribution is closer to the target that could be previously achieved. As a consequence, the variational free energy reaches a value closer to its theoretical expectation and, if applied in a Markov Chain Monte Carlo algorithm, the resulting autocorrelation time is smaller. Finally, the replacement of a single neural network by a hierarchy of smaller networks considerably reduces the memory requirements.

翻译：最近有人提议,神经网络可用于近似于多种维度概率分布的神经网络,例如,拉蒂斯实地理论或统计力学。随后,这些网络可以用作变异近似相,用来评估统计系统的广泛特性,如免费能源,以及蒙特卡洛模拟中使用的神经取样器。不幸的是,这一方法的实际应用受到限制,因为它在培训所需的数字成本和系统规模的内存要求方面,其规模不尽人意。这是因为最初的提议涉及一个宽度神经网络,随着自由度的总数而扩大,例如,在两个维度为免费能源时,可以用作2维度的神经网络,例如,2美元为2美元。在这项工作中,我们提议将自由度的等级连成一个等级,例如旋转,用系统线度的缩放值来取代神经系统。我们在二维线度的模型上展示了我们的方法,通过模拟不同尺寸的升至128美元水平的螺旋线,可以降低28美元的内值,在轨值中可以降低一个更精确的内值,在时间范围内,我们提出一个更精确的网络的直径直达5度要求。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日