The analysis of complex physical systems hinges on the ability to extract the relevant degrees of freedom from among the many others. Though much hope is placed in machine learning, it also brings challenges, chief of which is interpretability. It is often unclear what relation, if any, the architecture- and training-dependent learned "relevant" features bear to standard objects of physical theory. Here we report on theoretical results which may help to systematically address this issue: we establish equivalence between the information-theoretic notion of relevance defined in the Information Bottleneck (IB) formalism of compression theory, and the field-theoretic relevance of the Renormalization Group. We show analytically that for statistical physical systems described by a field theory the "relevant" degrees of freedom found using IB compression indeed correspond to operators with the lowest scaling dimensions. We confirm our field theoretic predictions numerically. We study dependence of the IB solutions on the physical symmetries of the data. Our findings provide a dictionary connecting two distinct theoretical toolboxes, and an example of constructively incorporating physical interpretability in applications of deep learning in physics.
翻译:复杂的物理系统的分析取决于从许多其他系统中提取相关自由度的能力。 虽然对机器学习寄予很大希望,但它也带来了挑战,主要在于可解释性。通常不清楚建筑和训练所学的“相关”特征与物理理论的标准对象之间的关系(如果有的话)如何。我们在这里报告可能有助于系统解决这一问题的理论结果:我们建立了信息瓶(IB)压缩理论的正式理论中定义的相关信息理论概念与重新正常化小组的实地理论相关性之间的等同。我们从分析中可以看出,对以实地理论描述的统计物理系统而言,使用IB压缩的“相关”自由度确实与最小尺寸的操作者相对应。我们用数字来证实我们的实地理论预测。我们研究了IB解决方案对数据物理对称性的依赖性。我们的研究提供了将两个截然不同的理论工具箱连接起来的词典,以及将物理解释纳入物理学深层学习应用的建设性实例。