Self-supervised learning (SSL) of energy based models has an intuitive relation to equilibrium thermodynamics because the softmax layer, mapping energies to probabilities, is a Gibbs distribution. However, in what way SSL is a thermodynamic process? We show that some SSL paradigms behave as a thermodynamic composite system formed by representations and self-labels in contact with a nonequilibrium reservoir. Moreover, this system is subjected to usual thermodynamic cycles, such as adiabatic expansion and isochoric heating, resulting in a generalized Gibbs ensemble (GGE). In this picture, we show that learning is seen as a demon that operates in cycles using feedback measurements to extract negative work from the system. As applications, we examine some SSL algorithms using this idea.
翻译:以能源为基础的模型的自我监督学习(SSL)与平衡热力动力学有着直觉关系,因为软模层、将能量与概率相映射成的能量是Gibbs分布的。然而,SSL以什么方式是一个热力过程?我们表明,一些SSL范式是一种热力复合系统,由与无平衡储油层接触的表象和自标组成。此外,这个系统受到通常的热力循环的制约,如非对称扩张和异地取暖,从而形成一个普遍的Gibbs共和体(GGE),在这个图中,我们表明,学习被视为一种恶魔,利用反馈测量来循环运作,从系统中获取负面的工作。作为应用,我们用这个概念来审查一些SSL算法。