不平衡问题: 重新审视神经崩溃的几何 (Imbalance Trouble: Revisiting Neural-Collapse Geometry)

Neural Collapse refers to the remarkable structural properties characterizing the geometry of class embeddings and classifier weights, found by deep nets when trained beyond zero training error. However, this characterization only holds for balanced data. Here we thus ask whether it can be made invariant to class imbalances. Towards this end, we adopt the unconstrained-features model (UFM), a recent theoretical model for studying neural collapse, and introduce Simplex-Encoded-Labels Interpolation (SELI) as an invariant characterization of the neural collapse phenomenon. Specifically, we prove for the UFM with cross-entropy loss and vanishing regularization that, irrespective of class imbalances, the embeddings and classifiers always interpolate a simplex-encoded label matrix and that their individual geometries are determined by the SVD factors of this same label matrix. We then present extensive experiments on synthetic and real datasets that confirm convergence to the SELI geometry. However, we caution that convergence worsens with increasing imbalances. We theoretically support this finding by showing that unlike the balanced case, when minorities are present, ridge-regularization plays a critical role in tweaking the geometry. This defines new questions and motivates further investigations into the impact of class imbalances on the rates at which first-order methods converge to their asymptotically preferred solutions.

翻译：神经断裂是指深网在培训超过零培训错误后发现,班级嵌入和分类重量的几何特征具有显著的结构特性,深网在培训超过零培训错误时发现这种特征,但这种特征只能保留平衡的数据。因此,我们问它是否能够随着阶级失衡而变化。为此,我们采用了不受限制的体格模型(UFM),这是研究神经崩溃的最新理论模型,并引入了简单编码标签(SELI)国际化(SELI),作为神经崩溃现象的一种变化不定的特征。然而,我们告诫说,趋同会加剧不平衡现象。我们从理论上支持这一发现,即不论阶级失衡,嵌入和分类者总是会相互交叉推介一个简单编码的标签矩阵,他们各自的地貌差异是由同一标签矩阵的SVD因素决定的。我们随后对合成和真实的数据集进行了广泛的实验,证实与SELI偏向偏向性地理崩溃现象的趋同性特征。我们从理论上支持这一发现,即与这一平衡的个案不同,当少数群体正在进一步界定其正走向正统度的地平整方法,从而进一步决定其正统的地标定性作用。

相关内容