Recent years have witnessed the effectiveness of contrastive learning in obtaining the representation of dataset that is useful in interpretation and downstream tasks. However, the mechanism that describes this effectiveness have not been thoroughly analyzed, and many studies have been conducted to investigate the data structures captured by contrastive learning. In particular, the recent study of \citet{content_isolate} has shown that contrastive learning is capable of decomposing the data space into the space that is invariant to all augmentations and its complement. In this paper, we introduce the notion of invariance-adapted latent space that decomposes the data space into the intersections of the invariant spaces of each augmentation and their complements. This decomposition generalizes the one introduced in \citet{content_isolate}, and describes a structure that is analogous to the frequencies in the harmonic analysis of a group. We experimentally show that contrastive learning with lasso-type metric can be used to find an invariance-adapted latent space, thereby suggesting a new potential for the contrastive learning. We also investigate when such a latent space can be identified up to mixings within each component.
翻译:近些年来,在获取有助于解释和下游任务的数据集代表方面,对比式学习取得了效果。然而,描述这种效果的机制还没有经过彻底分析,而且为了调查通过对比式学习所捕捉的数据结构进行了许多研究。特别是,最近对\citet{content_solarate}的研究显示,对比式学习能够将数据空间分解到与所有增强及其补充性不相容的空间。在本文中,我们引入了将数据空间分解到每个增强及其补充性不变化空间交叉点的不适应性潜层空间的概念。这种分解将“citet{content_solarate}”中引入的数据结构概括化,并描述一个类似于一个组的调频分析结构。我们实验性地显示,与lassso-type测量器的对比性学习可以用来找到一种差异式的潜在空间,从而为对比性学习提供新的潜力。我们还调查了每个组成部分内部何时可以混合这种潜在空间。