Contrastive learning is an efficient approach to self-supervised representation learning. Although recent studies have made progress in the theoretical understanding of contrastive learning, the investigation of how to characterize the clusters of the learned representations is still limited. In this paper, we aim to elucidate this characterization from theoretical perspectives. To this end, we consider a kernel-based contrastive learning framework termed Kernel Contrastive Learning (KCL), where kernel functions play an important role when applying our theoretical results to other frameworks. We introduce a formulation of the similarity structure of learned representations from a statistical dependency viewpoint, and investigate the theoretical properties of the kernel-based contrastive loss through this formulation. We first prove that the formulation characterizes the structure of representations learned with the kernel-based contrastive learning framework. We then show a new upper bound on the classification error of a downstream task, which indicates that our theory is consistent with the empirical success of contrastive learning. We also establish a generalization error bound for KCL. Finally, we provide a guarantee for the generalization ability of KCL to the downstream classification task via a surrogate bound.
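To make the kernel-based contrastive objective concrete, the following is a minimal illustrative sketch, not the exact loss analyzed in the paper: it assumes a Gaussian (RBF) kernel over embeddings of two augmented views, rewarding high kernel similarity for positive pairs and penalizing the average kernel similarity over negative pairs. The function names and the choice of kernel are illustrative assumptions.

```python
import torch

def gaussian_kernel(u, v, gamma=1.0):
    # Gaussian (RBF) kernel between two batches of embeddings.
    # u: (N, d), v: (M, d) -> (N, M) kernel matrix. Illustrative choice of kernel.
    sq_dists = torch.cdist(u, v, p=2) ** 2
    return torch.exp(-gamma * sq_dists)

def kernel_contrastive_loss(z, z_pos, gamma=1.0):
    """Illustrative kernel-based contrastive objective (not the paper's exact KCL loss).

    z, z_pos: (N, d) embeddings of two augmented views of the same N inputs.
    Encourages large kernel similarity for positive pairs (diagonal entries)
    and small average kernel similarity for negative pairs (off-diagonal entries).
    """
    K = gaussian_kernel(z, z_pos, gamma)               # (N, N) kernel matrix
    pos = K.diag().mean()                              # positive-pair similarity
    n = K.size(0)
    neg = (K.sum() - K.diag().sum()) / (n * (n - 1))   # negative-pair similarity
    return -pos + neg
```

Under this sketch, substituting a different kernel (e.g., a linear kernel on normalized embeddings) recovers objectives close to standard contrastive losses, which is one way kernel functions can connect the analysis to other frameworks.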