Contrastive learning has gradually been applied to learn high-quality unsupervised sentence embeddings. Among previous unsupervised methods, the latest state-of-the-art method, as far as we know, is unsupervised SimCSE (unsup-SimCSE). Unsup-SimCSE uses the InfoNCE loss function in the training stage by pulling semantically similar sentences together and pushing apart dissimilar ones. Theoretically, we expect to use larger batches in unsup-SimCSE to obtain more adequate comparisons among samples and avoid overfitting. However, increasing the batch size does not always lead to improvements, and can even lead to performance degradation when the batch size exceeds a threshold. Through statistical observation, we find that this is probably due to the introduction of low-confidence negative pairs after increasing the batch size. To alleviate this problem, we introduce a simple smoothing strategy upon the InfoNCE loss function, termed Gaussian Smoothing InfoNCE (GS-InfoNCE). Specifically, we add random Gaussian noise vectors as negative samples, which act as a smoothing of the negative sample space. Though simple, the proposed smoothing strategy brings substantial improvements to unsup-SimCSE. We evaluate GS-InfoNCE on the standard semantic text similarity (STS) task. GS-InfoNCE outperforms the state-of-the-art unsup-SimCSE by an average Spearman correlation of 1.38%, 0.72%, 1.17% and 0.28% on the base of BERT-base, BERT-large, RoBERTa-base and RoBERTa-large, respectively.
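To make the smoothing idea concrete, below is a minimal PyTorch-style sketch of an InfoNCE loss extended with random Gaussian noise vectors as extra negatives. The function name, hyperparameter values (`num_noise`, `noise_std`, `tau`), and the exact way the noise is drawn are illustrative assumptions, not the paper's reported implementation or settings.

```python
import torch
import torch.nn.functional as F

def gs_infonce_loss(h1, h2, tau=0.05, num_noise=128, noise_std=1.0):
    """Sketch of an InfoNCE loss smoothed with Gaussian-noise negatives.

    h1, h2: [batch, dim] embeddings of two dropout-augmented views of the
            same sentences, as produced by unsup-SimCSE.
    num_noise, noise_std, tau: hypothetical hyperparameters for illustration.
    """
    batch, dim = h1.shape

    # Cosine similarities between all in-batch pairs: [batch, batch].
    sim = F.cosine_similarity(h1.unsqueeze(1), h2.unsqueeze(0), dim=-1) / tau

    # Random Gaussian noise vectors act as additional negatives that
    # smooth the negative-sample space.
    noise = torch.randn(num_noise, dim, device=h1.device) * noise_std
    noise_sim = F.cosine_similarity(h1.unsqueeze(1), noise.unsqueeze(0), dim=-1) / tau

    # Append the noise similarities to the denominator logits; positives
    # remain on the diagonal of the in-batch similarity matrix.
    logits = torch.cat([sim, noise_sim], dim=1)      # [batch, batch + num_noise]
    labels = torch.arange(batch, device=h1.device)
    return F.cross_entropy(logits, labels)
```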