MDCLIP: 从未受控制的医疗图像和文本中学习差异 (MedCLIP: Contrastive Learning from Unpaired Medical Images and Text) - 专知论文

会员服务 ·

0

contrastive · Learning · 假阴性 · state-of-the-art · 对比学习 ·

2022 年 10 月 18 日

MedCLIP: Contrastive Learning from Unpaired Medical Images and Text

翻译：MDCLIP: 从未受控制的医疗图像和文本中学习差异

Zifeng Wang,Zhenbang Wu,Dinesh Agarwal,Jimeng Sun

from arxiv, EMNLP 2022

Existing vision-text contrastive learning like CLIP aims to match the paired image and caption embeddings while pushing others apart, which improves representation transferability and supports zero-shot prediction. However, medical image-text datasets are orders of magnitude below the general images and captions from the internet. Moreover, previous methods encounter many false negatives, i.e., images and reports from separate patients probably carry the same semantics but are wrongly treated as negatives. In this paper, we decouple images and texts for multimodal contrastive learning thus scaling the usable training data in a combinatorial magnitude with low cost. We also propose to replace the InfoNCE loss with semantic matching loss based on medical knowledge to eliminate false negatives in contrastive learning. We prove that MedCLIP is a simple yet effective framework: it outperforms state-of-the-art methods on zero-shot prediction, supervised classification, and image-text retrieval. Surprisingly, we observe that with only 20K pre-training data, MedCLIP wins over the state-of-the-art method (using around 200K data). Our code is available at https://github.com/RyanWangZf/MedCLIP.

翻译：类似CLIP(CLIP)这样的现有视觉-文字对比式学习,旨在匹配配对图像和字幕嵌入,同时将其他图像和字幕分开,从而提高代表性可传输性,支持零点预测。然而,医疗图像文本数据集在互联网一般图像和字幕下是数量级的。此外,以往方法遇到许多虚假的负数,即不同病人的图像和报告可能带有相同的语义,但被错误地作为负数处理。在本文中,我们拆分了多式对比学习的图像和文本,从而以低成本的方式在组合规模上扩大可用的培训数据。我们还提议以基于医学知识的语义匹配损失取代InfoNCE损失,以消除对比性学习中的虚假负数。我们证明MedCLIP是一个简单而有效的框架:它比零射预测、监管分类和图像文本检索方面最先进的方法要差。令人惊讶的是,我们观察到,只有20K 培训前数据,MCLIP(M/com)赢得了Rart方法(使用约200K/CLOBZ数据)。我们的代码可在 http://CLA/CLA/CRVZ.

0

相关内容

contrastive

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

电场调制增强型AlGaN/GaN HEMT关键技术研究

国家自然科学基金

0+阅读 · 2017年12月31日

二维碳材料掺杂与修饰的第一性原理研究

国家自然科学基金

1+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型多孔复合材料

国家自然科学基金

0+阅读 · 2013年12月31日

可极化力场的优化及其在药物-蛋白质相互作用研究中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

LIMK1：罗格列酮抑制人胃癌细胞增殖、迁移及侵袭的作用靶点

国家自然科学基金

0+阅读 · 2012年12月31日

Calpain调控胸膜间皮细胞的增殖与迁移在胸膜及胸膜下纤维化发生中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

算子代数上的映射及与群SL(2,R)相关的vN代数

国家自然科学基金

0+阅读 · 2008年12月31日

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

Arxiv

0+阅读 · 2022年12月1日

DC-cycleGAN: Bidirectional CT-to-MR Synthesis from Unpaired Data

Arxiv

0+阅读 · 2022年12月1日

Research on the application of contrastive learning in multi-label text classification

Arxiv

0+阅读 · 2022年12月1日

Self-Supervised Feature Learning for Long-Term Metric Visual Localization

Arxiv

0+阅读 · 2022年11月30日

Extracting Semantic Knowledge from GANs with Unsupervised Learning

Arxiv

0+阅读 · 2022年11月30日

Rethinking Transfer Learning for Medical Image Classification

Arxiv

0+阅读 · 2022年11月29日

CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels

Arxiv

0+阅读 · 2022年11月29日

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

Arxiv

0+阅读 · 2022年11月29日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

相关论文

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

Arxiv

0+阅读 · 2022年12月1日

DC-cycleGAN: Bidirectional CT-to-MR Synthesis from Unpaired Data

Arxiv

0+阅读 · 2022年12月1日

Research on the application of contrastive learning in multi-label text classification

Arxiv

0+阅读 · 2022年12月1日

Self-Supervised Feature Learning for Long-Term Metric Visual Localization

Arxiv

0+阅读 · 2022年11月30日

Extracting Semantic Knowledge from GANs with Unsupervised Learning

Arxiv

0+阅读 · 2022年11月30日

Rethinking Transfer Learning for Medical Image Classification

Arxiv

0+阅读 · 2022年11月29日

CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels

Arxiv

0+阅读 · 2022年11月29日

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

Arxiv

0+阅读 · 2022年11月29日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

相关基金

电场调制增强型AlGaN/GaN HEMT关键技术研究

国家自然科学基金

0+阅读 · 2017年12月31日

二维碳材料掺杂与修饰的第一性原理研究

国家自然科学基金

1+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型多孔复合材料

国家自然科学基金

0+阅读 · 2013年12月31日

可极化力场的优化及其在药物-蛋白质相互作用研究中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

LIMK1：罗格列酮抑制人胃癌细胞增殖、迁移及侵袭的作用靶点

国家自然科学基金

0+阅读 · 2012年12月31日

Calpain调控胸膜间皮细胞的增殖与迁移在胸膜及胸膜下纤维化发生中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

算子代数上的映射及与群SL(2,R)相关的vN代数

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员