SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders - Zhuanzhi Papers

Learning · MoDELS · Extensibility · Machine learning modeling · Feature extractors

July 1, 2022

SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders

Tianshuo Cong, Xinlei He, Yang Zhang

Self-supervised learning is an emerging machine learning (ML) paradigm. Whereas supervised learning relies on high-quality labeled datasets to achieve good performance, self-supervised learning uses unlabeled datasets to pre-train powerful encoders, which can then serve as feature extractors for various downstream tasks. Because pre-training consumes huge amounts of data and computational resources, the encoders themselves become valuable intellectual property of the model owner. Recent research has shown that an ML model's copyright is threatened by model stealing attacks, which aim to train a surrogate model that mimics the behavior of a given model. We empirically show that pre-trained encoders are highly vulnerable to model stealing attacks. However, most current copyright protection algorithms, such as watermarking, concentrate on classifiers, and the intrinsic challenges of protecting a pre-trained encoder's copyright remain largely unstudied. We fill the gap by proposing SSLGuard, the first watermarking algorithm for pre-trained encoders. Given a clean pre-trained encoder, SSLGuard injects a watermark into it and outputs a watermarked version. The shadow training technique is also applied to preserve the watermark under potential model stealing attacks. Our extensive evaluation shows that SSLGuard is effective in watermark injection and verification, and is robust against model stealing and other watermark removal attacks such as input noising, output perturbing, overwriting, model pruning, and fine-tuning.
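To make the threat model concrete, the following is a minimal toy sketch of a model stealing attack in the sense the abstract describes: a surrogate is trained to mimic the outputs of a given encoder using only query access. It is not the attack or the encoders evaluated in the paper. The linear "encoder", the dimensions, the least-squares fit, and all names (`target_encoder`, `W_surrogate`, `cosine_rows`) are assumptions chosen for illustration.

```python
import numpy as np


def cosine_rows(a, b):
    """Row-wise cosine similarity between two (n, d) arrays."""
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1)
    return num / den


rng = np.random.default_rng(0)

# Hypothetical victim encoder: a fixed linear map from 32-d inputs
# to 8-d embeddings. Real pre-trained encoders are deep networks.
W_target = rng.normal(size=(32, 8))


def target_encoder(x):
    return x @ W_target


# The attacker has no labels and no weights, only query access:
# it sends its own unlabeled inputs and records the embeddings.
queries = rng.normal(size=(500, 32))
embeddings = target_encoder(queries)

# Surrogate "training": least-squares fit so that
# queries @ W_surrogate approximates the recorded embeddings.
W_surrogate, *_ = np.linalg.lstsq(queries, embeddings, rcond=None)

# Measure how closely the surrogate mimics the target on unseen inputs.
test_x = rng.normal(size=(100, 32))
agreement = cosine_rows(test_x @ W_surrogate, target_encoder(test_x)).mean()
print(f"mean cosine similarity on held-out inputs: {agreement:.4f}")
```

In this toy setting the surrogate reproduces the target's embeddings almost exactly, which illustrates why an owner may want a watermark that carries over to surrogates trained this way.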
