Previous work has shown that Large Language Models are susceptible to so-called data extraction attacks, which allow an attacker to recover samples contained in the training data, with serious privacy implications. Constructing data extraction attacks is challenging: current attacks are quite inefficient, and there is a significant gap between the extraction capabilities of untargeted attacks and the measured memorization of these models. Targeted attacks have therefore been proposed, which determine whether a given sample from the training data is extractable from a model. In this work, we apply a targeted data extraction attack to the SATML2023 Language Model Training Data Extraction Challenge. Our approach has two steps. In the first step, we maximise the recall of the model and extract the correct suffix for 69% of the samples. In the second step, we apply a classifier-based Membership Inference Attack to the generations; our AutoSklearn classifier achieves a precision of 0.841. The full approach reaches a recall of 0.405 at a 10% false positive rate, an improvement of 34% over the baseline of 0.301.
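To make the two-step pipeline concrete, the following is a minimal illustrative sketch, not the authors' actual implementation: step 1 greedily decodes a candidate suffix for each prefix with the target language model, and step 2 computes simple likelihood features on the generation and feeds them to an AutoSklearn classifier acting as the membership inference attack. The model name, the 50-token suffix length, and the choice of features are assumptions made for illustration.

```python
# Illustrative sketch of a two-step targeted extraction pipeline (assumed details).
import numpy as np
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from autosklearn.classification import AutoSklearnClassifier

MODEL_NAME = "EleutherAI/gpt-neo-1.3B"  # assumption: target model of the challenge
SUFFIX_LEN = 50                          # assumption: suffix length in tokens

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME).eval()


@torch.no_grad()
def generate_suffix(prefix_ids: torch.Tensor) -> torch.Tensor:
    """Step 1: greedily decode a candidate suffix for one tokenized prefix."""
    out = model.generate(prefix_ids, max_new_tokens=SUFFIX_LEN, do_sample=False)
    return out[:, prefix_ids.shape[1]:]


@torch.no_grad()
def suffix_features(prefix_ids: torch.Tensor, suffix_ids: torch.Tensor) -> np.ndarray:
    """Step 2 features (assumed): mean and minimum token log-likelihood of the suffix."""
    ids = torch.cat([prefix_ids, suffix_ids], dim=1)
    logits = model(ids).logits[:, :-1]
    logprobs = torch.log_softmax(logits, dim=-1)
    targets = ids[:, 1:]
    token_lls = logprobs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    suffix_lls = token_lls[:, -suffix_ids.shape[1]:]
    return np.array([suffix_lls.mean().item(), suffix_lls.min().item()])


def fit_mia(features: np.ndarray, labels: np.ndarray) -> AutoSklearnClassifier:
    """Step 2: classifier-based MIA; labels mark whether a generation matched the true suffix."""
    clf = AutoSklearnClassifier(time_left_for_this_task=300)
    clf.fit(features, labels)
    return clf
```

In this sketch, thresholding the classifier's predicted membership probability would yield the recall/false-positive-rate trade-off reported in the abstract.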