Memorization, or the tendency of large language models (LLMs) to output entire sequences from their training data verbatim, is a key concern for safely deploying language models. In particular, it is vital to minimize a model's memorization of sensitive datapoints such as those containing personally identifiable information (PII). The prevalence of such undesirable memorization can pose issues for model trainers, and may even require discarding an otherwise functional model. We therefore seek to predict which sequences will be memorized before a large model's full training run is complete, by extrapolating the memorization behavior of lower-compute trial runs. We measure memorization in the Pythia model suite and find that intermediate checkpoints are better predictors of a model's memorization behavior than smaller fully-trained models. We additionally present novel findings on the distribution of memorization scores across models and data.