In-Context Learning (ICL), which formulates target tasks as prompt completion conditioned on in-context demonstrations, has become the prevailing way of utilizing LLMs. In this paper, we first reveal a practical limitation of this typical usage: it cannot scale up with training data because of the context length restriction. Moreover, existing works have shown that ICL suffers from various biases and requires delicate calibration. To address both challenges, we advocate a simple and effective solution, $k$NN Prompting, which first queries the LLM with training data to obtain distributed representations, and then predicts test instances by simply referring to their nearest neighbors. We conduct comprehensive experiments to demonstrate its two-fold superiority: 1) Calibration-Free: $k$NN Prompting does not directly align the LLM output distribution with the task-specific label space, but instead leverages that distribution to align test instances with training instances. It significantly outperforms state-of-the-art calibration-based methods under comparable few-shot scenarios. 2) Beyond-Context: $k$NN Prompting scales effectively with as much training data as is available, continually bringing substantial improvements. The scaling trend holds across shot counts ranging from 2 to 1024 (ten successive doublings) as well as LLM scales ranging from 0.8B to 30B parameters. It successfully bridges data scaling into model scaling and brings new potential to the gradient-free paradigm of LLM deployment. Code is publicly available.
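To make the two-step procedure described above concrete, here is a minimal sketch in Python. It assumes a hypothetical hook `lm_next_token_distribution` that returns an LLM's next-token probability distribution for a given prompt; the prompt template, the KL-divergence distance, and the choice of $k$ are illustrative assumptions rather than the paper's exact configuration.

```python
import numpy as np

def lm_next_token_distribution(prompt: str) -> np.ndarray:
    """Hypothetical LLM call: probability distribution over the vocabulary
    for the next token given `prompt`."""
    raise NotImplementedError

def kl_divergence(p: np.ndarray, q: np.ndarray, eps: float = 1e-12) -> np.ndarray:
    """Row-wise KL(p || q_i) between one query distribution p and anchors q."""
    p, q = p + eps, q + eps
    return np.sum(p * (np.log(p) - np.log(q)), axis=-1)

def knn_prompting_predict(demos, train_texts, train_labels, test_text, k=3):
    # Step 1: query the LLM once per training instance to collect "anchor"
    # output distributions; this is how training data beyond the context
    # limit is absorbed, since each query only carries a few demonstrations.
    anchors = np.stack([
        lm_next_token_distribution(demos + f"Input: {x}\nLabel:")  # illustrative template
        for x in train_texts
    ])
    # Step 2: represent the test instance with the same kind of distribution.
    query = lm_next_token_distribution(demos + f"Input: {test_text}\nLabel:")
    # Step 3: predict by majority vote over the k nearest training anchors,
    # so no calibration of the raw output distribution is required.
    nearest = np.argsort(kl_divergence(query, anchors))[:k]
    votes = [train_labels[i] for i in nearest]
    return max(set(votes), key=votes.count)
```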