GeneGPT: Teaching Large Language Models to Use NCBI Web APIs - Zhuanzhi Papers


API · Large language models · Web · URL · Language models

April 19, 2023

GeneGPT: Teaching Large Language Models to Use NCBI Web APIs


Qiao Jin,Yifan Yang,Qingyu Chen,Zhiyong Lu

From arXiv; work in progress.

In this paper, we present GeneGPT, a novel method for teaching large language models (LLMs) to use the Web Application Programming Interfaces (APIs) of the National Center for Biotechnology Information (NCBI) and answer genomics questions. Specifically, we prompt Codex (code-davinci-002) to solve the GeneTuring tests with few-shot URL requests of NCBI API calls as demonstrations for in-context learning. During inference, we stop the decoding once a call request is detected and make the API call with the generated URL. We then append the raw execution results returned by NCBI APIs to the generated texts and continue the generation until the answer is found or another API call is detected. Our preliminary results show that GeneGPT achieves state-of-the-art results on three out of four one-shot tasks and four out of five zero-shot tasks in the GeneTuring dataset. Overall, GeneGPT achieves a macro-average score of 0.76, which is much higher than retrieval-augmented LLMs such as the New Bing (0.44), biomedical LLMs such as BioMedLM (0.08) and BioGPT (0.04), as well as other LLMs such as GPT-3 (0.16) and ChatGPT (0.12).
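The inference procedure described in the abstract (stop decoding when a call request appears, run the generated URL, append the raw result, resume generation) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `[URL]->` call marker, the function name `answer_with_api_calls`, the `max_calls` limit, and the stub `generate` and `call_api` callables are all assumptions made for the example. In practice `generate` would wrap an LLM completion call and `call_api` would issue an HTTP request to an NCBI endpoint.

```python
import re
from typing import Callable

# Assumed marker convention (hypothetical): the model writes a call as "[<url>]->".
CALL_PATTERN = re.compile(r"\[(https?://[^\]\s]+)\]->$")


def answer_with_api_calls(
    prompt: str,
    generate: Callable[[str], str],
    call_api: Callable[[str], str],
    max_calls: int = 5,
) -> str:
    """Alternate between decoding and API calls until no call is pending."""
    text = prompt
    calls = 0
    while True:
        # `generate` is assumed to stop right after a call marker
        # or once it has produced the final answer.
        text += generate(text)
        match = CALL_PATTERN.search(text)
        if match is None or calls >= max_calls:
            return text
        # Run the generated URL and append the raw result to the context.
        text += "[" + call_api(match.group(1)) + "]\n"
        calls += 1


# Stubs standing in for the language model and the web API (both hypothetical).
def fake_generate(context: str) -> str:
    if "->[" not in context:
        return "[https://example.org/esearch?term=ALIAS]->"
    return "Answer: SYMBOL"


def fake_call_api(url: str) -> str:
    return "stub record for " + url.split("term=")[1]


result = answer_with_api_calls("Question: q\n", fake_generate, fake_call_api)
print(result)
```

Passing the model and the API client in as callables keeps the loop itself free of network and model dependencies, so the control flow can be checked with stubs before wiring in real services.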



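API

An application programming interface (API) is a contract that defines how the different components of a software system connect to one another.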

