To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers $13$ common text generation tasks and their corresponding $83$ datasets, and further incorporates $45$ PLMs spanning general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight PLMs. We also implement $4$ efficient training strategies and provide $4$ generation objectives for pre-training new PLMs from scratch. To be unified, we design interfaces that support the entire research pipeline (from data loading to training and evaluation), ensuring that each step can be fulfilled in a unified way. Despite its rich functionality, our library is easy to use, via either a friendly Python API or the command line. To validate the effectiveness of our library, we conduct extensive experiments and exemplify four types of research scenarios. The project is released at https://github.com/RUCAIBox/TextBox.