AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

May 16, 2023

Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, Nan Duan, Weizhu Chen

Diffusion models have attracted significant attention in image generation because of their exceptional performance. Their success has recently been extended to text generation by generating all tokens in a sequence concurrently. However, natural language exhibits a far more pronounced sequential dependency than images, and most existing language models are trained with a left-to-right auto-regressive approach. To account for this inherent sequential characteristic of natural language, we introduce Auto-Regressive Diffusion (AR-Diffusion). AR-Diffusion ensures that the generation of tokens on the right depends on the tokens already generated on the left, a mechanism achieved by employing a dynamic number of denoising steps that varies with token position. As a result, tokens on the left undergo fewer denoising steps than those on the right, so they are generated earlier and subsequently influence the generation of tokens on the right. In a series of experiments on text generation tasks including text summarization, machine translation, and common sense generation, AR-Diffusion clearly outperforms existing diffusion language models and can be $100\times\sim600\times$ faster while achieving comparable results. Our code will be publicly released.
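The position-dependent denoising idea can be illustrated with a toy schedule. The sketch below is not the schedule used in the paper, which the abstract does not specify. It assumes a simple linear lag: every token starts at the maximum noise level, and each position further to the right begins denoising a fixed number of global steps later. The names `token_timesteps`, `steps_until_clean`, `max_t`, and `lag` are hypothetical and introduced only for illustration.

```python
# Toy illustration of position-dependent denoising (an assumed linear-lag
# schedule, NOT the schedule from the AR-Diffusion paper).

def token_timesteps(k, seq_len, max_t, lag):
    """Return the noise level (timestep) of every token at global step k.

    Assumption: token n starts at max_t and only begins to be denoised
    after lag * n global steps, then loses one noise level per step.
    A timestep of 0 means the token is fully denoised.
    """
    return [min(max_t, max(0, max_t - k + lag * n)) for n in range(seq_len)]


def steps_until_clean(n, max_t, lag):
    """Number of global steps before token n reaches timestep 0."""
    return max_t + lag * n


if __name__ == "__main__":
    seq_len, max_t, lag = 4, 10, 2
    for k in (0, 5, 10, 16):
        print(k, token_timesteps(k, seq_len, max_t, lag))
    # Left tokens are finished after fewer global steps than right tokens.
    print([steps_until_clean(n, max_t, lag) for n in range(seq_len)])
```

Under this assumed schedule, the leftmost token reaches timestep 0 first, so a denoising network conditioned on the whole sequence sees clean left context while the right tokens are still noisy. This is the left-to-right dependency the abstract describes.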
