带有普通等式的低端空间可合成文本控制操作 (Composable Text Control Operations in Latent Space with Ordinary Differential Equations) - 专知论文

会员服务 ·

0

控制器 · 潜在 · 操作 · 向量化 · 语言模型化 ·

2022 年 8 月 1 日

Composable Text Control Operations in Latent Space with Ordinary Differential Equations

翻译：带有普通等式的低端空间可合成文本控制操作

Guangyi Liu,Zeyu Feng,Yuan Gao,Zichao Yang,Xiaodan Liang,Junwei Bao,Xiaodong He,Shuguang Cui,Zhen Li,Zhiting Hu

from arxiv, 20 Pages, Code: https://github.com/guangyliu/LatentOps

Real-world text applications often involve composing a wide range of text control operations, such as editing the text w.r.t. an attribute, manipulating keywords and structure, and generating new text of desired properties. Prior work typically learns/finetunes a language model (LM) to perform individual or specific subsets of operations. Recent research has studied combining operations in a plug-and-play manner, often with costly search or optimization in the complex sequence space. This paper proposes a new efficient approach for composable text operations in the compact latent space of text. The low-dimensionality and differentiability of the text latent vector allow us to develop an efficient sampler based on ordinary differential equations (ODEs) given arbitrary plug-in operators (e.g., attribute classifiers). By connecting pretrained LMs (e.g., GPT2) to the latent space through efficient adaption, we then decode the sampled vectors into desired text sequences. The flexible approach permits diverse control operators (sentiment, tense, formality, keywords, etc.) acquired using any relevant data from different domains. Experiments show that composing those operators within our approach manages to generate or edit high-quality text, substantially improving over previous methods in terms of generation quality and efficiency.

翻译：现实世界文本应用通常涉及一系列广泛的文本控制操作,例如编辑文本 w.r.t. 属性、操纵关键词和结构,以及生成想要的属性的新文本。先前的工作通常会学习/ finetunes 一种语言模型(LM) 来执行单个或特定操作子集。最近的研究研究以插接和播放方式将操作组合在一起,通常在复杂的序列空间中花费昂贵的搜索或优化。本文提出了在文本的紧凑潜质空间中进行可配置文本操作的一种新的有效方法。文本潜在矢量的低维度和不同性使我们能够根据普通差异方程式(ODEs) 开发一个高效的样本, 并基于任意插插接操作器( 例如, 属性分类器) 。通过将预培训的LM( GPT2) 与潜在空间连接起来, 我们随后将抽样矢量的矢量解成理想的文本序列。灵活的方法允许使用来自不同领域的任何相关数据获得的多种控制操作者( 流、时间、、、、格式、关键字等) 能够开发出一个基于以往领域任何相关数据的高效版本的高效操作者, 。实验将这些操作者管理到对前质量进行高质量的文本进行重大修改。

0

相关内容

控制器

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

miR-5591靶向AGER/ROS/JNK抑制MSCs氧化应激损伤在糖尿病创面修复中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

高维生物数据的PLS特征选择方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

prohibitin与PIG3基因启动子区（TGYCC）n序列结合并调控其转录的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

短时空PS InSAR形变模型构建与稳健解算方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

西北主要城镇区域与PREE的动态模拟、空间整合研究

国家自然科学基金

0+阅读 · 2011年12月31日

Prokineticin 2 调节SCN神经元的电生理活动及昼夜节律行为

国家自然科学基金

0+阅读 · 2009年12月31日

光子晶体表面等离子体MEMS红外辐射源的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Phase function methods for second order linear ordinary differential equations with turning points

Arxiv

0+阅读 · 2022年9月29日

Fast Inference for Quantile Regression with Millions of Observations

Arxiv

0+阅读 · 2022年9月29日

Momentum Gradient Descent Federated Learning with Local Differential Privacy

Arxiv

0+阅读 · 2022年9月28日

Efficient Non-Parametric Optimizer Search for Diverse Tasks

Efficient Non-Parametric Optimizer Search for Diverse Tasks

Arxiv

0+阅读 · 2022年9月27日

Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples

Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples

Arxiv

0+阅读 · 2022年9月27日

Polynomial time computable functions over the reals characterized using discrete ordinary differential equations

Arxiv

0+阅读 · 2022年9月27日

OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

Arxiv

0+阅读 · 2022年9月27日

FedOBD: Opportunistic Block Dropout for Efficiently Training Large-scale Neural Networks through Federated Learning

FedOBD: Opportunistic Block Dropout for Efficiently Training Large-scale Neural Networks through Federated Learning

Arxiv

0+阅读 · 2022年9月27日

On Neural Differential Equations

Arxiv

23+阅读 · 2022年2月4日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

Phase function methods for second order linear ordinary differential equations with turning points

Arxiv

0+阅读 · 2022年9月29日

Fast Inference for Quantile Regression with Millions of Observations

Arxiv

0+阅读 · 2022年9月29日

Momentum Gradient Descent Federated Learning with Local Differential Privacy

Arxiv

0+阅读 · 2022年9月28日

Efficient Non-Parametric Optimizer Search for Diverse Tasks

Efficient Non-Parametric Optimizer Search for Diverse Tasks

Arxiv

0+阅读 · 2022年9月27日

Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples

Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples

Arxiv

0+阅读 · 2022年9月27日

Polynomial time computable functions over the reals characterized using discrete ordinary differential equations

Arxiv

0+阅读 · 2022年9月27日

OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

Arxiv

0+阅读 · 2022年9月27日

FedOBD: Opportunistic Block Dropout for Efficiently Training Large-scale Neural Networks through Federated Learning

FedOBD: Opportunistic Block Dropout for Efficiently Training Large-scale Neural Networks through Federated Learning

Arxiv

0+阅读 · 2022年9月27日

On Neural Differential Equations

Arxiv

23+阅读 · 2022年2月4日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

miR-5591靶向AGER/ROS/JNK抑制MSCs氧化应激损伤在糖尿病创面修复中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

高维生物数据的PLS特征选择方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

prohibitin与PIG3基因启动子区（TGYCC）n序列结合并调控其转录的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

短时空PS InSAR形变模型构建与稳健解算方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

西北主要城镇区域与PREE的动态模拟、空间整合研究

国家自然科学基金

0+阅读 · 2011年12月31日

Prokineticin 2 调节SCN神经元的电生理活动及昼夜节律行为

国家自然科学基金

0+阅读 · 2009年12月31日

光子晶体表面等离子体MEMS红外辐射源的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员