Language models have demonstrated the ability to generate highly fluent text; however, it remains unclear whether their output retains coherent high-level structure (e.g., story progression). Here, we propose applying a statistical tool, model criticism in latent space, to evaluate the high-level structure of generated text. Model criticism compares the distributions of real and generated data in a latent space obtained according to a posited generative process. Different generative processes identify specific failure modes of the underlying model. We perform experiments on three representative aspects of high-level discourse -- coherence, coreference, and topicality -- and find that transformer-based language models are able to capture topical structures but have a harder time maintaining structural coherence or modeling coreference.