FR: 带统一编码器的折叠合理化 (FR: Folded Rationalization with a Unified Encoder) - 专知论文

会员服务 ·

0

预测器/决策函数 · MoDELS · Better · INFORMS · state-of-the-art ·

2022 年 9 月 17 日

FR: Folded Rationalization with a Unified Encoder

翻译：FR: 带统一编码器的折叠合理化

Wei Liu,Haozhao Wang,Jun Wang,Ruixuan Li,Chao Yue,Yuankai Zhang

from arxiv, Accepted at NeurIPS 2022

Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces. However, such a two-phase model may incur the degeneration problem where the predictor overfits to the noise generated by a not yet well-trained generator and in turn, leads the generator to converge to a sub-optimal model that tends to select senseless pieces. To tackle this challenge, we propose Folded Rationalization (FR) that folds the two phases of the rationale model into one from the perspective of text semantic extraction. The key idea of FR is to employ a unified encoder between the generator and predictor, based on which FR can facilitate a better predictor by access to valuable information blocked by the generator in the traditional two-phase model and thus bring a better generator. Empirically, we show that FR improves the F1 score by up to 10.3% as compared to state-of-the-art methods.

翻译：常规工程通常使用两阶段模型,让发电机选择最重要的部件,然后用预测器根据选定的部件作出预测。然而,这种两阶段模型可能会产生退化问题,因为预测器与尚未受过良好训练的发电机产生的噪音相适应,而后又导致发电机聚集到一个往往选择无意义的部件的亚最佳模型上。为了应对这一挑战,我们提议以文字语义提取为视角,将理论模型的两个阶段折叠成一个阶段。FR的关键想法是,在发电机和预测器之间使用一个统一的编码器,使FR能够利用传统的两阶段模型中发电机堵塞的宝贵信息促进更好的预测,从而带来更好的生成器。我们巧妙地表明,FR将F1的得分提高到10.3 %, 与最先进的方法相比,将F1的得分提高到10.3 % 。

0

相关内容

预测器/决策函数

预测器/决策函数

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

脂肪细胞CD36蛋白对mTOR信号通路的影响：高脂诱导胰岛素抵抗的新机制探讨

国家自然科学基金

0+阅读 · 2013年12月31日

大肠杆菌胞嘧碱通透酶CodB结构和功能的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

Shc C蛋白参与罗哌卡因脊髓神经毒性机制的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RNF4的磷酸化修饰在不同细胞周期中对DNA损伤应答的影响与机制

国家自然科学基金

0+阅读 · 2012年12月31日

从头设计蛋白质DS119折叠机制的分子模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

图在曲面上嵌入的分类

国家自然科学基金

0+阅读 · 2011年12月31日

联合应用单磷酸酰脂质（MPL）的WapA防龋DNA疫苗增强粘膜免疫反应的效应及其机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

酪氨酸磷酸化信号转导网络在丙型肝炎病毒NS3致癌机理中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

Arxiv

0+阅读 · 2022年10月27日

Improving Adversarial Robustness with Self-Paced Hard-Class Pair Reweighting

Arxiv

0+阅读 · 2022年10月26日

Autoregressive Structured Prediction with Language Models

Arxiv

0+阅读 · 2022年10月26日

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Arxiv

0+阅读 · 2022年10月26日

A Unified Framework for Pun Generation with Humor Principles

Arxiv

0+阅读 · 2022年10月24日

Generating Natural Language Proofs with Verifier-Guided Search

Arxiv

0+阅读 · 2022年10月21日

Composing Ensembles of Pre-trained Models via Iterative Consensus

Arxiv

0+阅读 · 2022年10月20日

Self-Supervised Learning via Maximum Entropy Coding

Arxiv

13+阅读 · 2022年10月20日

Graph Structure Learning with Variational Information Bottleneck

Arxiv

11+阅读 · 2021年12月16日

Learning Implicit Fields for Generative Shape Modeling

Learning Implicit Fields for Generative Shape Modeling

Arxiv

11+阅读 · 2018年12月6日

VIP会员

文章信息

相关主题

预测器/决策函数

state-of-the-art

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《亚音速导弹空对空拦截建模与控制》

CMU《高级自然语言处理》2025课程

万字长文 | 人工智能引发大规模战争的六种可能路径

《美陆军最新条令：陆基中段防御作战》最新88页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

Arxiv

0+阅读 · 2022年10月27日

Improving Adversarial Robustness with Self-Paced Hard-Class Pair Reweighting

Arxiv

0+阅读 · 2022年10月26日

Autoregressive Structured Prediction with Language Models

Arxiv

0+阅读 · 2022年10月26日

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Arxiv

0+阅读 · 2022年10月26日

A Unified Framework for Pun Generation with Humor Principles

Arxiv

0+阅读 · 2022年10月24日

Generating Natural Language Proofs with Verifier-Guided Search

Arxiv

0+阅读 · 2022年10月21日

Composing Ensembles of Pre-trained Models via Iterative Consensus

Arxiv

0+阅读 · 2022年10月20日

Self-Supervised Learning via Maximum Entropy Coding

Arxiv

13+阅读 · 2022年10月20日

Graph Structure Learning with Variational Information Bottleneck

Arxiv

11+阅读 · 2021年12月16日

Learning Implicit Fields for Generative Shape Modeling

Learning Implicit Fields for Generative Shape Modeling

Arxiv

11+阅读 · 2018年12月6日

相关基金

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

脂肪细胞CD36蛋白对mTOR信号通路的影响：高脂诱导胰岛素抵抗的新机制探讨

国家自然科学基金

0+阅读 · 2013年12月31日

大肠杆菌胞嘧碱通透酶CodB结构和功能的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

Shc C蛋白参与罗哌卡因脊髓神经毒性机制的作用

国家自然科学基金

0+阅读 · 2012年12月31日

RNF4的磷酸化修饰在不同细胞周期中对DNA损伤应答的影响与机制

国家自然科学基金

0+阅读 · 2012年12月31日

从头设计蛋白质DS119折叠机制的分子模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

图在曲面上嵌入的分类

国家自然科学基金

0+阅读 · 2011年12月31日

联合应用单磷酸酰脂质（MPL）的WapA防龋DNA疫苗增强粘膜免疫反应的效应及其机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

酪氨酸磷酸化信号转导网络在丙型肝炎病毒NS3致癌机理中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员