Improving Semantic Matching through Dependency-Enhanced Pre-trained Model with Adaptive Fusion

Transformer-based pre-trained models like BERT have made great progress on Semantic Sentence Matching. Meanwhile, dependency prior knowledge has shown general benefits in multiple NLP tasks. However, how to efficiently integrate dependency prior structure into pre-trained models to better model complex semantic matching relations remains an open question. In this paper, we propose Dependency-Enhanced Adaptive Fusion Attention (DAFA), which explicitly introduces dependency structure into pre-trained models and adaptively fuses it with semantic information. Specifically, (i) DAFA first proposes a structure-sensitive paradigm to construct a dependency matrix for calibrating attention weights; (ii) it adopts an adaptive fusion module to integrate the obtained dependency information with the original semantic signals; and (iii) it reconstructs the attention calculation flow and provides better interpretability. Applied to BERT, our method achieves state-of-the-art or competitive performance on 10 public datasets, demonstrating the benefits of adaptively fusing dependency structure in semantic matching tasks.
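The abstract gives no formulas, so the following is only a minimal sketch of the general idea it describes, not the DAFA implementation. It assumes the dependency matrix is a symmetric 0/1 adjacency matrix built from head indices, that "calibrating" means masking attention scores to dependency-linked token pairs, and that fusion is a per-token convex combination. All function names are hypothetical, and the fixed gate values stand in for what would be a learned gate in a real model.

```python
import math

def dependency_matrix(heads):
    """heads[i] is the index of token i's syntactic head, or -1 for the root.
    Returns a symmetric 0/1 matrix with self-loops (an assumed construction)."""
    n = len(heads)
    dep = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for child, head in enumerate(heads):
        if head >= 0:
            dep[child][head] = dep[head][child] = 1.0
    return dep

def softmax(row):
    peak = max(row)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(queries, keys, mask=None):
    """Scaled dot-product attention weights. If a 0/1 mask is given,
    token pairs with mask 0 receive (numerically) zero weight."""
    scale = math.sqrt(len(queries[0]))
    weights = []
    for i, q in enumerate(queries):
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        if mask is not None:
            scores = [s if mask[i][j] > 0 else -1e9 for j, s in enumerate(scores)]
        weights.append(softmax(scores))
    return weights

def attend(weights, values):
    """Weighted sum of value vectors for each query position."""
    width = len(values[0])
    return [[sum(w * v[c] for w, v in zip(row, values)) for c in range(width)]
            for row in weights]

def fuse(semantic, structural, gates):
    """Per-token convex combination: gate * semantic + (1 - gate) * structural.
    The gates are passed in as constants here (assumption); a model would learn them."""
    return [[g * s + (1.0 - g) * t for s, t in zip(sem, dep)]
            for g, sem, dep in zip(gates, semantic, structural)]

# Toy usage: three tokens, token 1 is the root and heads tokens 0 and 2.
heads = [1, -1, 1]
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy token representations
dep = dependency_matrix(heads)
semantic = attend(attention_weights(x, x), x)              # plain attention
structural = attend(attention_weights(x, x, mask=dep), x)  # dependency-masked
fused = fuse(semantic, structural, gates=[0.5, 0.5, 0.5])
```

In this toy setup, tokens 0 and 2 are not linked in the dependency tree, so the masked branch gives their pair zero attention while the plain branch still lets them interact; the gate decides how much of each view reaches the output.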

