对分布式语义模型的综合比较评价和分析 (A comprehensive comparative evaluation and analysis of Distributional Semantic Models)

Distributional semantics has deeply changed in the last decades. First, predict models stole the thunder from traditional count ones, and more recently both of them were replaced in many NLP applications by contextualized vectors produced by Transformer neural language models. Although an extensive body of research has been devoted to Distributional Semantic Model (DSM) evaluation, we still lack a thorough comparison with respect to tested models, semantic tasks, and benchmark datasets. Moreover, previous work has mostly focused on task-driven evaluation, instead of exploring the differences between the way models represent the lexical semantic space. In this paper, we perform a comprehensive evaluation of type distributional vectors, either produced by static DSMs or obtained by averaging the contextualized vectors generated by BERT. First of all, we investigate the performance of embeddings in several semantic tasks, carrying out an in-depth statistical analysis to identify the major factors influencing the behavior of DSMs. The results show that i.) the alleged superiority of predict based models is more apparent than real, and surely not ubiquitous and ii.) static DSMs surpass contextualized representations in most out-of-context semantic tasks and datasets. Furthermore, we borrow from cognitive neuroscience the methodology of Representational Similarity Analysis (RSA) to inspect the semantic spaces generated by distributional models. RSA reveals important differences related to the frequency and part-of-speech of lexical items.

翻译：在过去几十年里,分布式语义发生了深刻的变化。首先,预测模型从传统的计算空间中偷走了雷电,而最近这两种模型都被许多NLP应用中由变异神经语言模型产生的背景矢量替换了。尽管大量研究都致力于分布式语义模型(DSM)评估,但我们仍缺乏对测试模型、语义任务和基准数据集的全面比较。此外,以往的工作大多侧重于任务驱动评估,而不是探索模型代表词汇空间的方式之间的差异。在本文件中,我们对类型分布矢量进行了全面评价,或者由静态DSMs生成,或者通过平均化BERT生成的背景矢量矢量矢量矢量。首先,我们调查了将若干语义任务中嵌入的绩效,进行了深入的统计分析,以确定影响DSMs行为的主要因素。结果显示,基于预测的模型的优越性比真实的要明显,而且肯定不是易变和不可变的。静止的DSM-S-S-S-S-S-SIM-SL-S-S-SIM-SIM-Slview recal-resmal-viol-viewal imal ex-slview-Lislviewdal-Slational resmal-Slview-Slview-Lisl-s-Slisal-Sl-Slislislviews-S-slview-s-Lisal-slviolviolviolvial-s-Slviolviolviolviolviolviewsmal-smal-smal-smal-smal-sm-smal-s-sm-s-s-s-s-s-sl-sl-sl-smal-smal-I-sl-sl-sm-slvical-sl-smvical-smvical-sal-slvical-sl-sl-sl-l-l-sl-sl-sl-sl-sl-sl-slismismismal-sl)-slismviol-sl-s

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

【图机器学习论文】图嵌入：问题、技术与应用综述（ A Comprehensive Survey of Graph Embedding: Problems, Techniques and Applications）

专知会员服务

52+阅读 · 2019年12月16日

【ECML-PKDD 2019】二部图中通过社区发现算法进行链接预测（Link Prediction via Community Detection inBipartite Multi-Layer Graphs）

专知会员服务

34+阅读 · 2019年12月3日