Distributional semantics has deeply changed in the last decades. First, predict models stole the thunder from traditional count ones, and more recently both of them were replaced in many NLP applications by contextualized vectors produced by Transformer neural language models. Although an extensive body of research has been devoted to Distributional Semantic Model (DSM) evaluation, we still lack a thorough comparison with respect to tested models, semantic tasks, and benchmark datasets. Moreover, previous work has mostly focused on task-driven evaluation, instead of exploring the differences between the way models represent the lexical semantic space. In this paper, we perform a comprehensive evaluation of type distributional vectors, either produced by static DSMs or obtained by averaging the contextualized vectors generated by BERT. First of all, we investigate the performance of embeddings in several semantic tasks, carrying out an in-depth statistical analysis to identify the major factors influencing the behavior of DSMs. The results show that i.) the alleged superiority of predict based models is more apparent than real, and surely not ubiquitous and ii.) static DSMs surpass contextualized representations in most out-of-context semantic tasks and datasets. Furthermore, we borrow from cognitive neuroscience the methodology of Representational Similarity Analysis (RSA) to inspect the semantic spaces generated by distributional models. RSA reveals important differences related to the frequency and part-of-speech of lexical items.
翻译:在过去几十年里,分布式语义发生了深刻的变化。首先,预测模型从传统的计算空间中偷走了雷电,而最近这两种模型都被许多NLP应用中由变异神经语言模型产生的背景矢量替换了。尽管大量研究都致力于分布式语义模型(DSM)评估,但我们仍缺乏对测试模型、语义任务和基准数据集的全面比较。此外,以往的工作大多侧重于任务驱动评估,而不是探索模型代表词汇空间的方式之间的差异。在本文件中,我们对类型分布矢量进行了全面评价,或者由静态DSMs生成,或者通过平均化BERT生成的背景矢量矢量矢量矢量。首先,我们调查了将若干语义任务中嵌入的绩效,进行了深入的统计分析,以确定影响DSMs行为的主要因素。结果显示,基于预测的模型的优越性比真实的要明显,而且肯定不是易变和不可变的。 静止的DSM-S-S-S-S-S-SIM-SL-S-S-SIM-SIM-Slview recal-resmal-viol-viewal imal ex-slview-Lislviewdal-Slational resmal-Slview-Slview-Lisl-s-Slisal-Sl-Slislislviews-S-slview-s-Lisal-slviolviolviolvial-s-Slviolviolviolviolviolviewsmal-smal-smal-smal-smal-sm-smal-s-sm-s-s-s-s-s-sl-sl-sl-smal-smal-I-sl-sl-sm-slvical-sl-smvical-smvical-sal-slvical-sl-sl-sl-l-l-sl-sl-sl-sl-sl-sl-slismismismal-sl)-slismviol-sl-s