在双曲空格中检测 BERT (Probing BERT in Hyperbolic Spaces)

Recently, a variety of probing tasks are proposed to discover linguistic properties learned in contextualized word embeddings. Many of these works implicitly assume these embeddings lay in certain metric spaces, typically the Euclidean space. This work considers a family of geometrically special spaces, the hyperbolic spaces, that exhibit better inductive biases for hierarchical structures and may better reveal linguistic hierarchies encoded in contextualized representations. We introduce a Poincare probe, a structural probe projecting these embeddings into a Poincare subspace with explicitly defined hierarchies. We focus on two probing objectives: (a) dependency trees where the hierarchy is defined as head-dependent structures; (b) lexical sentiments where the hierarchy is defined as the polarity of words (positivity and negativity). We argue that a key desideratum of a probe is its sensitivity to the existence of linguistic structures. We apply our probes on BERT, a typical contextualized embedding model. In a syntactic subspace, our probe better recovers tree structures than Euclidean probes, revealing the possibility that the geometry of BERT syntax may not necessarily be Euclidean. In a sentiment subspace, we reveal two possible meta-embeddings for positive and negative sentiments and show how lexically-controlled contextualization would change the geometric localization of embeddings. We demonstrate the findings with our Poincare probe via extensive experiments and visualization. Our results can be reproduced at https://github.com/FranxYao/PoincareProbe.

翻译：最近,我们提出了各种调查任务,以发现在背景化的字嵌入中学习的语言特性。许多这类工作都隐含地假定这些嵌入存在于某些测量空间,典型的欧clidean空间。这项工作考虑的是几何特殊空间的组合,即双曲空间,这对等级结构有更好的感化偏向,并可能更好地揭示在背景化的表述中编码的语言等级。我们引入了诗意探测器,一个结构探测器,将这些嵌入有明确定义的等级结构的Poincare子空间中。我们侧重于两个探测目标:(a) 依附树,其等级被定义为依赖性结构;(b) 词汇学情感,其等级被定义为词的极性(隐含性和内涵性)。我们说,一个探测器的关键偏移是语言结构的存在感。我们将这些探测器应用在一种典型的负面背景化嵌入模型上。在一个辛勤的子空间中,我们比Eucliderideideidea 更好地恢复树结构结构结构结构结构结构结构结构,而不是Eucliclideal-lideal devologyal 、我们可能显示一种直观-rogradual-listrisal-exal-restial-exal-exal-s、我们可能显示一种可能显示一种直系的地理-istrisal-ibal-ibal-ex-ex-ex-ex-ex-ex-exal-ibal-ex-exisal-istrismalismalismal-ex、我们或或或演化的表。