The difficulty of an entity matching task depends on a combination of multiple factors such as the amount of corner-case pairs, the fraction of entities in the test set that have not been seen during training, and the size of the development set. Current entity matching benchmarks usually represent single points in the space along such dimensions or they provide for the evaluation of matching methods along a single dimension, for instance the amount of training data. This paper presents WDC Products, an entity matching benchmark which provides for the systematic evaluation of matching systems along combinations of three dimensions while relying on real-word data. The three dimensions are (i) amount of corner-cases (ii) generalization to unseen entities, and (iii) development set size. Generalization to unseen entities is a dimension not covered by any of the existing benchmarks yet but is crucial for evaluating the robustness of entity matching systems. WDC Products is based on heterogeneous product data from thousands of e-shops which mark-up products offers using schema.org annotations. Instead of learning how to match entity pairs, entity matching can also be formulated as a multi-class classification task that requires the matcher to recognize individual entities. WDC Products is the first benchmark that provides a pair-wise and a multi-class formulation of the same tasks and thus allows to directly compare the two alternatives. We evaluate WDC Products using several state-of-the-art matching systems, including Ditto, HierGAT, and R-SupCon. The evaluation shows that all matching systems struggle with unseen entities to varying degrees. It also shows that some systems are more training data efficient than others.
翻译:实体匹配任务的困难取决于多种因素的组合,例如角对齐数量、测试组中未在培训中看到的实体的分数、以及开发组的规模。当前实体匹配基准通常代表空间中的单一点,这些层面的大小通常代表单一空间的单一点,但对于评价实体匹配系统的稳健性至关重要。WDC产品基于数千个电子商店的混合产品数据,这些商店的标记产品使用 schema.Cont.org 说明。除了学习如何匹配实体配对之外,实体匹配还可以被设计成一个多级分类任务,需要匹配者识别单个实体的通用度(二) 普通度,以及(三) 开发组的规模。对未知实体的普遍化是任何现有基准都没有覆盖的层面,但对于评估实体匹配系统的稳健性至关重要。因此,WDC产品以数千个电子商店的混合产品数据为基础,这些商店使用 schemta.org 说明。除了学习如何匹配实体配对外,实体还可以被设计成一个多级分类任务,需要匹配者对隐蔽实体进行一般化程度的概括,同时将WDC产品显示一个基准,因此,我们可以将多项对等的升级系统进行对比。我们对等的系统进行一个基准,从而对等对比。我们进行多项任务,从而对齐。