A major drawback of modern neural OpenIE systems and benchmarks is that they prioritize high coverage of information in extractions over compactness of their constituents. This severely limits the usefulness of OpenIE extractions in many downstream tasks. The utility of extractions can be improved if extractions are compact and share constituents. To this end, we study the problem of identifying compact extractions with neural-based methods. We propose CompactIE, an OpenIE system that uses a novel pipelined approach to produce compact extractions with overlapping constituents. It first detects constituents of the extractions and then links them to build extractions. We train our system on compact extractions obtained by processing existing benchmarks. Our experiments on CaRB and Wire57 datasets indicate that CompactIE finds 1.5x-2x more compact extractions than previous systems, with high precision, establishing a new state-of-the-art performance in OpenIE.
翻译:现代神经开放IE系统和基准的一大缺点是,它们将提取信息的高覆盖面列为优先事项,而不是其成分的紧凑性。这严重限制了开放IE的提取在许多下游任务中的有用性。如果提取是紧凑和共享成分,则提取的效用可以提高。为此目的,我们研究用神经法查明紧凑提取的问题。我们建议ClaimIEE,这是一个使用新型管道方法产生具有重叠成分的紧凑提取的系统。它首先检测提取的成分,然后将它们连接起来进行提取。我们就通过处理现有基准获得的紧凑提取进行了系统培训。我们对CARB和Wire57数据集的实验表明,CapractIE发现比以前的系统多1.5x-2x的紧凑提取,并且非常精确地在开放IE中建立了一个新的状态。