Reviewing contracts is a time-consuming procedure that incurs large expenses to companies and social inequality to those who cannot afford it. In this work, we propose "document-level natural language inference (NLI) for contracts", a novel, real-world application of NLI that addresses such problems. In this task, a system is given a set of hypotheses (such as "Some obligations of Agreement may survive termination.") and a contract, and it is asked to classify whether each hypothesis is "entailed by", "contradicting to" or "not mentioned by" (neutral to) the contract as well as identifying "evidence" for the decision as spans in the contract. We annotated and release the largest corpus to date consisting of 607 annotated contracts. We then show that existing models fail badly on our task and introduce a strong baseline, which (1) models evidence identification as multi-label classification over spans instead of trying to predict start and end tokens, and (2) employs more sophisticated context segmentation for dealing with long documents. We also show that linguistic characteristics of contracts, such as negations by exceptions, are contributing to the difficulty of this task and that there is much room for improvement.
翻译:审查合同是一种耗时的程序,给公司带来大量费用,给无力支付合同的人造成社会不平等。在这项工作中,我们提议“合同的自然语言文件推论”,这是NLI处理这类问题的新的、现实世界应用,在这项工作中,给一个系统提供一套假设(如“协议的某些义务可能持续终止”等)和合同,要求它将每一假设是“由”、“与合同的矛盾”还是“未提及”(中立的)合同,以及确定合同中的决定的“证据”。我们附加说明并公布迄今最大的文件,包括607份附加说明的合同。然后我们表明,现有模型在执行任务时严重失灵,并引入了强有力的基线,即(1) 模型证据确认是跨段的多标签分类,而不是试图预测开始和结束的符号,(2) 使用更复杂的背景分割处理长文件。我们还表明,合同的语言特征,例如例外否定,正在造成这项工作的难度很大,而且有改进的余地。