Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task -- citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This approach misses important signals inherent in Statutes (written laws of a jurisdiction). In this work, we propose Hier-SPCNet that augments PCNet with a heterogeneous network of Statutes. We incorporate domain knowledge for legal document similarity into Hier-SPCNet, thereby obtaining state-of-the-art results for network-based legal document similarity. Both textual and network similarity provide important signals for legal case similarity; but till now, only trivial attempts have been made to unify the two signals. In this work, we apply several methods for combining textual and network information for estimating legal case similarity. We perform extensive experiments over legal case documents from the Indian judiciary, where the gold standard similarity between document-pairs is judged by law experts from two reputed Law institutes in India. Our experiments establish that our proposed network-based methods significantly improve the correlation with domain experts' opinion when compared to the existing methods for network-based legal document similarity. Our best-performing combination method (that combines network-based and text-based similarity) improves the correlation with domain experts' opinion by 11.8% over the best text-based method and 20.6\% over the best network-based method. We also establish that our best-performing method can be used to recommend / retrieve citable and similar cases for a source (query) case, which are well appreciated by legal experts.
翻译:估计两个法律案件文件之间的相似性是一个重要而具有挑战性的问题,我们建议Hier-SPCNet将法律文件相似性的域知识纳入Hier-SPCNet等各种下游应用系统,从而获得基于网络的法律文件类似性的最新结果。基于引用网络的先期方法只考虑引用以往案例(也称为先例)(PCNet),这种方法缺少《规约》(管辖权的成文法)所固有的重要信号。在这项工作中,我们提议Hier-SPCNet将PCNet与各种规约的网络网络联系网络联系起来。我们把法律文件相似性的域知识纳入Hier-SPCNet,从而获得基于网络的法律文件相似性的最新结果。 文本和网络联系都为法律案件提供了重要的信号;但到目前为止,我们只做了一些微不足道的尝试来统一这两个信号。 在这项工作中,我们采用几种方法将基于文本和网络信息结合起来来估计基于法律的案件相似性。 我们对基于印度司法机构的法律文件进行广泛的实验,在那里,由两个基于文件标准的金本系专家对基于文件相似性进行判断,然后由两个法律专家对使用最佳的域网际法方法进行比较,用我们的最佳方法来改进现有的网络法系法系。我们用最佳方法来将现有的法系与我们的最佳方法比现有的法系/比较。我们现有的法系法系,我们的最佳法系的专家可以把现有的法系法系法系法系法系法系法系法系与目前比较法系法系法系法系法比比较。