Legal Judgment Prediction (LJP), which aims to predict a judgment from a fact description, serves as a form of legal assistance that mitigates the heavy workload of the limited number of legal practitioners. Most existing methods fine-tune various large-scale pre-trained language models (PLMs) on LJP tasks and obtain consistent improvements. However, we discover that the state-of-the-art (SOTA) model makes judgment predictions according to wrong (or non-causal) information, which not only weakens the model's generalization capability but also leads to severe social problems such as discrimination. Here, we analyze the causal mechanism that misleads the LJP model into learning spurious correlations, and then propose a framework to guide the model to learn the underlying causal knowledge in legal texts. Specifically, we first perform open information extraction (OIE) to refine the text into a version with a high proportion of causal information, from which we generate a new set of data. Then, we design a model that learns the weights of the refined data and the raw data for LJP model training. Extensive experimental results show that our model is more generalizable and robust than the baselines and achieves new SOTA performance on two commonly used legal-specific datasets.