韩方法律语言理解和判断预测的多任务基准 (A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction)

The recent advances of deep learning have dramatically changed how machine learning, especially in the domain of natural language processing, can be applied to legal domain. However, this shift to the data-driven approaches calls for larger and more diverse datasets, which are nevertheless still small in number, especially in non-English languages. Here we present the first large-scale benchmark of Korean legal AI datasets, LBox Open, that consists of one legal corpus, two classification tasks, two legal judgement prediction (LJP) tasks, and one summarization task. The legal corpus consists of 150k Korean precedents (264M tokens), of which 63k are sentenced in last 4 years and 96k are from the first and the second level courts in which factual issues are reviewed. The two classification tasks are case names (10k) and statutes (3k) prediction from the factual description of individual cases. The LJP tasks consist of (1) 11k criminal examples where the model is asked to predict fine amount, imprisonment with labor, and imprisonment without labor ranges for the given facts, and (2) 5k civil examples where the inputs are facts and claim for relief and outputs are the degrees of claim acceptance. The summarization task consists of the Supreme Court precedents and the corresponding summaries. We also release LCube, the first Korean legal language model trained on the legal corpus from this study. Given the uniqueness of the Law of South Korea and the diversity of the legal tasks covered in this work, we believe that LBox Open contributes to the multilinguality of global legal research. LBox Open and LCube will be publicly available.

翻译：最近深层次学习的进展大大改变了机器学习、特别是自然语言处理领域的机械学习如何适用于法律领域。然而,这种转向数据驱动方法的转变需要更多、更多样化的数据集,但数量仍然很少,特别是非英语的数据集。这里我们介绍了韩国法律独立协会数据集的第一个大规模基准,LBox Open, 包括一个法律文件、两个分类任务、两个法律判决预测任务和一项总结任务。法律系统包括150k朝鲜先例(264M标志),其中63k在过去四年中被判刑,96k来自审查事实问题的第一和第二级法院。两个分类任务包括案件名称(10k)和法规(3k),根据对个别案件的事实描述作出预测。LB的任务包括:(1) 11k犯罪实例,要求模型预测罚款数额、有劳动的监禁和没有劳动的监禁范围。(2) 公开民事实例,其中提供的事实和主张救济和主张的主张来自调查事实问题的63k,96k是审查事实问题的第一和第二级法院。LC的公开性质, 两项分类任务包括案例名称名称(10k)和法规(3k)从具体描述中作出预测的预测。韩国法律研究中,我们所了解的法律研究的首项任务包括了韩国法律标准。