A law practitioner has to go through numerous lengthy legal case proceedings for their practices of various categories, such as land dispute, corruption, etc. Hence, it is important to summarize these documents, and ensure that summaries contain phrases with intent matching the category of the case. To the best of our knowledge, there is no evaluation metric that evaluates a summary based on its intent. We propose an automated intent-based summarization metric, which shows a better agreement with human evaluation as compared to other automated metrics like BLEU, ROUGE-L etc. in terms of human satisfaction. We also curate a dataset by annotating intent phrases in legal documents, and show a proof of concept as to how this system can be automated. Additionally, all the code and data to generate reproducible results is available on Github.
翻译:开业律师必须就其各类做法,如土地纠纷、腐败等,经历无数冗长的法律案件诉讼。 因此,必须总结这些文件,确保摘要中含有意图与案件类别相符的短语。据我们所知,没有根据意图评价摘要的评价标准。我们建议采用基于意图的自动汇总衡量标准,该标准比其他自动衡量标准,如BLEU、ROUGE-L等,在人文满意度方面更同意人文评价。我们还在法律文件中用说明意图的短语整理数据,并展示如何使该系统自动化的概念证明。此外,所有生成可复制结果的代码和数据都可以在Github上查阅。