In last decades, legal case search has received more and more attention. Legal practitioners need to work or enhance their efficiency by means of class case search. In the process of searching, legal practitioners often need the search results under several different causes of cases as reference. However, existing work tends to focus on the relevance of the judgments themselves, without considering the connection between the causes of action. Several well-established diversity search techniques already exist in open-field search efforts. However, these techniques do not take into account the specificity of legal search scenarios, e.g., the subtopic may not be independent of each other, but somehow connected. Therefore, we construct a diversity legal retrieval model. This model takes into account both diversity and relevance, and is well adapted to this scenario. At the same time, considering the lack of dataset with diversity labels, we constructed a diversity legal retrieval dataset and obtained labels by manual labeling. experiments confirmed that our model is effective.
翻译:在过去几十年中,法律案件搜索受到越来越多的关注。法律从业人员需要通过分类案件搜索来工作或提高其效率。在搜索过程中,法律从业人员往往需要在若干不同案件原因下作为参考的搜索结果。然而,现有工作往往侧重于判决本身的相关性,而没有考虑行动原因之间的联系。在露天搜索工作中已经存在一些既定的多样性搜索技术。然而,这些技术没有考虑到法律搜索情景的特殊性,例如,分专题可能并非相互独立,而是某种程度上相互联系。因此,我们构建了一个多样性法律检索模型。这一模型既考虑到多样性和相关性,又完全适应于这一假设。与此同时,考虑到缺乏多样性标签数据,我们建立了多样性法律检索数据集,通过手工标签获得标签。实验证实,我们的模式是有效的。