We report on an experiment in legal judgement prediction on European Court of Human Rights cases where our model first learns to predict the convention articles allegedly violated by the state from case facts descriptions, and subsequently utilizes that information to predict a finding of a violation by the court. We assess the dependency between these two tasks at the feature and outcome level. Furthermore, we leverage a hierarchical contrastive loss to pull together article specific representations of cases at the higher level level, leading to distinctive article clusters, and further pulls the cases in each article cluster based on their outcome leading to sub-clusters of cases with similar outcomes. Our experiment results demonstrate that, given a static pre-trained encoder, our models produce a small but consistent improvement in prediction performance over single-task and joint models without contrastive loss.
翻译:我们报告对欧洲人权法院案例的法律判决预测实验,我们的模型首先从案件事实描述中预测国家据称违反公约条款的情况,然后利用这些信息预测法院违反公约的情况。我们评估了这两项任务在特征和结果层面的依存性。此外,我们利用等级对比性损失将条款在较高层面的具体陈述案件集中在一起,从而形成独特的条款集群,并根据每组条款导致产生类似结果的小类案件的结果,进一步提取每组条款中的案件。 我们的实验结果表明,鉴于一个固定的事先训练的编码器,我们的模型在单一任务和联合模型的预测性能方面产生微小但一致的改进,而没有产生对比性损失。