最先进的法律推理模型支持附加推理的能力如何？ (How well do SOTA legal reasoning models support abductive reasoning?) - 专知论文

会员服务 ·

0

法律 · 法律推理 · 法律文本处理 · 推理模型 · SOTA ·

2023 年 4 月 14 日

How well do SOTA legal reasoning models support abductive reasoning?

翻译：最先进的法律推理模型支持附加推理的能力如何？

Ha-Thanh Nguyen,Randy Goebel,Francesca Toni,Kostas Stathis,Ken Satoh

We examine how well the state-of-the-art (SOTA) models used in legal reasoning support abductive reasoning tasks. Abductive reasoning is a form of logical inference in which a hypothesis is formulated from a set of observations, and that hypothesis is used to explain the observations. The ability to formulate such hypotheses is important for lawyers and legal scholars as it helps them articulate logical arguments, interpret laws, and develop legal theories. Our motivation is to consider the belief that deep learning models, especially large language models (LLMs), will soon replace lawyers because they perform well on tasks related to legal text processing. But to do so, we believe, requires some form of abductive hypothesis formation. In other words, while LLMs become more popular and powerful, we want to investigate their capacity for abductive reasoning. To pursue this goal, we start by building a logic-augmented dataset for abductive reasoning with 498,697 samples and then use it to evaluate the performance of a SOTA model in the legal field. Our experimental results show although these models can perform well on tasks related to some aspects of legal text processing, they still fall short in supporting abductive reasoning tasks.

翻译：我们研究了用于法律推理的最先进（SOTA）模型在支持附加推理任务方面的表现。附加推理是一种逻辑推理形式，通过一组观察结果提出假设，并使用该假设解释这些观察结果。能够提出这样的假设对于律师和法学者来说非常重要，因为它们有助于阐述逻辑论点，解释法律，并发展法律理论。我们的动机在于考虑一个信念，即深度学习模型，特别是大型语言模型（LLM）将很快取代律师，因为它们在与法律文本处理相关的任务上表现良好。但是，为了做到这一点，我们认为需要某种形式的附加假设形成。换句话说，虽然LLM变得越来越流行和强大，我们也想研究它们在附加推理方面的能力。为了追求这一目标，我们首先构建了一个附加推理的逻辑增强数据集，含有498,697个样本，然后使用它来评估法律领域SOTA模型的性能。我们的实验结果表明，尽管这些模型在与一些法律文本处理方面的任务上表现良好，但它们在支持附加推理任务方面仍然存在欠缺。

0

相关内容

法律是国家制定或认可的，由国家强制力保证实施的，以规定权利和义务为内容的具有普遍约束力的社会规范。

逆合成预测的学习图模型

逆合成预测的学习图模型

专知会员服务

7+阅读 · 2022年7月29日

历时2年442位作者132个机构！Google发布语言模型评价新基准BIG-bench，204个任务全面评价大语言模型的能力

历时2年442位作者132个机构！Google发布语言模型评价新基准BIG-bench，204个任务全面评价大语言模型的能力

专知会员服务

20+阅读 · 2022年6月10日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【Bengio新论文】从学习机中学习:优化、规则和社会规范（Learning from Learning Machines: Optimisation, Rules, and Social Norms）

【Bengio新论文】从学习机中学习:优化、规则和社会规范（Learning from Learning Machines: Optimisation, Rules, and Social Norms）

专知会员服务

19+阅读 · 2020年1月5日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

从Seq2seq到Attention模型到Self Attention（二）

从Seq2seq到Attention模型到Self Attention（二）

量化投资与机器学习

23+阅读 · 2018年10月9日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

基于非独立同分布学习理论的图模型词义消歧及领域适应方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向词汇功能的学术文本语义识别与知识图谱构建

国家自然科学基金

5+阅读 · 2014年12月31日

小鼠骨髓间充质干细胞对同种异体来源树突状细胞亚群免疫调节影响的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Doublesex基因在对虾性别决定和分化中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

血管内皮前体细胞调控骨髓间充质干细胞巢的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体物理和大气海洋动力学中某些偏微分方程的研究

国家自然科学基金

0+阅读 · 2012年12月31日

深部脑刺激的逆行皮质激活作用对改善帕金森式症运动及技能学习障碍的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

诱导HO-1对老年骨骼肌卫星细胞增殖分化的影响与机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

氨基酸离子液体物理化学性质的实验测定和半经验估算

国家自然科学基金

0+阅读 · 2011年12月31日

c-Myc及Cyclin A2诱导豚鼠耳蜗前体细胞增殖的实验研究

国家自然科学基金

0+阅读 · 2008年12月31日

Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism

Arxiv

0+阅读 · 2023年5月31日

A rule-general abductive learning by rough sets

Arxiv

0+阅读 · 2023年5月31日

What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?

Arxiv

0+阅读 · 2023年5月31日

Strategic Reasoning with Language Models

Arxiv

0+阅读 · 2023年5月30日

How to Staff When Customers Arrive in Batches

Arxiv

0+阅读 · 2023年5月29日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

477+阅读 · 2023年3月31日

Towards Reasoning in Large Language Models: A Survey

Arxiv

34+阅读 · 2022年12月20日

Visual Attention Methods in Deep Learning: An In-Depth Survey

Arxiv

44+阅读 · 2022年4月16日

Neural Collaborative Reasoning

Arxiv

13+阅读 · 2021年5月3日

Attention, please! A survey of Neural Attention Models in Deep Learning

Arxiv

59+阅读 · 2021年3月31日

VIP会员

文章信息

相关主题

法律文本处理

相关VIP内容

逆合成预测的学习图模型

逆合成预测的学习图模型

专知会员服务

7+阅读 · 2022年7月29日

历时2年442位作者132个机构！Google发布语言模型评价新基准BIG-bench，204个任务全面评价大语言模型的能力

历时2年442位作者132个机构！Google发布语言模型评价新基准BIG-bench，204个任务全面评价大语言模型的能力

专知会员服务

20+阅读 · 2022年6月10日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【Bengio新论文】从学习机中学习:优化、规则和社会规范（Learning from Learning Machines: Optimisation, Rules, and Social Norms）

【Bengio新论文】从学习机中学习:优化、规则和社会规范（Learning from Learning Machines: Optimisation, Rules, and Social Norms）

专知会员服务

19+阅读 · 2020年1月5日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

构建军事人工智能信任体系始于破除黑盒机制

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

从Seq2seq到Attention模型到Self Attention（二）

从Seq2seq到Attention模型到Self Attention（二）

量化投资与机器学习

23+阅读 · 2018年10月9日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism

Arxiv

0+阅读 · 2023年5月31日

A rule-general abductive learning by rough sets

Arxiv

0+阅读 · 2023年5月31日

What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?

Arxiv

0+阅读 · 2023年5月31日

Strategic Reasoning with Language Models

Arxiv

0+阅读 · 2023年5月30日

How to Staff When Customers Arrive in Batches

Arxiv

0+阅读 · 2023年5月29日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

477+阅读 · 2023年3月31日

Towards Reasoning in Large Language Models: A Survey

Arxiv

34+阅读 · 2022年12月20日

Visual Attention Methods in Deep Learning: An In-Depth Survey

Arxiv

44+阅读 · 2022年4月16日

Neural Collaborative Reasoning

Arxiv

13+阅读 · 2021年5月3日

Attention, please! A survey of Neural Attention Models in Deep Learning

Arxiv

59+阅读 · 2021年3月31日

相关基金

基于非独立同分布学习理论的图模型词义消歧及领域适应方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向词汇功能的学术文本语义识别与知识图谱构建

国家自然科学基金

5+阅读 · 2014年12月31日

小鼠骨髓间充质干细胞对同种异体来源树突状细胞亚群免疫调节影响的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Doublesex基因在对虾性别决定和分化中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

血管内皮前体细胞调控骨髓间充质干细胞巢的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体物理和大气海洋动力学中某些偏微分方程的研究

国家自然科学基金

0+阅读 · 2012年12月31日

深部脑刺激的逆行皮质激活作用对改善帕金森式症运动及技能学习障碍的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

诱导HO-1对老年骨骼肌卫星细胞增殖分化的影响与机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

氨基酸离子液体物理化学性质的实验测定和半经验估算

国家自然科学基金

0+阅读 · 2011年12月31日

c-Myc及Cyclin A2诱导豚鼠耳蜗前体细胞增殖的实验研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员