Natural language inference has trended toward studying contexts beyond the sentence level. An important application area is law: how past cases apply to new situations is often not stated outright and must be inferred. This paper introduces LawngNLI, constructed from U.S. legal opinions, with automatic labels whose accuracy is high under human validation. Premises are long and multigranular. Experiments show two use cases. First, LawngNLI can serve as a benchmark for in-domain generalization from short to long contexts. It has remained unclear whether large-scale long-premise NLI datasets actually need to be constructed: near-top performance on long premises might be achievable by fine-tuning on short premises alone. Without multigranularity, benchmarks cannot distinguish a lack of fine-tuning on long premises from domain shift between short and long datasets. In contrast, our long and short premises share the same examples and domain. Models fine-tuned on several past NLI datasets and/or our short premises fall short of top performance on our long premises. So for at least certain domains (such as ours), large-scale long-premise datasets are needed. Second, LawngNLI can serve as a benchmark for implication-based retrieval. Queries are entailed or contradicted by target documents, allowing users to move between arguments and evidence. Leading retrieval models perform reasonably well zero-shot on a retrieval task derived from LawngNLI. We compare systems for re-ranking, including lexical overlap and cross-encoders fine-tuned on a modified LawngNLI or on past NLI datasets. LawngNLI can train and test systems for implication-based case retrieval and argumentation.
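One of the re-ranking baselines compared above is lexical overlap. A minimal sketch of what such a re-ranker could look like is below; the tokenization and scoring here are illustrative assumptions, not the paper's exact setup.

```python
# Illustrative lexical-overlap re-ranker: score each candidate document by
# the fraction of (unique) query tokens it contains, then sort descending.
# This is a hypothetical sketch, not the system evaluated in the paper.

def tokenize(text: str) -> set[str]:
    # Naive whitespace tokenization; a real system would normalize further.
    return set(text.lower().split())

def overlap_score(query: str, doc: str) -> float:
    q, d = tokenize(query), tokenize(doc)
    if not q:
        return 0.0
    return len(q & d) / len(q)

def rerank(query: str, candidates: list[str]) -> list[str]:
    # Reorder retrieved candidates by descending lexical overlap with the query.
    return sorted(candidates, key=lambda doc: overlap_score(query, doc), reverse=True)
```

For example, given the query "the court held the contract void", a candidate containing most of those tokens would be ranked above an unrelated document. Cross-encoder re-rankers replace `overlap_score` with a learned relevance model over the (query, document) pair.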