用于特权审查的转让学习经验研究 (An Empirical Study on Transfer Learning for Privilege Review) - 专知论文

会员服务 ·

0

可辨认的 · 迁移学习 · 学成 · CASES · MoDELS ·

2021 年 12 月 16 日

An Empirical Study on Transfer Learning for Privilege Review

翻译：用于特权审查的转让学习经验研究

Haozhen Zhao,Shi Ye,Jingchao Yang

from arxiv, 2021 IEEE International Conference on Big Data (Big Data)

Protecting privileged communications and data from inadvertent disclosure is a paramount task in the US legal practice. Traditionally counsels rely on keyword searching and manual review to identify privileged documents in cases. As data volumes increase, this approach becomes less and less defensible in costs. Machine learning methods have been used in identifying privilege documents. Given the generalizable nature of privilege in legal cases, we hypothesize that transfer learning can capitalize knowledge learned from existing labeled data to identify privilege documents without requiring labeling new training data. In this paper, we study both traditional machine learning models and deep learning models based on BERT for privilege document classification tasks in legal document review, and we examine the effectiveness of transfer learning in privilege model on three real world datasets with privilege labels. Our results show that BERT model outperforms the industry standard logistic regression algorithm and transfer learning models can achieve decent performance on datasets in same or close domains.

翻译：保护特权通信和数据不被无意披露是美国法律惯例中的一项首要任务。传统上,律师依靠关键词搜索和人工审查来识别案件中的特权文件。随着数据量的增加,这种做法在成本上变得越来越少,越来越难以辩护。机器学习方法被用于确定特权文件。鉴于特权在法律案件中具有普遍适用的性质,我们假设转让学习能够利用从现有标签数据中获取的知识来识别特权文件,而不需要为新的培训数据贴标签。在本文中,我们研究了传统机器学习模式和基于BERT的深层次学习模式,以在法律文件审查中确定特权文件分类任务。我们研究了特权模式中三个真实世界数据集的特权学习在特权标签上的有效性。我们的结果表明,BERT模型超越了行业标准的物流回归算法和转让学习模式,可以在相同或近距离的域内实现数据集的体面表现。

0

相关内容

可辨认的

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

专知会员服务

60+阅读 · 2020年7月14日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【2019/2020之交的机器学习/深度学习技术概述】《2019 In-Review and Trends for 2020 – A Technical Overview of Machine Learning and Deep Learning!》by Analytics Vidhya

【2019/2020之交的机器学习/深度学习技术概述】《2019 In-Review and Trends for 2020 – A Technical Overview of Machine Learning and Deep Learning!》by Analytics Vidhya

专知会员服务

21+阅读 · 2020年2月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

《动手学深度学习》(Dive into Deep Learning)PyTorch实现

《动手学深度学习》(Dive into Deep Learning)PyTorch实现

专知会员服务

120+阅读 · 2019年12月31日

【剑桥大学】神经机器翻译综述论文，Neural Machine Translation: A Review，附88页pdf

【剑桥大学】神经机器翻译综述论文，Neural Machine Translation: A Review，附88页pdf

专知会员服务

37+阅读 · 2019年12月4日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

A multi-task semi-supervised framework for Text2Graph & Graph2Text

A multi-task semi-supervised framework for Text2Graph & Graph2Text

Arxiv

0+阅读 · 2022年2月18日

Deep Semi-Supervised Learning for Time Series Classification

Arxiv

0+阅读 · 2022年2月16日

A Survey of Unsupervised Domain Adaptation for Visual Recognition

Arxiv

9+阅读 · 2021年12月13日

Dynamic Transfer Learning for Named Entity Recognition

Dynamic Transfer Learning for Named Entity Recognition

Arxiv

5+阅读 · 2019年5月1日

Hierarchical Meta Learning

Arxiv

9+阅读 · 2019年4月19日

Deep learning for time series classification: a review

Arxiv

12+阅读 · 2019年3月14日

Deep Learning on Graphs: A Survey

Arxiv

53+阅读 · 2018年12月11日

A Survey on Deep Transfer Learning

A Survey on Deep Transfer Learning

Arxiv

11+阅读 · 2018年8月6日

Exploiting the potential of unlabeled endoscopic video data with self-supervised learning

Arxiv

7+阅读 · 2018年1月26日

Multi-Task Learning with Labeled and Unlabeled Tasks

Arxiv

3+阅读 · 2017年6月8日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

最新《3D医疗图像处理》综述论文，23页pdf，3D Deep Learning on Medical Images: A Review

专知会员服务

60+阅读 · 2020年7月14日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【2019/2020之交的机器学习/深度学习技术概述】《2019 In-Review and Trends for 2020 – A Technical Overview of Machine Learning and Deep Learning!》by Analytics Vidhya

【2019/2020之交的机器学习/深度学习技术概述】《2019 In-Review and Trends for 2020 – A Technical Overview of Machine Learning and Deep Learning!》by Analytics Vidhya

专知会员服务

21+阅读 · 2020年2月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

《动手学深度学习》(Dive into Deep Learning)PyTorch实现

《动手学深度学习》(Dive into Deep Learning)PyTorch实现

专知会员服务

120+阅读 · 2019年12月31日

【剑桥大学】神经机器翻译综述论文，Neural Machine Translation: A Review，附88页pdf

【剑桥大学】神经机器翻译综述论文，Neural Machine Translation: A Review，附88页pdf

专知会员服务

37+阅读 · 2019年12月4日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML2025】关于语言模型对齐中奖励模型稳健性的研究

【阿姆斯特丹博士论文】终端设备上的高效深度学习推理

【EPFL博士论文】在多模态基础模型中扩展模态能力，附185页slides

Al Agent：AI时代的软件革命

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

相关论文

A multi-task semi-supervised framework for Text2Graph & Graph2Text

A multi-task semi-supervised framework for Text2Graph & Graph2Text

Arxiv

0+阅读 · 2022年2月18日

Deep Semi-Supervised Learning for Time Series Classification

Arxiv

0+阅读 · 2022年2月16日

A Survey of Unsupervised Domain Adaptation for Visual Recognition

Arxiv

9+阅读 · 2021年12月13日

Dynamic Transfer Learning for Named Entity Recognition

Dynamic Transfer Learning for Named Entity Recognition

Arxiv

5+阅读 · 2019年5月1日

Hierarchical Meta Learning

Arxiv

9+阅读 · 2019年4月19日

Deep learning for time series classification: a review

Arxiv

12+阅读 · 2019年3月14日

Deep Learning on Graphs: A Survey

Arxiv

53+阅读 · 2018年12月11日

A Survey on Deep Transfer Learning

A Survey on Deep Transfer Learning

Arxiv

11+阅读 · 2018年8月6日

Exploiting the potential of unlabeled endoscopic video data with self-supervised learning

Arxiv

7+阅读 · 2018年1月26日

Multi-Task Learning with Labeled and Unlabeled Tasks

Arxiv

3+阅读 · 2017年6月8日

微信扫码咨询专知VIP会员