Classification algorithms using Transformer architectures can be affected by the sequence length learning problem whenever observations from different classes have different length distributions. This problem leads models to use sequence length as a predictive feature instead of relying on important textual information. While most public datasets are not affected by this problem, private corpora in fields such as medicine and insurance may carry this data bias. This poses challenges throughout the value chain, given their use in machine learning applications. In this paper, we empirically expose this problem and present approaches to minimize its impact.
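The bias described above can be illustrated with a toy sketch (all names and the length distributions here are hypothetical, not from the paper): when two classes have different length distributions, a trivial "classifier" that only inspects sequence length, never the content, can still score well above chance.

```python
import random

random.seed(0)

# Hypothetical toy corpus: class 0 documents tend to be short,
# class 1 documents tend to be long. Token content is identical,
# so only length carries any signal.
docs = [(["tok"] * random.randint(5, 50), 0) for _ in range(500)] + \
       [(["tok"] * random.randint(40, 200), 1) for _ in range(500)]

THRESHOLD = 45  # picked between the two (overlapping) length ranges

def length_only_predict(tokens):
    # Ignores all textual information; decides purely on sequence length.
    return 0 if len(tokens) < THRESHOLD else 1

accuracy = sum(length_only_predict(t) == y for t, y in docs) / len(docs)
print(f"length-only accuracy: {accuracy:.2f}")
```

Despite ignoring every token, this predictor is far better than the 0.50 chance level, which is exactly the shortcut a Transformer classifier may learn when such a length skew is present in training data.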