High-quality text data has become an important data source for social scientists, and recent social science research has seen the success of pretrained deep neural network models such as BERT and RoBERTa. In this paper, we propose a compact pretrained deep neural network, the Transformer Encoder for Social Science (TESS), explicitly designed for text processing tasks in social science research. Using two validation tests, we show that TESS outperforms BERT and RoBERTa by 16.7% on average when the number of training samples is limited (<1,000 training instances). These results demonstrate the advantage of TESS over BERT and RoBERTa on social science text processing tasks. Finally, we discuss the limitations of our model and offer advice for future researchers.
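As a rough illustration of the low-resource fine-tuning setting described above, the sketch below fine-tunes a compact pretrained encoder on a small labeled text corpus using the Hugging Face transformers library. The checkpoint name "tess-base" is hypothetical, standing in for the released TESS weights; substituting "bert-base-uncased" or "roberta-base" reproduces the baseline setup. This is a minimal sketch, not the authors' exact training pipeline.

```python
# Minimal low-resource fine-tuning sketch. The checkpoint "tess-base" is a
# hypothetical placeholder for the TESS weights; the baselines would use
# "bert-base-uncased" or "roberta-base" instead.
import torch
from torch.utils.data import DataLoader, TensorDataset
from transformers import AutoTokenizer, AutoModelForSequenceClassification

checkpoint = "tess-base"  # hypothetical name; replace with the real checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

# Toy examples standing in for a small (<1,000 instance) labeled corpus.
texts = ["The bill passed with bipartisan support.",
         "Protesters clashed with police downtown."]
labels = [0, 1]

# Tokenize once and wrap the tensors in a DataLoader.
encodings = tokenizer(texts, truncation=True, padding=True, return_tensors="pt")
dataset = TensorDataset(encodings["input_ids"],
                        encodings["attention_mask"],
                        torch.tensor(labels))
loader = DataLoader(dataset, batch_size=8, shuffle=True)

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
model.train()
for epoch in range(3):  # a few epochs are typical with small training sets
    for input_ids, attention_mask, batch_labels in loader:
        outputs = model(input_ids=input_ids,
                        attention_mask=attention_mask,
                        labels=batch_labels)
        outputs.loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```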