[Tsinghua University NLP] Must-Read Papers on Pre-trained Language Models (PLMs), with Paper PDFs, Source Code, and Model Links - Zhuanzhi


September 27, 2019 · Zhuanzhi

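[Introduction] Over the past two years, pre-trained language models (PLMs) such as ELMo and BERT have set new records on the leaderboards of many tasks and have drawn wide attention from academia and industry. This article presents the must-read list of PLM papers compiled by the Tsinghua University NLP group, including links to the papers' PDFs, source code, and models.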

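The Tsinghua University NLP group maintains this must-read list in the GitHub project thunlp/PLMpapers, with links to each paper's PDF, source code, and models. The full list follows: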

Models:

Deep contextualized word representations. Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee and Luke Zettlemoyer. NAACL 2018.

Paper:
https://arxiv.org/pdf/1802.05365.pdf
Project:
https://allennlp.org/elmo (ELMo)

Universal Language Model Fine-tuning for Text Classification. Jeremy Howard and Sebastian Ruder. ACL 2018.

Paper:
https://www.aclweb.org/anthology/P18-1031
Project:
http://nlp.fast.ai/category/classification.html (ULMFiT)

Improving Language Understanding by Generative Pre-Training. Alec Radford, Karthik Narasimhan, Tim Salimans and Ilya Sutskever. Preprint.

Paper:
https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
Project:
https://openai.com/blog/language-unsupervised/ (GPT)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova. NAACL 2019.

Paper:
https://arxiv.org/pdf/1810.04805.pdf
Code + models:
https://github.com/google-research/bert

Language Models are Unsupervised Multitask Learners. Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei and Ilya Sutskever. Preprint.

Paper:
https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
Code:
https://github.com/openai/gpt-2 (GPT-2)

ERNIE: Enhanced Language Representation with Informative Entities. Zhengyan Zhang, Xu Han, Zhiyuan Liu, Xin Jiang, Maosong Sun and Qun Liu. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1139
Code + models:
https://github.com/thunlp/ERNIE (ERNIE (Tsinghua))

ERNIE: Enhanced Representation through Knowledge Integration. Yu Sun, Shuohuan Wang, Yukun Li, Shikun Feng, Xuyi Chen, Han Zhang, Xin Tian, Danxiang Zhu, Hao Tian and Hua Wu. Preprint.

Paper:
https://arxiv.org/pdf/1904.09223.pdf
Code:
https://github.com/PaddlePaddle/ERNIE/tree/develop/ERNIE (ERNIE (Baidu))

Defending Against Neural Fake News. Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, Ali Farhadi, Franziska Roesner, Yejin Choi. NeurIPS2019.

Paper:
https://arxiv.org/pdf/1905.12616.pdf
Project:
https://rowanzellers.com/grover/ (Grover)

Cross-lingual Language Model Pretraining. Guillaume Lample, Alexis Conneau. NeurIPS2019.

Paper:
https://arxiv.org/pdf/1901.07291.pdf
Code + models:
https://github.com/facebookresearch/XLM (XLM)

Multi-Task Deep Neural Networks for Natural Language Understanding. Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1441
Code + models:
https://github.com/namisan/mt-dnn (MT-DNN)

MASS: Masked Sequence to Sequence Pre-training for Language Generation. Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu. ICML2019.

Paper:
https://arxiv.org/pdf/1905.02450.pdf
Code + models:
https://github.com/microsoft/MASS

Unified Language Model Pre-training for Natural Language Understanding and Generation. Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon. Preprint.

Paper:
https://arxiv.org/pdf/1905.03197.pdf (UniLM)

XLNet: Generalized Autoregressive Pretraining for Language Understanding. Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le. NeurIPS2019.

Paper:
https://arxiv.org/pdf/1906.08237.pdf
Code + models:
https://github.com/zihangdai/xlnet

RoBERTa: A Robustly Optimized BERT Pretraining Approach. Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov. Preprint.

Paper:
https://arxiv.org/pdf/1907.11692.pdf
Code + models:
https://github.com/pytorch/fairseq

SpanBERT: Improving Pre-training by Representing and Predicting Spans. Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, Omer Levy. Preprint.

Paper:
https://arxiv.org/pdf/1907.10529.pdf
Code + models:
https://github.com/facebookresearch/SpanBERT

Knowledge Enhanced Contextual Word Representations. Matthew E. Peters, Mark Neumann, Robert L. Logan IV, Roy Schwartz, Vidur Joshi, Sameer Singh, Noah A. Smith. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.04164.pdf (KnowBert)

VisualBERT: A Simple and Performant Baseline for Vision and Language. Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang. Preprint.

Paper:
https://arxiv.org/pdf/1908.03557.pdf
Code + models:
https://github.com/uclanlp/visualbert

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee. NeurIPS2019.

Paper:
https://arxiv.org/pdf/1908.02265.pdf
Code + models:
https://github.com/jiasenlu/vilbert_beta

VideoBERT: A Joint Model for Video and Language Representation Learning. Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid. ICCV2019.

Paper:
https://arxiv.org/pdf/1904.01766.pdf

LXMERT: Learning Cross-Modality Encoder Representations from Transformers. Hao Tan, Mohit Bansal. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.07490.pdf
Code + models:
https://github.com/airsplay/lxmert

VL-BERT: Pre-training of Generic Visual-Linguistic Representations. Weijie Su, Xizhou Zhu, Yue Cao, Bin Li, Lewei Lu, Furu Wei, Jifeng Dai. Preprint.

Paper:
https://arxiv.org/pdf/1908.08530.pdf

Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training. Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang, Ming Zhou. Preprint.

Paper:
https://arxiv.org/pdf/1908.06066.pdf

K-BERT: Enabling Language Representation with Knowledge Graph. Weijie Liu, Peng Zhou, Zhe Zhao, Zhiruo Wang, Qi Ju, Haotang Deng, Ping Wang. Preprint.

Paper:
https://arxiv.org/pdf/1909.07606.pdf

Fusion of Detected Objects in Text for Visual Question Answering. Chris Alberti, Jeffrey Ling, Michael Collins, David Reitter. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.05054.pdf (B2T2)

Contrastive Bidirectional Transformer for Temporal Representation Learning. Chen Sun, Fabien Baradel, Kevin Murphy, Cordelia Schmid. Preprint.

Paper:
https://arxiv.org/pdf/1906.05743.pdf (CBT)

ERNIE 2.0: A Continual Pre-training Framework for Language Understanding. Yu Sun, Shuohuan Wang, Yukun Li, Shikun Feng, Hao Tian, Hua Wu, Haifeng Wang. Preprint.

Paper:
https://arxiv.org/pdf/1907.12412v1.pdf
Code:
https://github.com/PaddlePaddle/ERNIE/blob/develop/README.md

75 Languages, 1 Model: Parsing Universal Dependencies Universally. Dan Kondratyuk, Milan Straka. EMNLP2019.

Paper:
https://arxiv.org/pdf/1904.02099.pdf
Code + models:
https://github.com/hyperparticle/udify (UDify)

Pre-Training with Whole Word Masking for Chinese BERT. Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Ziqing Yang, Shijin Wang, Guoping Hu. Preprint.

Paper:
https://arxiv.org/pdf/1906.08101.pdf
Code + models:
https://github.com/ymcui/Chinese-BERT-wwm/blob/master/README_EN.md (Chinese-BERT-wwm)

Knowledge distillation and model compression:

TinyBERT: Distilling BERT for Natural Language Understanding. Xiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, Qun Liu.

Paper:
https://arxiv.org/pdf/1909.10351v1.pdf

Distilling Task-Specific Knowledge from BERT into Simple Neural Networks. Raphael Tang, Yao Lu, Linqing Liu, Lili Mou, Olga Vechtomova, Jimmy Lin. Preprint.

Paper:
https://arxiv.org/pdf/1903.12136.pdf

Patient Knowledge Distillation for BERT Model Compression. Siqi Sun, Yu Cheng, Zhe Gan, Jingjing Liu. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.09355.pdf
Code:
https://github.com/intersun/PKD-for-BERT-Model-Compression

Model Compression with Multi-Task Knowledge Distillation for Web-scale Question Answering System. Ze Yang, Linjun Shou, Ming Gong, Wutao Lin, Daxin Jiang. Preprint.

Paper:
https://arxiv.org/pdf/1904.09636.pdf

PANLP at MEDIQA 2019: Pre-trained Language Models, Transfer Learning and Knowledge Distillation. Wei Zhu, Xiaofeng Zhou, Keqiang Wang, Xun Luo, Xiepeng Li, Yuan Ni, Guotong Xie. The 18th BioNLP workshop.

Paper:
https://www.aclweb.org/anthology/W19-5040

Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding. Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao. Preprint.

Paper:
https://arxiv.org/pdf/1904.09482.pdf
Code + models:
https://github.com/namisan/mt-dnn

Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation. Iulia Turc, Ming-Wei Chang, Kenton Lee, Kristina Toutanova. Preprint.

Paper:
https://arxiv.org/pdf/1908.08962.pdf

Small and Practical BERT Models for Sequence Labeling. Henry Tsai, Jason Riesa, Melvin Johnson, Naveen Arivazhagan, Xin Li, Amelia Archer. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.00100.pdf

Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT. Sheng Shen, Zhen Dong, Jiayu Ye, Linjian Ma, Zhewei Yao, Amir Gholami, Michael W. Mahoney, Kurt Keutzer. Preprint.

Paper:
https://arxiv.org/pdf/1909.05840.pdf

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations. Anonymous authors. ICLR2020 under review.

Paper:
https://openreview.net/pdf?id=H1eA7AEtvS

Analysis:

Revealing the Dark Secrets of BERT. Olga Kovaleva, Alexey Romanov, Anna Rogers, Anna Rumshisky. EMNLP2019.

Paper:
https://arxiv.org/abs/1908.08593

How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations. Betty van Aken, Benjamin Winter, Alexander Löser, Felix A. Gers. CIKM2019.

Paper:
https://arxiv.org/pdf/1909.04925.pdf

Are Sixteen Heads Really Better than One?. Paul Michel, Omer Levy, Graham Neubig. Preprint.

Paper:
https://arxiv.org/pdf/1905.10650.pdf
Code:
https://github.com/pmichel31415/are-16-heads-really-better-than-1

Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment. Di Jin, Zhijing Jin, Joey Tianyi Zhou, Peter Szolovits. Preprint.

Paper:
https://arxiv.org/pdf/1907.11932.pdf
Code:
https://github.com/jind11/TextFooler

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model. Alex Wang, Kyunghyun Cho. NeuralGen2019.

Paper:
https://arxiv.org/pdf/1902.04094.pdf
Code:
https://github.com/nyu-dl/bert-gen

Linguistic Knowledge and Transferability of Contextual Representations. Nelson F. Liu, Matt Gardner, Yonatan Belinkov, Matthew E. Peters, Noah A. Smith. NAACL2019.

Paper:
https://www.aclweb.org/anthology/N19-1112

What Does BERT Look At? An Analysis of BERT's Attention. Kevin Clark, Urvashi Khandelwal, Omer Levy, Christopher D. Manning. BlackBoxNLP2019.

Paper:
https://arxiv.org/pdf/1906.04341.pdf
Code:
https://github.com/clarkkev/attention-analysis

Open Sesame: Getting Inside BERT's Linguistic Knowledge. Yongjie Lin, Yi Chern Tan, Robert Frank. BlackBoxNLP2019.

Paper:
https://arxiv.org/pdf/1906.01698.pdf
Code:
https://github.com/yongjie-lin/bert-opensesame

Analyzing the Structure of Attention in a Transformer Language Model. Jesse Vig, Yonatan Belinkov. BlackBoxNLP2019.

Paper:
https://arxiv.org/pdf/1906.04284.pdf

Blackbox meets blackbox: Representational Similarity and Stability Analysis of Neural Language Models and Brains. Samira Abnar, Lisa Beinborn, Rochelle Choenni, Willem Zuidema. BlackBoxNLP2019.

Paper:
https://arxiv.org/pdf/1906.01539.pdf

BERT Rediscovers the Classical NLP Pipeline. Ian Tenney, Dipanjan Das, Ellie Pavlick. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1452

How multilingual is Multilingual BERT?. Telmo Pires, Eva Schlinger, Dan Garrette. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1493

What Does BERT Learn about the Structure of Language?. Ganesh Jawahar, Benoît Sagot, Djamé Seddah. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1356

Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT. Shijie Wu, Mark Dredze. EMNLP2019.

Paper:
https://arxiv.org/pdf/1904.09077.pdf

How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings. Kawin Ethayarajh. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.00512.pdf

Probing Neural Network Comprehension of Natural Language Arguments. Timothy Niven, Hung-Yu Kao. ACL2019.

Paper:
https://www.aclweb.org/anthology/P19-1459
Code:
https://github.com/IKMLab/arct2

Universal Adversarial Triggers for Attacking and Analyzing NLP. Eric Wallace, Shi Feng, Nikhil Kandpal, Matt Gardner, Sameer Singh. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.07125.pdf
Code:
https://github.com/Eric-Wallace/universal-triggers

The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives. Elena Voita, Rico Sennrich, Ivan Titov. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.01380.pdf

Do NLP Models Know Numbers? Probing Numeracy in Embeddings. Eric Wallace, Yizhong Wang, Sujian Li, Sameer Singh, Matt Gardner. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.07940.pdf

Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs. Alex Warstadt, Yu Cao, Ioana Grosu, Wei Peng, Hagen Blix, Yining Nie, Anna Alsop, Shikha Bordia, Haokun Liu, Alicia Parrish, Sheng-Fu Wang, Jason Phang, Anhad Mohananey, Phu Mon Htut, Paloma Jeretič, Samuel R. Bowman. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.02597.pdf
Code:
https://github.com/alexwarstadt/data_generation

Visualizing and Understanding the Effectiveness of BERT. Yaru Hao, Li Dong, Furu Wei, Ke Xu. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.05620.pdf

Visualizing and Measuring the Geometry of BERT. Andy Coenen, Emily Reif, Ann Yuan, Been Kim, Adam Pearce, Fernanda Viégas, Martin Wattenberg. NeurIPS2019.

Paper:
https://arxiv.org/pdf/1906.02715.pdf

On the Validity of Self-Attention as Explanation in Transformer Models. Gino Brunner, Yang Liu, Damián Pascual, Oliver Richter, Roger Wattenhofer. Preprint.

Paper:
https://arxiv.org/pdf/1908.04211.pdf

Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel. Yao-Hung Hubert Tsai, Shaojie Bai, Makoto Yamada, Louis-Philippe Morency, Ruslan Salakhutdinov. EMNLP2019.

Paper:
https://arxiv.org/pdf/1908.11775.pdf

Language Models as Knowledge Bases? Fabio Petroni, Tim Rocktäschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel. EMNLP2019.

Paper:
https://arxiv.org/pdf/1909.01066.pdf
Code:
https://github.com/facebookresearch/LAMA

Reference link:

https://github.com/thunlp/PLMpapers

