培训从零到英雄的深网络:避免陷阱和超越 (Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond)

Training deep neural networks may be challenging in real world data. Using models as black-boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be particularly useful in datasets that are not as well-prepared as those in challenges, and also under scarce annotation and/or small data. We describe basic procedures: as data preparation, optimization and transfer learning, but also recent architectural choices such as use of transformer modules, alternative convolutional layers, activation functions, wide and deep networks, as well as training procedures including as curriculum, contrastive and self-supervised learning.

翻译：在现实世界数据中,培训深层神经网络可能具有挑战性。使用模型作为黑箱,即使进行传输学习,在小型数据集或具体应用程序方面,也可能造成不全面或无结果的结果。这一指导性内容涵盖基本步骤以及最新的改进模型的备选方案,特别是但不局限于监督学习。它对于没有像挑战中那样做好准备的数据集,以及缺乏说明和(或)小数据的数据集可能特别有用。我们描述了基本程序:例如数据编制、优化和传输学习,以及最近的建筑选择,如变压器模块的使用、替代的交替层、激活功能、宽广而深的网络以及培训程序,包括课程、对比式和自我监督的学习。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日