A zoo of deep nets is available these days for almost any given task, and it is increasingly unclear which net to start with when addressing a new task, or which net to use as an initialization for fine-tuning a new model. To address this issue, in this paper, we develop knowledge flow which moves 'knowledge' from multiple deep nets, referred to as teachers, to a new deep net model, called the student. The structure of the teachers and the student can differ arbitrarily and they can be trained on entirely different tasks with different output spaces too. Upon training with knowledge flow the student is independent of the teachers. We demonstrate our approach on a variety of supervised and reinforcement learning tasks, outperforming fine-tuning and other 'knowledge exchange' methods.
翻译:现在几乎任何特定任务都有一个深网动物园,而且越来越不清楚在处理新任务时从哪个网开始,或者用哪个网开始微调新模式。为了解决这个问题,我们在本文件中发展知识流动,将“知识”从被称为教师的多条深网转移到称为学生的新的深网模式。教师和学生的结构可以任意不同,他们也可以在不同的产出空间接受完全不同的任务培训。在接受知识流动培训后,学生独立于教师。我们展示了我们在各种监督和强化学习任务、优于微调和其他“知识交流”方法上的做法。