Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad-hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable model of synthetic data as a framework for modeling correlation between data-sets. This setup allows for an analytic characterization of the generalization performance obtained when transferring the learned feature map from the source to the target task. Focusing on the problem of training two-layer networks in a binary classification setting, we show that our model can capture a range of salient features of transfer learning with real data. Moreover, by exploiting parametric control over the correlation between the two data-sets, we systematically investigate under which conditions the transfer of features is beneficial for generalization.
翻译:通过利用数据破碎目标任务与数据丰富源任务之间的关联性,转移学习可以显著提高神经网络的样本效率。尽管应用成功多年,转移学习做法往往依赖临时解决办法,而对于这些程序的理论理解仍然有限。在目前的工作中,我们重新思考一个可溶解的合成数据模型,作为建立数据集之间相互关系模型的框架。这一设置允许对从来源向目标任务转移所学地貌图时所获得的一般化性能进行分析性定性。我们注重二层网络在二元分类设置中的培训问题,我们表明我们的模型可以捕捉一系列用真实数据进行转移学习的突出特征。此外,通过利用对两个数据集之间相互关系的参数的类比控制,我们系统地调查在哪些条件下将特征的转移有利于普遍性。