Transfer learning across heterogeneous data distributions (a.k.a. domains) and distinct tasks is a more general and challenging problem than conventional transfer learning, where either domains or tasks are assumed to be the same. While neural network based feature transfer is widely used in transfer learning applications, finding the optimal transfer strategy still requires time-consuming experiments and domain knowledge. We propose a transferability metric called Optimal Transport based Conditional Entropy (OTCE), to analytically predict the transfer performance for supervised classification tasks in such cross-domain and cross-task feature transfer settings. Our OTCE score characterizes transferability as a combination of domain difference and task difference, and explicitly evaluates them from data in a unified framework. Specifically, we use optimal transport to estimate domain difference and the optimal coupling between source and target distributions, which is then used to derive the conditional entropy of the target task (task difference). Experiments on the largest cross-domain dataset DomainNet and Office31 demonstrate that OTCE shows an average of 21% gain in the correlation with the ground truth transfer accuracy compared to state-of-the-art methods. We also investigate two applications of the OTCE score including source model selection and multi-source feature fusion.
翻译:不同数据分布(a.k.a.a.域)和不同任务之间的转移学习是一个比常规转移学习(其中要么领域或任务假定相同)更普遍和更具挑战性的问题。虽然在转让学习应用中广泛使用基于神经网络特征的转让,但找到最佳转移战略仍需要花费时间的实验和领域知识。我们建议采用一个叫作基于最佳运输的有条件连接(OTCE)的可转让性指标,以分析预测在此类跨域和跨任务特性传输设置中监督分类任务的转移性能。我们的OTCE分数将可转让性描述为域差异和任务差异的组合,并明确从统一框架内的数据中评估。具体地说,我们使用最佳运输来估计领域差异以及源和目标分布之间的最佳组合,然后用它来得出目标任务(塔克差异)的有条件的酶谱。对最大的跨域数据集DomainNet和Office31的实验显示,OTCE的得分平均为21 %,这与地面真相转移模型的准确度比对州-艺术选择和多级方法。我们还调查了两种来源的利用。