In multi-task learning, a learner is given a collection of prediction tasks and needs to solve all of them. In contrast to previous work, which required that annotated training data is available for all tasks, we consider a new setting, in which for some tasks, potentially most of them, only unlabeled training data is provided. Consequently, to solve all tasks, information must be transferred between tasks with labels and tasks without labels. Focusing on an instance-based transfer method we analyze two variants of this setting: when the set of labeled tasks is fixed, and when it can be actively selected by the learner. We state and prove a generalization bound that covers both scenarios and derive from it an algorithm for making the choice of labeled tasks (in the active case) and for transferring information between the tasks in a principled way. We also illustrate the effectiveness of the algorithm by experiments on synthetic and real data.
翻译:在多任务学习中,向学习者提供一系列预测任务,并需要解决所有这些任务。与以前的工作不同,以前的工作要求为所有任务提供附加说明的培训数据,我们考虑一种新的环境,即某些任务(可能多数任务)只提供无标签的培训数据。因此,为了解决所有任务,必须在带有标签的任务和没有标签的任务之间传递信息。我们侧重于基于实例的转移方法,我们分析了这一环境的两个变式:当标签的任务组固定下来,并且可以由学习者积极选择时。我们声明并证明一种涵盖两种情况的概括性约束,并从中推导出一种算法,用于选择标签任务(在活动的情况下)和以有原则的方式在任务之间传递信息。我们还通过合成数据和真实数据的实验来说明算法的有效性。