Multi-task learning (MTL) aims to improve the performance of multiple related prediction tasks by leveraging useful information from them. Due to their flexibility and ability to reduce unknown coefficients substantially, the task-clustering-based MTL approaches have attracted considerable attention. Motivated by the idea of semisoft clustering of data, we propose a semisoft task clustering approach, which can simultaneously reveal the task cluster structure for both pure and mixed tasks as well as select the relevant features. The main assumption behind our approach is that each cluster has some pure tasks, and each mixed task can be represented by a linear combination of pure tasks in different clusters. To solve the resulting non-convex constrained optimization problem, we design an efficient three-step algorithm. The experimental results based on synthetic and real-world datasets validate the effectiveness and efficiency of the proposed approach. Finally, we extend the proposed approach to a robust task clustering problem.
翻译:多任务学习(MTL)旨在通过利用从它们获得的有用信息来改进多相关预测任务的业绩。由于它们的灵活性和大幅降低未知系数的能力,基于任务分组的MTL方法吸引了相当多的注意力。由于半软数据组合的想法,我们提出了一个半软任务分组方法,它既可以同时揭示纯任务和混合任务的任务组结构,也可以选择相关特征。我们方法的主要假设是,每个组都有一些纯任务,而每个组的任务组可以通过将不同组群的纯任务线性组合来代表。为了解决由此产生的非组合限制优化的问题,我们设计了一个高效的三步算法。基于合成和现实世界数据集的实验结果验证了拟议方法的有效性和效率。最后,我们将拟议方法扩大到一个稳健的任务组问题。