Multi-Task Learning (MTL) is a well-established paradigm for training deep neural network models for multiple correlated tasks. Often the task objectives conflict, requiring trade-offs between them during model building. In such cases, MTL models can use gradient-based multi-objective optimization (MOO) to find one or more Pareto optimal solutions. A common requirement in MTL applications is to find an {\it Exact} Pareto optimal (EPO) solution, which satisfies user preferences with respect to task-specific objective functions. Further, to improve model generalization, various constraints on the weights may need to be enforced during training. Addressing these requirements is challenging because it requires a search direction that allows descent not only towards the Pareto front but also towards the input preference, within the constraints imposed and in a manner that scales to high-dimensional gradients. We design and theoretically analyze such search directions and develop the first scalable algorithm, with theoretical guarantees of convergence, to find an EPO solution, including when box and equality constraints are imposed. Our unique method combines multiple gradient descent with carefully controlled ascent to traverse the Pareto front in a principled manner, making it robust to initialization. This also facilitates systematic exploration of the Pareto front, that we utilize to approximate the Pareto front for multi-criteria decision-making. Empirical results show that our algorithm outperforms competing methods on benchmark MTL datasets and MOO problems.
翻译:多任务学习(MTL)是培训深神经网络模型的既定范例,用于培训多个相关任务的深神经网络模式。任务目标往往有冲突,在模型建设期间需要相互权衡。在这种情况下,MTL模式可以使用基于梯度的多目标优化(MOO)来找到一种或多种最佳解决方案。MTL应用的一个共同要求是找到一个满足用户对任务特定目标功能的偏好的优化(EPO)解决方案。此外,为了改进模型的概括化,在培训期间可能需要对权重施加各种限制。满足这些要求具有挑战性,因为要满足这些要求需要有一个搜索方向,不仅允许从Pareto前面向下,而且允许向投入偏好,在规定的限制范围内,以一个尺度到高维度梯度的尺度。我们设计并理论上分析这种搜索方向,并开发第一个具有理论保证一致性的算法,以找到EPO(EPO)解决方案,包括当设置框和平等制约时。我们的独特方法将多重梯度下降与谨慎控制相结合,同时将它与制为初始的基点,从而利用Pareto robal robal roto rodal 一种我们采用一个原则式的模型的方法。我们用一个原则式的模型,我们用一个原则方法来展示式的模型,我们用一种方法来展示的模型,我们用一个方法来显示我们是如何的模型的模型的模型的模型。