Multi-Task Learning (MTL) is a powerful learning paradigm that improves generalization via knowledge sharing across tasks. However, existing studies find that MTL can sometimes hurt generalization, especially when two tasks are weakly correlated. One possible cause is spurious correlation: some knowledge is spurious, i.e., not causally related to the task labels, yet the model may mistakenly exploit it and thus fail when such correlations change. The MTL setting poses several unique challenges with respect to spurious correlation. First, the risk of capturing non-causal knowledge is higher, because the shared MTL model must encode knowledge from all tasks, and knowledge that is causal for one task may be spurious for another. Second, confounders between task labels introduce a further type of spurious correlation specific to MTL. We theoretically prove that MTL is more prone than single-task learning to absorbing non-causal knowledge from other tasks, and therefore generalizes worse. To address this problem, we propose the Multi-Task Causal Representation Learning framework, which represents multi-task knowledge via disentangled neural modules and learns which modules are causally related to each task through MTL-specific invariant regularization. Experiments show that it improves MTL performance by 5.5% on average across Multi-MNIST, MovieLens, Taskonomy, Cityscapes, and NYUv2 by alleviating the spurious correlation problem.
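To make the described setup concrete, below is a minimal PyTorch sketch of one way such a framework could be wired up. The architecture (a shared bank of modules with per-task soft gates) and the IRM-v1-style penalty (Arjovsky et al., 2019) are illustrative assumptions standing in for the paper's disentangled modules and MTL-specific invariant regularization; names such as `ModularMTL` and `irm_penalty` are hypothetical, not from the original work.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ModularMTL(nn.Module):
    def __init__(self, in_dim: int, hidden_dim: int, n_modules: int, n_tasks: int):
        super().__init__()
        # Bank of disentangled knowledge modules shared across all tasks.
        self.module_bank = nn.ModuleList(
            nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
            for _ in range(n_modules)
        )
        # Learnable per-task gates: which modules each task draws on.
        self.gates = nn.Parameter(torch.zeros(n_tasks, n_modules))
        # One prediction head per task (binary tasks for simplicity).
        self.heads = nn.ModuleList(nn.Linear(hidden_dim, 1) for _ in range(n_tasks))

    def forward(self, x: torch.Tensor, task_id: int) -> torch.Tensor:
        feats = torch.stack([m(x) for m in self.module_bank], dim=1)  # (B, M, H)
        w = torch.softmax(self.gates[task_id], dim=0)                 # (M,)
        pooled = (w[None, :, None] * feats).sum(dim=1)                # (B, H)
        return self.heads[task_id](pooled).squeeze(-1)                # (B,)

def irm_penalty(logits: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    """IRM-v1 penalty: squared gradient of the risk w.r.t. a dummy scale.

    The penalty is near zero only if the classifier is already optimal as
    given, which discourages reliance on features whose correlation with
    the label is unstable, i.e., spurious.
    """
    scale = torch.tensor(1.0, requires_grad=True)
    risk = F.binary_cross_entropy_with_logits(logits * scale, y)
    (grad,) = torch.autograd.grad(risk, [scale], create_graph=True)
    return grad.pow(2)

# Toy usage: two binary tasks sharing one batch of inputs.
model = ModularMTL(in_dim=32, hidden_dim=64, n_modules=4, n_tasks=2)
x = torch.randn(8, 32)
ys = [torch.randint(0, 2, (8,)).float() for _ in range(2)]

total = torch.tensor(0.0)
for t in range(2):
    logits = model(x, t)
    total = total + F.binary_cross_entropy_with_logits(logits, ys[t])
    total = total + 1.0 * irm_penalty(logits, ys[t])  # invariance weight (assumed) = 1.0
total.backward()
```

The design intuition in this sketch: the gates expose, per task, which shared modules the prediction actually depends on, and the invariance penalty pushes each task toward modules whose relationship to its label is stable, so that knowledge causal for one task but spurious for another can be gated out rather than absorbed.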