When faced with data-starved or highly complex end-tasks, it is commonplace for machine learning practitioners to introduce auxiliary objectives as supplementary learning signals. Whilst much work has been done to formulate useful auxiliary objectives, their construction is still an art which proceeds by slow and tedious hand-design. Intuitions about how and when these objectives improve end-task performance have also had limited theoretical backing. In this work, we present an approach for automatically generating a suite of auxiliary objectives. We achieve this by deconstructing existing objectives within a novel unified taxonomy, identifying connections between them, and generating new ones based on the uncovered structure. Next, we theoretically formalize widely-held intuitions about how auxiliary learning improves generalization of the end-task. This leads us to a principled and efficient algorithm for searching the space of generated objectives to find those most useful to a specified end-task. With natural language processing (NLP) as our domain of study, we empirically verify that our automated auxiliary learning pipeline leads to strong improvements over competitive baselines across continued training experiments on a pre-trained model on 5 NLP end-tasks.
翻译:当面临数据紧缺或高度复杂的最终任务时,机器学习实践者通常会把辅助目标作为补充学习信号来引入辅助目标。虽然在制订有用的辅助目标方面做了大量工作,但其构建仍然是通过缓慢和乏味的手工设计来推进的艺术。关于这些目标如何和何时改进最终任务业绩的理论支持也很有限。在这项工作中,我们提出了一个自动生成一套辅助目标的方法。我们通过在新颖的统一分类中解构现有目标,查明它们之间的联系,并根据未发现的结构创造新的目标,来实现这一目标。接下来,我们理论上正式确定关于辅助学习如何改进最终任务一般化的广泛直觉。这导致我们为寻找产生目标的空间以找到对特定最终任务最有用的空间而采用有原则和有效的算法。我们的研究领域是自然语言处理,我们从经验上核实我们的自动辅助学习管道导致在5个最终任务前训练模型的持续培训实验中,在竞争基线方面有很强的改进。