Developing meta-learning algorithms that are un-biased toward a subset of training tasks often requires hand-designed criteria to weight tasks, potentially resulting in sub-optimal solutions. In this paper, we introduce a new principled and fully-automated task-weighting algorithm for meta-learning methods. By considering the weights of tasks within the same mini-batch as an action, and the meta-parameter of interest as the system state, we cast the task-weighting meta-learning problem to a trajectory optimisation and employ the iterative linear quadratic regulator to determine the optimal action or weights of tasks. We theoretically show that the proposed algorithm converges to an $\epsilon_{0}$-stationary point, and empirically demonstrate that the proposed approach out-performs common hand-engineering weighting methods in two few-shot learning benchmarks.
翻译:开发对一组培训任务没有偏向的元学习算法往往需要手工设计加权任务的标准,这可能导致次优的解决方案。 在本文中,我们为元学习方法引入了新的有原则的、完全自动化的任务加权算法。 通过考虑任务在同一个微型批量中的权重作为一种行动,以及系统状态中的利益元参数,我们将任务加权的元学习问题置于轨迹优化之下,并使用迭代线性线性二次调节器来确定任务的最佳动作或权重。 我们理论上表明,拟议的算法将汇集到一个$\ epsilon ⁇ 0} 固定点,并用经验证明,拟议的方法在两个微小的学习基准中超越了共同的手工工程权重法。