It has been shown that deep neural networks are prone to overfitting on biased training data. Towards addressing this issue, meta-learning employs a meta model for correcting the training bias. Despite the promising performances, super slow training is currently the bottleneck in the meta learning approaches. In this paper, we introduce a novel Faster Meta Update Strategy (FaMUS) to replace the most expensive step in the meta gradient computation with a faster layer-wise approximation. We empirically find that FaMUS yields not only a reasonably accurate but also a low-variance approximation of the meta gradient. We conduct extensive experiments to verify the proposed method on two tasks. We show our method is able to save two-thirds of the training time while still maintaining the comparable or achieving even better generalization performance. In particular, our method achieves the state-of-the-art performance on both synthetic and realistic noisy labels, and obtains promising performance on long-tailed recognition on standard benchmarks.
翻译:事实表明,深神经网络容易过度利用有偏向的培训数据。为解决这一问题,元学习采用一个元模型来纠正培训偏向。尽管表现良好,但超慢培训目前是元学习方法的瓶颈。在本文中,我们引入了新颖的快速元数据更新战略(FAMUS),以更快的层近似取代元梯度计算中最昂贵的一步。我们从经验中发现,FAMUS不仅产生一个合理准确的元梯度,而且还产生低差差近似值。我们进行了广泛的实验,以核实两种任务的拟议方法。我们展示了我们的方法能够节省三分之二的培训时间,同时仍然保持可比的或取得更好的概括性业绩。特别是,我们的方法在合成和现实的噪声标签上都取得了最先进的业绩,并在标准基准方面获得了长期的认可,取得了有希望的业绩。