AULos- Zero : 为通用任务从 Scratch 搜索丢失函数 (AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks)

Significant progress has been achieved in automating the design of various components in deep networks. However, the automatic design of loss functions for generic tasks with various evaluation metrics remains under-investigated. Previous works on handcrafting loss functions heavily rely on human expertise, which limits their extendibility. Meanwhile, existing efforts on searching loss functions mainly focus on specific tasks and particular metrics, with task-specific heuristics. Whether such works can be extended to generic tasks is not verified and questionable. In this paper, we propose AutoLoss-Zero, the first general framework for searching loss functions from scratch for generic tasks. Specifically, we design an elementary search space composed only of primitive mathematical operators to accommodate the heterogeneous tasks and evaluation metrics. A variant of the evolutionary algorithm is employed to discover loss functions in the elementary search space. A loss-rejection protocol and a gradient-equivalence-check strategy are developed so as to improve the search efficiency, which are applicable to generic tasks. Extensive experiments on various computer vision tasks demonstrate that our searched loss functions are on par with or superior to existing loss functions, which generalize well to different datasets and networks. Code shall be released.

翻译：在设计深层网络各组成部分方面已取得重大进展。然而,对各种评价指标通用任务损失功能的自动设计仍未得到充分调查。以前关于手工艺损失功能的工程严重依赖人的专门知识,这限制了其扩展性。与此同时,现有的寻找损失功能的工作主要侧重于具体任务和特定指标,并带有任务特有的杂质。这种工程能否扩大到通用任务,是无法核实和值得怀疑的。在本文件中,我们提议AutoLos-Zero,这是从头到尾查找损失功能以完成通用任务的第一个总框架。具体地说,我们设计了一个基本搜索空间,仅由原始数学操作员组成,以适应不同的任务和评估指标。采用演进算法的一种变式,以发现初级搜索空间的损失功能。为了提高搜索效率,将适用于通用任务,将制定损失反馈协议和梯度等值检查战略。关于各种计算机视觉任务的广泛实验表明,我们所搜索的损失功能与现有损失功能相同或优于现有损失功能,这些功能一般地适用于不同的数据集和网络。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

【如何做研究】How to research ，22页ppt

专知会员服务

112+阅读 · 2021年4月17日

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

专知会员服务

17+阅读 · 2020年4月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

多伦多大学2020春季CSC311课程「机器学习导论」，学习ML基础知识

专知会员服务

54+阅读 · 2020年1月13日