The hyper-parameters of a neural network are traditionally designed through a time-consuming process of trial and error that requires substantial expert knowledge. Neural Architecture Search (NAS) algorithms aim to take the human out of the loop by automatically finding a good set of hyper-parameters for the problem at hand. These algorithms have mostly focused on hyper-parameters such as the architectural configuration of the hidden layers and the connectivity of the hidden neurons, but there has been relatively little work on automating the search for completely new activation functions, which are among the most crucial hyper-parameters to choose. Several widely used activation functions are simple and work well, but there is nonetheless sustained interest in finding better ones. Prior work in the literature has mostly focused on designing new activation functions by hand or on choosing from a set of predefined functions, whereas this work presents an evolutionary algorithm that automates the search for entirely new activation functions. We compare the evolved activation functions to existing, commonly used ones. The results are favorable: the performance of each activation function is averaged over 30 runs, with experiments conducted on 10 different datasets and architectures to ensure the statistical robustness of the study.
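To make the idea concrete, the sketch below shows what an evolutionary search over activation functions can look like in Python. This is not the paper's algorithm: the primitive sets, the (binary op, unary, unary) encoding, the point-mutation scheme, the finite-difference activation gradients, and the toy XOR-style task are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical primitive sets; the paper's actual search space may differ.
UNARY = {
    "tanh": np.tanh,
    "relu": lambda x: np.maximum(x, 0.0),
    "sin": np.sin,
    "identity": lambda x: x,
}
BINARY = {
    "add": lambda a, b: a + b,
    "mul": lambda a, b: a * b,
    "max": np.maximum,
}

def random_candidate():
    """A candidate activation f(x) = op(u1(x), u2(x)), encoded by primitive names."""
    return (rng.choice(list(BINARY)), rng.choice(list(UNARY)), rng.choice(list(UNARY)))

def apply_candidate(cand, x):
    op, u1, u2 = cand
    return BINARY[op](UNARY[u1](x), UNARY[u2](x))

def mutate(cand):
    """Point mutation: resample one of the three primitives."""
    genes = list(cand)
    i = rng.integers(3)
    genes[i] = rng.choice(list(BINARY)) if i == 0 else rng.choice(list(UNARY))
    return tuple(genes)

def fitness(cand, X, y, hidden=16, steps=300, lr=0.5):
    """Accuracy of a one-hidden-layer net trained with the candidate activation.

    The activation's derivative is approximated by central finite differences,
    so any composition of primitives is usable without symbolic gradients.
    """
    n, d = X.shape
    W1 = rng.normal(0.0, 0.5, (d, hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, (hidden, 1)); b2 = np.zeros(1)
    eps = 1e-4
    for _ in range(steps):
        z = X @ W1 + b1
        h = apply_candidate(cand, z)
        logit = np.clip(h @ W2 + b2, -30, 30)
        p = 1.0 / (1.0 + np.exp(-logit))   # sigmoid output
        g = (p - y[:, None]) / n           # d(cross-entropy)/d(logit)
        dh = g @ W2.T
        dz = dh * (apply_candidate(cand, z + eps)
                   - apply_candidate(cand, z - eps)) / (2 * eps)
        W2 -= lr * (h.T @ g);  b2 -= lr * g.sum(0)
        W1 -= lr * (X.T @ dz); b1 -= lr * dz.sum(0)
    z = X @ W1 + b1
    logit = np.clip(apply_candidate(cand, z) @ W2 + b2, -30, 30)
    p = 1.0 / (1.0 + np.exp(-logit))
    return ((p[:, 0] > 0.5) == (y > 0.5)).mean()

# Toy task: XOR-of-signs labels, which a linear model cannot solve.
X = rng.normal(size=(400, 2))
y = ((X[:, 0] * X[:, 1]) > 0).astype(float)

# (mu + lambda)-style loop: keep the best half, refill with mutated parents.
population = [random_candidate() for _ in range(8)]
for generation in range(5):
    ranked = sorted(population, key=lambda c: fitness(c, X, y), reverse=True)
    parents = ranked[:4]
    population = parents + [mutate(parents[rng.integers(len(parents))])
                            for _ in range(4)]

best = max(population, key=lambda c: fitness(c, X, y))
print("best candidate:", best, "accuracy:", fitness(best, X, y))
```

The key design point this sketch illustrates is that fitness is measured end to end: each candidate is scored by actually training a small network with it, which is also why such searches are expensive and why the study averages results over many runs.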