Neural networks are powerful models that have a remarkable ability to extract patterns that are too complex to be noticed by humans or other machine learning models. Neural networks are the first class of models that can train end-to-end systems with large learning capacities. However, we still have the difficult challenge of designing the neural network, which requires human experience and a long process of trial and error. As a solution, we can use a neural architecture search to find the best network architecture for the task at hand. Existing NAS algorithms generally evaluate the fitness of a new architecture by fully training from scratch, resulting in the prohibitive computational cost, even if operated on high-performance computers. In this paper, an end-to-end offline performance predictor is proposed to accelerate the evaluation of sampled architectures. Index Terms- Learning Curve Prediction, Neural Architecture Search, Reinforcement Learning.
翻译:神经网络是强大的模型,具有惊人的提取模式的能力,这些模式太复杂,无法被人类或其他机器学习模式所注意到。神经网络是能够培训具有巨大学习能力的端到端系统的第一流模型。然而,我们仍面临设计神经网络的艰巨挑战,这需要人的经验和漫长的试验和错误过程。作为一个解决方案,我们可以使用神经结构搜索来寻找当前任务的最佳网络结构。现有的NAS算法一般通过从零开始充分培训来评估新结构的适合性,导致过高的计算成本,即使操作的是高性能计算机。本文建议用一个端到端的离线性性业绩预测器来加速对抽样结构的评估。索引术语预测、神经结构搜索、强化学习。