Differentiable architecture search (DARTS) is an effective method for data-driven neural network design based on solving a bilevel optimization problem. Despite its success in many architecture search tasks, there are still some concerns about the accuracy of first-order DARTS and the efficiency of the second-order DARTS. In this paper, we formulate a single level alternative and a relaxed architecture search (RARTS) method that utilizes the whole dataset in architecture learning via both data and network splitting, without involving mixed second derivatives of the corresponding loss functions like DARTS. In our formulation of network splitting, two networks with different but related weights cooperate in search of a shared architecture. The advantage of RARTS over DARTS is justified by a convergence theorem and an analytically solvable model. Moreover, RARTS outperforms DARTS and its variants in accuracy and search efficiency, as shown in adequate experimental results. For the task of searching topological architecture, i.e., the edges and the operations, RARTS obtains a higher accuracy and 60\% reduction of computational cost than second-order DARTS on CIFAR-10. RARTS continues to out-perform DARTS upon transfer to ImageNet and is on par with recent variants of DARTS even though our innovation is purely on the training algorithm without modifying search space. For the task of searching width, i.e., the number of channels in convolutional layers, RARTS also outperforms the traditional network pruning benchmarks. Further experiments on the public architecture search benchmark like NATS-Bench also support the preeminence of RARTS.
翻译:差异建筑搜索(DARSS)是基于解决双级优化问题的数据驱动神经网络设计的有效方法。尽管它在许多结构搜索任务中取得了成功,但仍对一级DARSS的准确性以及二级DARSS的效率存在一些关切。在本文中,我们制定了单一的替代标准和宽松的建筑搜索方法,利用通过数据和网络分割学习的整个数据集(RARTS),而不涉及DARSS等相应损失基准的混合第二衍生物。在设计网络分拆时,两个不同但相关重量的网络合作寻找一个共享架构。RARTS对DARSS的优势在于一阶DARSS的精度和二阶DARSS的效率。此外,RARTS在适当的实验结果中超越了DARRTS及其精度和搜索效率的变体。在搜索之前,例如DARSS的精度结构搜索前,在网络的精度搜索中获得了更高的精度和60英尺的计算成本的削减。在SRARTS的二阶级结构上,在CD-10的搜索中,DRARTS的搜索中,在S的深度结构结构结构中继续使用DRRART的搜索。