Differentiable architecture search (DARTS) is a prevailing NAS solution to identify architectures. Based on the continuous relaxation of the architecture space, DARTS learns a differentiable architecture weight and largely reduces the search cost. However, its stability has been challenged for yielding deteriorating architectures as the search proceeds. We find that the precipitous validation loss landscape, which leads to a dramatic performance drop when distilling the final architecture, is an essential factor that causes instability. Based on this observation, we propose a perturbation-based regularization - SmoothDARTS (SDARTS), to smooth the loss landscape and improve the generalizability of DARTS-based methods. In particular, our new formulations stabilize DARTS-based methods by either random smoothing or adversarial attack. The search trajectory on NAS-Bench-1Shot1 demonstrates the effectiveness of our approach and due to the improved stability, we achieve performance gain across various search spaces on 4 datasets. Furthermore, we mathematically show that SDARTS implicitly regularizes the Hessian norm of the validation loss, which accounts for a smoother loss landscape and improved performance.
翻译:差异化建筑搜索(DARTS)是用于确定建筑的常用NAS解决方案。基于建筑空间的持续放松,DARTS学会了不同的建筑重量,并大大降低了搜索成本。然而,随着搜索的进行,其稳定性因建筑结构的恶化而受到挑战。我们发现,在蒸馏最终建筑时导致性能急剧下降的急剧验证损失景观是造成不稳定的一个基本要素。基于这一观察,我们提议以扰动为基础的正规化(SDARTS),以平滑损失景观,改善基于DARTS的方法的通用性。特别是,我们的新配方通过随机平滑或对抗性攻击来稳定基于DARTS的方法。NAS-Bench-1Shot1的搜索轨迹显示了我们的方法的有效性,并由于稳定性的改善,我们在4个数据集的各种搜索空间中取得了绩效收益。此外,我们用数学显示,SDARTRS隐含地调整了基于DARTS的校准损失规范,从而算出平稳损失和性能的改善。