Graph neural networks (GNNs) have emerged as a popular strategy for handling non-Euclidean data due to their state-of-the-art performance. However, most of the current GNN model designs mainly focus on task accuracy, lacking in considering hardware resources limitation and real-time requirements of edge application scenarios. Comprehensive profiling of typical GNN models indicates that their execution characteristics are significantly affected across different computing platforms, which demands hardware awareness for efficient GNN designs. In this work, HGNAS is proposed as the first Hardware-aware Graph Neural Architecture Search framework targeting resource constraint edge devices. By decoupling the GNN paradigm, HGNAS constructs a fine-grained design space and leverages an efficient multi-stage search strategy to explore optimal architectures within a few GPU hours. Moreover, HGNAS achieves hardware awareness during the GNN architecture design by leveraging a hardware performance predictor, which could balance the GNN model accuracy and efficiency corresponding to the characteristics of targeted devices. Experimental results show that HGNAS can achieve about $10.6\times$ speedup and $88.2\%$ peak memory reduction with a negligible accuracy loss compared to DGCNN on various edge devices, including Nvidia RTX3080, Jetson TX2, Intel i7-8700K and Raspberry Pi 3B+.
翻译:图神经网络(GNN)因其最先进的性能已成为处理非欧几里得数据的流行策略。然而,大多数当前的GNN模型设计主要关注任务准确性,缺乏考虑边缘应用场景下硬件资源限制和实时性要求的综合性能。典型GNN模型的全面性能分析表明,它们的执行特征受不同的计算平台显著影响,这要求硬件感知的高效GNN设计。本文提出了HGNAS作为第一个面向资源受限的边缘设备的硬件感知图神经体系结构搜索框架。通过解耦GNN范式,HGNAS构建了一个细粒度的设计空间,并利用有效的多阶段搜索策略在几个GPU小时内探索最优体系结构。此外,HGNAS通过利用硬件性能预测器在GNN架构设计中实现硬件感知,从而能够平衡GNN模型准确性和相应的设备特性的效率。实验结果表明,与DGCNN相比,在各种边缘设备上,包括Nvidia RTX3080、Jetson TX2、Intel i7-8700K和Raspberry Pi 3B+,HGNAS可以实现约10.6倍的加速和88.2%的内存峰值降低,并且仅有微不足道的精度损失。