Graph neural networks (GNNs) have emerged as a popular strategy for handling non-Euclidean data due to their state-of-the-art performance. However, most current GNN designs focus mainly on task accuracy and give little consideration to the hardware resource limitations and real-time requirements of edge application scenarios. Comprehensive profiling of typical GNN models indicates that their execution characteristics vary significantly across computing platforms, which demands hardware awareness for efficient GNN design. In this work, HGNAS is proposed as the first Hardware-aware Graph Neural Architecture Search framework targeting resource-constrained edge devices. By decoupling the GNN paradigm, HGNAS constructs a fine-grained design space and leverages an efficient multi-stage search strategy to explore optimal architectures within a few GPU hours. Moreover, HGNAS achieves hardware awareness during GNN architecture design by leveraging a hardware performance predictor, which balances GNN model accuracy and efficiency according to the characteristics of the targeted devices. Experimental results show that HGNAS can achieve about $10.6\times$ speedup and $88.2\%$ peak memory reduction with negligible accuracy loss compared to DGCNN on various edge devices, including Nvidia RTX3080, Jetson TX2, Intel i7-8700K, and Raspberry Pi 3B+.
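The idea of trading accuracy against predicted on-device cost can be illustrated with a minimal sketch. It is not the HGNAS search algorithm: the operation names, the latency and accuracy tables, the penalty weight, and the exhaustive enumeration are all hypothetical stand-ins for the fine-grained design space, the learned hardware performance predictor, and the multi-stage search strategy described above.

```python
import itertools

# Hypothetical per-operation latencies (ms) standing in for a learned hardware
# performance predictor; the numbers are illustrative, not measured.
OP_LATENCY_MS = {"knn_sample": 8.0, "random_sample": 1.0,
                 "max_aggr": 2.0, "mean_aggr": 1.5}
# Hypothetical accuracy contribution of each operation (also illustrative).
OP_ACCURACY = {"knn_sample": 0.030, "random_sample": 0.022,
               "max_aggr": 0.020, "mean_aggr": 0.018}

SAMPLE_OPS = ("knn_sample", "random_sample")
AGGR_OPS = ("max_aggr", "mean_aggr")


def predict_latency_ms(arch):
    """Toy predictor: sum the per-operation latencies of an architecture."""
    return sum(OP_LATENCY_MS[op] for layer in arch for op in layer)


def estimate_accuracy(arch, base=0.80):
    """Toy accuracy proxy: a base value plus per-operation contributions."""
    return base + sum(OP_ACCURACY[op] for layer in arch for op in layer)


def score(arch, latency_budget_ms, penalty=0.01):
    """Accuracy minus a penalty for each millisecond over the latency budget."""
    overshoot = max(0.0, predict_latency_ms(arch) - latency_budget_ms)
    return estimate_accuracy(arch) - penalty * overshoot


def search(num_layers, latency_budget_ms):
    """Score every architecture in the tiny design space and keep the best."""
    layer_choices = list(itertools.product(SAMPLE_OPS, AGGR_OPS))
    candidates = itertools.product(layer_choices, repeat=num_layers)
    return max(candidates, key=lambda arch: score(arch, latency_budget_ms))


if __name__ == "__main__":
    # A loose budget (e.g. a desktop GPU) versus a tight one (e.g. an edge board).
    for budget in (100.0, 6.0):
        best = search(num_layers=2, latency_budget_ms=budget)
        print(budget, best, predict_latency_ms(best))
```

Under these made-up numbers, the loose budget selects the costlier sampling operation in every layer, while the tight budget pushes the search toward the cheaper one. This shows how the same design space can yield different architectures for different target devices.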