Attributed graphs, typically characterized by irregular topologies and a mix of numerical and categorical attributes, are ubiquitous in diverse domains such as social networks, bioinformatics, and cheminformatics. While graph kernels provide a principled framework for measuring graph similarity, existing kernel methods often struggle to simultaneously capture heterogeneous attribute semantics and neighborhood information in attributed graphs. In this work, we propose the Neighborhood-Aware Star Kernel (NASK), a novel graph kernel designed for attributed graph learning. NASK leverages an exponential transformation of the Gower similarity coefficient to jointly model numerical and categorical features efficiently, and employs star substructures enhanced by Weisfeiler-Lehman iterations to integrate multi-scale neighborhood structural information. We theoretically prove that NASK is positive definite, ensuring compatibility with kernel-based learning frameworks such as SVMs. Extensive experiments are conducted on eleven attributed and four large-scale real-world graph benchmarks. The results demonstrate that NASK consistently achieves superior performance over sixteen state-of-the-art baselines, including nine graph kernels and seven Graph Neural Networks.
翻译:属性图通常具有不规则的拓扑结构以及数值与分类属性混合的特征,广泛存在于社交网络、生物信息学和化学信息学等多个领域。尽管图核为度量图相似性提供了原则性框架,但现有核方法往往难以同时捕捉属性图中的异质属性语义与邻域信息。本文提出邻域感知星形核(NASK),一种专为属性图学习设计的新型图核。NASK利用高厄相似系数的指数变换高效联合建模数值与分类特征,并采用经Weisfeiler-Lehman迭代增强的星形子结构以整合多尺度邻域结构信息。我们从理论上证明了NASK的正定性,确保其与支持向量机等基于核的学习框架兼容。在十一个属性图数据集和四个大规模真实图基准上进行了广泛实验,结果表明NASK在包括九个图核和七个图神经网络在内的十六个前沿基线方法中均取得显著优越的性能。