Graphs can model real-world, complex systems by representing entities and their interactions in terms of nodes and edges. To better exploit the graph structure, graph neural networks have been developed, which learn entity and edge embeddings for tasks such as node classification and link prediction. These models achieve good performance with respect to accuracy, but the confidence scores associated with the predictions might not be calibrated. That means that the scores might not reflect the ground-truth probabilities of the predicted events, which would be especially important for safety-critical applications. Even though graph neural networks are used for a wide range of tasks, the calibration thereof has not been sufficiently explored yet. We investigate the calibration of graph neural networks for node classification, study the effect of existing post-processing calibration methods, and analyze the influence of model capacity, graph density, and a new loss function on calibration. Further, we propose a topology-aware calibration method that takes the neighboring nodes into account and yields improved calibration compared to baseline methods.
翻译:图表可以模拟真实世界, 复杂的系统, 代表实体及其在节点和边缘方面的相互作用。 为了更好地利用图形结构, 已经开发了图形神经网络, 学习了用于节点分类和链接预测等任务的实体和边缘嵌入。 这些模型在准确性方面表现良好, 但与预测相关的信任分数可能无法校准 。 这意味着分数可能无法反映预测事件的地面真实性, 这对于安全关键应用尤其重要 。 即使图形神经网络被用于范围广泛的任务, 其校准尚未得到充分探索 。 我们研究用于节点分类的图形神经网络校准, 研究现有后处理校准方法的影响, 分析模型能力、 图形密度 和新的校准功能的影响 。 此外, 我们建议一种表层- 觉校准方法, 将相邻节点考虑在内, 并比基准方法产生更好的校准 。