Motivated by various data science applications including de-anonymizing user identities in social networks, we consider the graph alignment problem, where the goal is to identify the vertex/user correspondence between two correlated graphs. Existing work mostly recovers the correspondence by exploiting the user-user connections. However, in many real-world applications, additional information about the users, such as user profiles, might be publicly available. In this paper, we introduce the attributed graph alignment problem, where additional user information, referred to as attributes, is incorporated to assist graph alignment. We establish sufficient and necessary conditions for recovering vertex correspondence exactly, where the conditions match for a wide range of practical regimes. Our results recover existing tight information-theoretic limits for models where only the user-user connections are available, spanning the full spectrum between these models and models where only attribute information is available.
翻译:在各种数据科学应用的推动下,包括将社交网络中的用户身份非匿名化等数据科学应用,我们考虑了图形对齐问题,目的是查明两个相关图表之间的顶层/用户对应关系;现有工作大多通过利用用户-用户连接恢复通信;然而,在许多现实应用中,关于用户的额外信息,如用户概况等,可能公开提供。在本文中,我们引入了归属图对齐问题,将被称为属性的额外用户信息纳入其中,以协助图形对齐。我们为收回顶层通信建立了充分而必要的条件,以便准确恢复符合广泛实用制度的条件。我们的成果恢复了只有用户-用户连接的模型现有的严格信息理论限制,跨越了这些模型和只有属性信息的模型之间的全部范围。