Influence maximization is the task of selecting a small number of seed nodes in a social network to maximize the spread of the influence from these seeds, and it has been widely investigated in the past two decades. In the canonical setting, the whole social network as well as its diffusion parameters is given as input. In this paper, we consider the more realistic sampling setting where the network is unknown and we only have a set of passively observed cascades that record the set of activated nodes at each diffusion step. We study the task of influence maximization from these cascade samples (IMS), and present constant approximation algorithms for this task under mild conditions on the seed set distribution. To achieve the optimization goal, we also provide a novel solution to the network inference problem, that is, learning diffusion parameters and the network structure from the cascade data. Comparing with prior solutions, our network inference algorithm requires weaker assumptions and does not rely on maximum-likelihood estimation and convex programming. Our IMS algorithms enhance the learning-and-then-optimization approach by allowing a constant approximation ratio even when the diffusion parameters are hard to learn, and we do not need any assumption related to the network structure or diffusion parameters.
翻译:影响最大化的任务是在社会网络中选择少量种子节点,以最大限度地扩大这些种子的影响的传播。 在过去二十年中,已经对此进行了广泛调查。在典型的环境中,整个社会网络及其传播参数被作为投入。在本文中,我们考虑的是网络未知的更现实的抽样环境,我们只有一套被动观察的级联,记录每个传播步骤的一组激活节点。我们研究的是从这些级联样本(IMS)中影响最大化的任务,并在种子集分布的温和条件下为这一任务提出恒定的近似算法。为了实现优化目标,我们还从级联数据中为网络推论问题提供了新的解决办法,即学习扩散参数和网络结构。与先前的解决办法相比,我们的网络推论算法需要较弱的假设,并不依赖最大相似的估算和convex编程。我们的IMS算法通过允许一个恒定的近比方法,即使传播参数难以学习,我们也不需要任何网络参数。