We study the problem of identifying the source of a stochastic diffusion process spreading on a graph based on the arrival times of the diffusion at a few queried nodes. In a graph $G=(V,E)$, an unknown source node $v^* \in V$ is drawn uniformly at random, and unknown edge weights $w(e)$ for $e\in E$, representing the propagation delays along the edges, are drawn independently from a Gaussian distribution of mean $1$ and variance $\sigma^2$. An algorithm then attempts to identify $v^*$ by querying nodes $q \in V$ and being told the length of the shortest path between $q$ and $v^*$ in graph $G$ weighted by $w$. We consider two settings: non-adaptive, in which all query nodes must be decided in advance, and adaptive, in which each query can depend on the results of the previous ones. Both settings are motivated by an application of the problem to epidemic processes (where the source is called patient zero), which we discuss in detail. We characterize the query complexity when $G$ is an $n$-node path. In the non-adaptive setting, $\Theta(n\sigma^2)$ queries are needed for $\sigma^2 \leq 1$, and $\Theta(n)$ for $\sigma^2 \geq 1$. In the adaptive setting, somewhat surprisingly, only $\Theta(\log\log_{1/\sigma}n)$ are needed when $\sigma^2 \leq 1/2$, and $\Theta(\log \log n)+O_\sigma(1)$ when $\sigma^2 \geq 1/2$. This is the first mathematical study of source identification with time queries in a non-deterministic diffusion process.
翻译:我们研究一个问题, 如何根据在几个问题节点上扩散的到达时间在图表中找到一个传播过程的源头。 在一个 $G=( V, E) 的图形中, 一个未知源点点点点是随机的, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计算, 以美元计, 以美元计, 以美元计, 以美元计 。 我们考虑两种设置: 非适应性, 所有查询节点都必须预先决定, 并且适应性, 在每种查询都取决于前一次的输出结果。 两种设置都由对流行病过程( 源被称为 耐心零) 的应用程序驱动, 我们对此进行详细讨论。 当需要 $G $ 的路径是 美元, 确定 复杂性是 美元 。