Neyman-Scott process (NSP) are point process models that generate clusters of points in time or space. They are natural models for a wide range of phenomena, ranging from neural spike trains to document streams. The clustering property is achieved via a doubly stochastic formulation: first, a set of latent events is drawn from a Poisson process; then, each latent event generates a set of observed data points according to another Poisson process. This construction is similar to Bayesian nonparametric mixture models like the Dirichlet process mixture model (DPMM) in that the number of latent events (i.e. clusters) is a random variable, but the point process formulation makes the NSP especially well suited to modeling spatiotemporal data. While many specialized algorithms have been developed for DPMMs, comparatively fewer works have focused on inference in NSPs. Here, we present novel connections between NSPs and DPMMs, with the key link being a third class of Bayesian mixture models called mixture of finite mixture models (MFMMs). Leveraging this connection, we adapt the standard collapsed Gibbs sampling algorithm for DPMMs to enable scalable Bayesian inference on NSP models. We demonstrate the potential of Neyman-Scott processes on a variety of applications including sequence detection in neural spike trains and event detection in document streams.
翻译:Neyman-Scott 进程(NSP) 是产生时间或空间点群集的点点进程模型。 它们是一系列现象的自然模型, 从神经钉钉列列列到文档流。 组合属性是通过一种双重的随机配方实现的: 首先, 一组潜在事件是从Poisson 进程中绘制的; 然后, 每个潜在事件都根据另一个 Poisson 进程生成一套观察到的数据点。 这个构造类似于 Bayesian 的非参数混合模型, 如 Drichlet 混合模型(DPMM ), 其潜在事件的数量( e. group) 是一个随机变量, 但点进程设计使 NSP 特别适合模拟时空数据 。 虽然已经为 DPMMs 开发了许多专门的算法, 但相对较少的工程侧重于 NSP 的推断。 在这里, 我们展示了 NSP 和 DPMM 之间的新的连接, 其关键链接是 一种叫做 Bayesian 文档流流的混合物模型 (MIMMs ) 混合 。 。 在这种连接中, 我们在 Bay- Clascolable rops 中, 我们在 Bastion imcal imcal imcal imcal 程序中将标准的测算算算为 DPMSP 以 程序显示了一种标准的 DPMMS 程序为, 。