In recent years, we have seen a handful of work on inference algorithms over non-stationary data streams. Given their flexibility, Bayesian non-parametric models are a good candidate for these scenarios. However, reliable streaming inference under the concept drift phenomenon is still an open problem for these models. In this work, we propose a variational inference algorithm for Dirichlet process mixture models. Our proposal deals with the concept drift by including an exponential forgetting over the prior global parameters. Our algorithm allows to adapt the learned model to the concept drifts automatically. We perform experiments in both synthetic and real data, showing that the proposed model is competitive with the state-of-the-art algorithms in the density estimation problem, and it outperforms them in the clustering problem.
翻译:近年来,我们看到了关于非静止数据流的推算法的少数工作。 由于其灵活性, 巴耶斯非参数模型是这些假设的好选择。 但是, 概念漂移现象下的可靠的流推论对于这些模型来说仍然是一个尚未解决的问题。 在这项工作中, 我们提议对Drichlet 工艺混合物模型采用变式推论法。 我们的提案涉及概念漂移, 将指数式的遗忘纳入先前的全球参数。 我们的算法允许自动调整所学模型以适应概念的漂移。 我们在合成和真实数据中进行实验, 显示拟议的模型与密度估计问题中的最新算法相比具有竞争力, 并在集群问题中优于这些算法。