We study the problem of estimating latent population flows from aggregated count data. This problem arises when individual trajectories are not available due to privacy issues or measurement fidelity. Instead, the aggregated observations are measured over discrete-time points, for estimating the population flows among states. Most related studies tackle the problems by learning the transition parameters of a time-homogeneous Markov process. Nonetheless, most real-world population flows can be influenced by various uncertainties such as traffic jam and weather conditions. Thus, in many cases, a time-homogeneous Markov model is a poor approximation of the much more complex population flows. To circumvent this difficulty, we resort to a multi-marginal optimal transport (MOT) formulation that can naturally represent aggregated observations with constrained marginals, and encode time-dependent transition matrices by the cost functions. In particular, we propose to estimate the transition flows from aggregated data by learning the cost functions of the MOT framework, which enables us to capture time-varying dynamic patterns. The experiments demonstrate the improved accuracy of the proposed algorithms than the related methods in estimating several real-world transition flows.
翻译:我们研究从综合计数数据中估计潜在人口流动的问题。当个人轨道由于隐私问题或测量忠诚性而无法利用时,就会产生这一问题。相反,为了估计国家间的人口流动,对综合观测进行分时间点测量,以估计国家间的人口流动。大多数相关研究通过学习时间-时间-均匀的Markov过程的过渡参数来解决问题。然而,大多数真实世界人口流动可能受到交通堵塞和天气条件等各种不确定性的影响。因此,在许多情况下,时间-均匀的Markov模型是复杂得多的人口流动的近似差。为避免这一困难,我们采用多边际最佳运输(MOT)公式,这可以自然地代表受限制边缘因素的汇总观测结果,并根据成本函数对取决于时间的过渡矩阵进行编码。我们特别建议,通过学习MOT框架的成本功能,从汇总数据中估算转型流,从而使我们能够捕捉时间变化的动态模式。实验表明,拟议的算法比估算几个真实世界转型流动的相关方法更加精确。