The movements of individuals within and among cities influence key aspects of our society, such as the objective and subjective well-being, the diffusion of innovations, the spreading of epidemics, and the quality of the environment. For this reason, there is increasing interest around the challenging problem of flow generation, which consists in generating the flows between a set of geographic locations, given the characteristics of the locations and without any information about the real flows. Existing solutions to flow generation are mainly based on mechanistic approaches, such as the gravity model and the radiation model, which suffer from underfitting and overdispersion, neglect important variables such as land use and the transportation network, and cannot describe non-linear relationships between these variables. In this paper, we propose the Multi-Feature Deep Gravity (MFDG) model as an effective solution to flow generation. On the one hand, the MFDG model exploits a large number of variables (e.g., characteristics of land use and the road network; transport, food, and health facilities) extracted from voluntary geographic information data (OpenStreetMap). On the other hand, our model exploits deep neural networks to describe complex non-linear relationships between those variables. Our experiments, conducted on commuting flows in England, show that the MFDG model achieves a significant increase in the performance (up to 250\% for highly populated areas) than mechanistic models that do not use deep neural networks, or that do not exploit geographic voluntary data. Our work presents a precise definition of the flow generation problem, which is a novel task for the deep learning community working with spatio-temporal data, and proposes a deep neural network model that significantly outperforms current state-of-the-art statistical models.
翻译:城市内部和城市之间的个人流动影响着我们社会的关键方面,如深度和主观福祉、创新的传播、流行病的传播、环境质量等,因此,人们对具有挑战性的流动生成问题越来越感兴趣,因为考虑到不同地点的特点,这种流动在一系列地理位置之间产生流动,而没有关于实际流动的任何信息。现有的流动生成解决方案主要基于机械化方法,如重力模型和辐射模型,这些模型因不完善和过度分散而受到损害,忽视了诸如土地使用和运输网络等重要变量,无法描述这些变量之间的非线性关系。在本文中,我们建议多功能深重力发电模式(MFDG)模式是流动的有效解决办法。一方面,MFDG模型利用大量变量(例如土地使用和道路网络的特点;运输、食品和卫生设施),这些变量来自自愿地理信息数据数据数据数据(Open Streport-reporterMex),这些模型利用了深度的当前网络来描述复杂的地理动态,而我们的数据(Orental-deal-al-modeal-mode) 数据(Oral-moal-modeal-modeal-modeal-modeal-modeal-modeal-mode-mode) romodeal-moud romode-mode-mocal-mocal-mocal-mocal-moudal-moudal-moud-moudal-moudal-mode-mode-mode-mode-modal-modal-mocal-momomodal-modisal-mocal-mocal-modemodismodisal-modismodismodismode-mode-mods-mode-mode-modal-modisal-mod-mod-mod-modal-mod-modal-modal-modal-modal-modal-modal-momodal-momod-mod-mod-mod-mod-mod-momod-momomomomomomomomomomomomomomomomod-mod-mod-mode-momomode-momod-mo