We study the problem of estimating potential revenue or demand at business facilities and understanding its generating mechanism. This problem arises in different fields such as operation research or urban science, and more generally, it is crucial for businesses' planning and decision making. We develop a Bayesian spatial interaction model, henceforth BSIM, which provides probabilistic predictions about revenues generated by a particular business location provided their features and the potential customers' characteristics in a given region. BSIM explicitly accounts for the competition among the competitive facilities through a probability value determined by evaluating a store-specific Gaussian distribution at a given customer location. We propose a scalable variational inference framework that, while being significantly faster than competing Markov Chain Monte Carlo inference schemes, exhibits comparable performances in terms of parameters identification and uncertainty quantification. We demonstrate the benefits of BSIM in various synthetic settings characterised by an increasing number of stores and customers. Finally, we construct a real-world, large spatial dataset for pub activities in London, UK, which includes over 1,500 pubs and 150,000 customer regions. We demonstrate how BSIM outperforms competing approaches on this large dataset in terms of prediction performances while providing results that are both interpretable and consistent with related indicators observed for the London region.
翻译:我们研究估计商业设施潜在收入或需求的问题,并了解其产生机制。这个问题出现在经营研究或城市科学等不同领域,更一般地说,对于企业的规划和决策至关重要。我们开发了一个贝叶斯空间互动模型,从BSIM开始,该模型根据特定商业地点的特点和特定区域潜在客户的特点,对特定商业设施产生的收入作出概率预测。BSIM明确说明竞争设施之间的竞争,其概率值是通过评价特定商店特定高斯公司在特定客户地点的分销量来确定的。我们提出了一个可扩展的可变推论框架,这一框架虽然大大快于相互竞争的Markov链蒙特卡洛推论计划,但在参数确定和不确定性量化方面表现出可比的绩效。我们展示了BSIM在不同合成环境中的好处,其特点是商店和客户数量不断增加。最后,我们构建了一个真实世界,大型空间数据集用于英国伦敦的酒吧活动,其中包括1,500多家酒吧和150 000个客户区域。我们展示了BSIM如何在这种大型数据配置方面超越了与所观察到的伦敦相关指标相一致的做法。我们展示的是,同时提供一致的预测结果。