Extracting relevant urban patterns from multiple data sources can be difficult using classical clustering algorithms since we have to make a suitable setup of the hyperparameters of the algorithms and deal with outliers. It should be addressed correctly to help urban planners in the decision-making process for the further development of a big city. For instance, experts' main interest in criminology is comprehending the relationship between crimes and the socio-economic characteristics at specific georeferenced locations. In addition, the classical clustering algorithms take little notice of the intricate spatial correlations in georeferenced data sources. This paper presents a new approach to detecting the most relevant urban patterns from multiple data sources based on tensor decomposition. Compared to classical methods, the proposed approach's performance is attested to validate the identified patterns' quality. The result indicates that the approach can effectively identify functional patterns to characterize the data set for further analysis in achieving good clustering quality. Furthermore, we developed a generic framework named TensorAnalyzer, where the effectiveness and usefulness of the proposed methodology are tested by a set of experiments and a real-world case study showing the relationship between the crime events around schools and students performance and other variables involved in the analysis.
翻译:从多个数据来源提取相关的城市模式可能很难使用典型的组合算法,因为我们必须对算法的超参数进行适当的设置,并处理外部线,应当正确地加以处理,以帮助城市规划者为进一步发展大城市的决策进程提供帮助,例如,专家对犯罪学的主要兴趣在于了解犯罪与具体地理参照地点的社会经济特征之间的关系;此外,典型组合算法很少注意到地理参照数据来源中复杂的空间相关性。本文介绍了一种新办法,从基于高温分解的多种数据来源中探测最相关的城市模式。与典型方法相比,拟议方法的绩效被证明验证了所确定的模式的质量。结果显示,该方法能够有效地确定功能模式,用以确定数据集的特点,以便进一步分析良好的组合质量。此外,我们开发了一个名为TensorAnalyzer的通用框架,其中对拟议方法的有效性和实用性进行了一系列实验和真实世界案例研究,展示了学校周围犯罪事件与学生绩效和其他变量在分析中的关系。