Obtaining a dynamic population distribution is key to many decision-making processes such as urban planning, disaster management and most importantly helping the government to better allocate socio-technical supply. For the aspiration of these objectives, good population data is essential. The traditional method of collecting population data through the census is expensive and tedious. In recent years, machine learning methods have been developed to estimate the population distribution. Most of the methods use data sets that are either developed on a small scale or not publicly available yet. Thus, the development and evaluation of the new methods become challenging. We fill this gap by providing a comprehensive data set for population estimation in 98 European cities. The data set comprises digital elevation model, local climate zone, land use classifications, nighttime lights in combination with multi-spectral Sentinel-2 imagery, and data from the Open Street Map initiative. We anticipate that it would be a valuable addition to the research community for the development of sophisticated machine learning-based approaches in the field of population estimation.
翻译:人口动态分布是城市规划、灾害管理等许多决策进程的关键,最重要的是帮助政府更好地分配社会技术供应。为了实现这些目标,良好的人口数据至关重要。通过人口普查收集人口数据的传统方法既昂贵又乏味。近年来,已经开发了机器学习方法来估计人口分布。大多数方法使用小规模开发或尚未公开的数据集。因此,新方法的开发和评价变得具有挑战性。我们为98个欧洲城市的人口估计提供了一套综合数据,以弥补这一差距。数据集包括数字升降模型、地方气候区、土地使用分类、夜间光与多光谱Sentinel-2图像相结合,以及开放街道地图倡议的数据。我们预计,这对于研究界在人口估计领域开发精密的机器学习方法将是一个宝贵的补充。