整个城市:拥有多层次和多视角图像的全能城市理解 (OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images)

This paper presents OmniCity, a new dataset for omnipotent city understanding from multi-level and multi-view images. More precisely, the OmniCity contains multi-view satellite images as well as street-level panorama and mono-view images, constituting over 100K pixel-wise annotated images that are well-aligned and collected from 25K geo-locations in New York City. To alleviate the substantial pixel-wise annotation efforts, we propose an efficient street-view image annotation pipeline that leverages the existing label maps of satellite view and the transformation relations between different views (satellite, panorama, and mono-view). With the new OmniCity dataset, we provide benchmarks for a variety of tasks including building footprint extraction, height estimation, and building plane/instance/fine-grained segmentation. We also analyze the impact of view on each task, the performance of different models, limitations of existing methods, etc. Compared with the existing multi-level and multi-view benchmarks, our OmniCity contains a larger number of images with richer annotation types and more views, provides more baseline results obtained from state-of-the-art models, and introduces a novel task for fine-grained building instance segmentation on street-level panorama images. Moreover, OmniCity provides new problem settings for existing tasks, such as cross-view image matching, synthesis, segmentation, detection, etc., and facilitates the developing of new methods for large-scale city understanding, reconstruction, and simulation. The OmniCity dataset as well as the benchmarks will be available at https://city-super.github.io/omnicity.

翻译：本文展示了OmniCity, 这是一种从多层次和多视图图像中获取全能城市理解的新数据集。更准确地说, OmniCity包含多视角卫星图像以及街头全景和单视图图像,构成100K像el-wi的附加说明图像,这些图像与纽约市25K地理位置相匹配并收集。为了减轻大量的像素-智慧说明努力,我们建议建立一个高效的街景图像注释管道,利用卫星视图和不同观点(卫星、全景和单视图)之间变异的标签地图(卫星、全景和单视图)的高级标签图象。有了新的OmniCity数据集,我们为各种任务提供了基准,包括建立足迹提取、高度估计和建平/ Instances/fine-graide 分割。我们还分析了对每项任务的影响、不同模型的性能、现有方法的局限性等等。与现有的多层次和多视角基准相比, 我们的OmniCity包含更多图像数量, 以及更富有的剖面图解的图像, 以及更先进的图像类型和更新版本任务。