Analysis of overhead imagery using computer vision is a problem that has received considerable attention in academic literature. Most techniques that operate in this space are both highly specialised and require expensive manual annotation of large datasets. These problems are addressed here through the development of a more generic framework, incorporating advances in representation learning which allows for more flexibility in analysing new categories of imagery with limited labeled data. First, a robust representation of an unlabeled aerial imagery dataset was created based on the momentum contrast mechanism. This was subsequently specialised for different tasks by building accurate classifiers with as few as 200 labeled images. The successful low-level detection of urban infrastructure evolution over a 10-year period from 60 million unlabeled images, exemplifies the substantial potential of our approach to advance quantitative urban research.
翻译:利用计算机图像对高空图像进行分析是一个在学术文献中受到相当重视的问题,在这一空间运作的多数技术都是高度专业化的,需要花费昂贵的人工说明大型数据集,这些问题在这里通过制定更通用的框架加以解决,其中纳入代表性学习的进展,从而在分析带有有限标签数据的新类型的图像时能够有更大的灵活性。首先,根据势头对比机制创建了未贴标签的航空图像数据集的有力代表性,随后通过建立只有200个标签图像的准确分类器,专门处理不同的任务。 成功地从6 000万个无标签图像中对城市基础设施10年期的演变进行了低水平的检测,体现了我们推进数量城市研究的方法的巨大潜力。