Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this paper, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatial computer vision tasks. Tiles resulting from a raster representation of the earth's surface are modeled as nodes on a graph or pixels of an image. GPS trajectories are modeled as allowed Markovian paths on these nodes. A scalable and distributed algorithm is presented to compute image-like representations, called reachability summaries, of the spatial connectivity patterns between tiles and their neighbors implied by the observed Markovian paths. A convolutional, contractive autoencoder is trained to learn compressed representations, called reachability embeddings, of reachability summaries for every tile. Reachability embeddings serve as task-agnostic, feature representations of geographic locations. Using reachability embeddings as pixel representations for five different downstream geospatial tasks, cast as supervised semantic segmentation problems, we quantitatively demonstrate that reachability embeddings are semantically meaningful representations and result in 4-23% gain in performance, while using upto 67% less trajectory data, as measured using area under the precision-recall curve (AUPRC) metric, when compared to baseline models that use pixel representations that do not account for the spatial connectivity between tiles. Reachability embeddings transform sequential, spatiotemporal mobility data into semantically meaningful image-like representations that can be combined with other sources of imagery and are designed to facilitate multimodal learning in geospatial computer vision.
翻译:自我监督的代表学习技巧使用没有语义说明的大型数据集来学习有意义的通用特征,这些特征可以方便地传输,以解决一系列下游监管任务。在本文中,我们建议一种自监督的方法,从未贴标签的全球定位系统轨迹中学习地理空间计算机下游视觉任务中各地理位置的示意图。地球表面的极光显示产生的示意图是模拟成图或图像像素上的节点。全球定位系统的轨迹可以建模,这些节点上允许的Markovian路径是有意义的通用特征。一个可缩放和分布的算法可以用来计算像图像一样的表达方式,称为可伸缩缩缩缩缩缩缩缩缩缩缩缩缩缩缩图,用来学习压缩表达,称为可伸缩缩缩缩缩缩缩缩缩缩图,作为每个图的缩缩缩缩缩缩缩缩缩缩缩缩缩图,用于在5个下直缩缩缩缩缩缩缩缩缩缩略图中进行缩缩略图,作为缩缩略图的缩图,作为缩缩略图,作为缩缩缩缩缩图,用于缩略图的缩缩缩缩缩缩缩缩略图。