Deep learning has become a powerful tool for Mars exploration. Mars terrain segmentation is an important Martian vision task, which is the base of rover autonomous planning and safe driving. However, existing deep-learning-based terrain segmentation methods face two problems: one is the lack of sufficient detailed and high-confidence annotations, and the other is the over-reliance of models on annotated training data. In this paper, we address these two problems from the perspective of joint data and method design. We first present a new Mars terrain segmentation dataset which contains 6K high-resolution images and is sparsely annotated based on confidence, ensuring the high quality of labels. Then to learn from this sparse data, we propose a representation-learning-based framework for Mars terrain segmentation, including a self-supervised learning stage (for pre-training) and a semi-supervised learning stage (for fine-tuning). Specifically, for self-supervised learning, we design a multi-task mechanism based on the masked image modeling (MIM) concept to emphasize the texture information of images. For semi-supervised learning, since our dataset is sparsely annotated, we encourage the model to excavate the information of unlabeled area in each image by generating and utilizing pseudo-labels online. We name our dataset and method Self-Supervised and Semi-Supervised Segmentation for Mars (S$^{5}$Mars). Experimental results show that our method can outperform state-of-the-art approaches and improve terrain segmentation performance by a large margin.
翻译:深层学习已成为火星探索的强大工具。火星地形分割是一个重要的火星视野任务,它是漫游自主规划和安全驾驶的基础。然而,现有的深层学习地形分割方法面临两个问题:一个是缺乏足够详细和高度自信的描述,另一个是模型过分依赖附加说明的培训数据。在本文中,我们从联合数据和方法设计的角度处理这两个问题。我们首先展示一个新的火星地形分割数据集,其中包含6K高分辨率图像,并且基于信心,确保高质量标签质量,对此进行少许附加说明。然后从这一稀少的数据中学习,我们提议一个基于代表性学习的火星地形分割框架,包括一个自我监督的学习阶段(用于培训前)和一个半监督的学习阶段(用于微调)。具体地说,为了自我监督学习,我们设计了一个基于掩蔽图像模型模型(MIM)概念的多任务机制,以强调图像的文本信息。对于半监督的学习来说,由于我们的数据分类是粗略的,因此,我们用我们的数据标记的模型和正值区域来显示我们的一种内部的标签,我们用一个标记的方式,用我们的数据显示一个标记的自我标记的标签区域,用一个标记和正标的自我标记来显示我们的数据。