The exploitation of visible spectrum datasets has led deep networks to show remarkable success. However, real-world tasks include low-lighting conditions which arise performance bottlenecks for models trained on large-scale RGB image datasets. Thermal IR cameras are more robust against such conditions. Therefore, the usage of thermal imagery in real-world applications can be useful. Unsupervised domain adaptation (UDA) allows transferring information from a source domain to a fully unlabeled target domain. Despite substantial improvements in UDA, the performance gap between UDA and its supervised learning counterpart remains significant. By picking a small number of target samples to annotate and using them in training, active domain adaptation tries to mitigate this gap with minimum annotation expense. We propose an active domain adaptation method in order to examine the efficiency of combining the visible spectrum and thermal imagery modalities. When the domain gap is considerably large as in the visible-to-thermal task, we may conclude that the methods without explicit domain alignment cannot achieve their full potential. To this end, we propose a spectral transfer guided active domain adaptation method to select the most informative unlabeled target samples while aligning source and target domains. We used the large-scale visible spectrum dataset MS-COCO as the source domain and the thermal dataset FLIR ADAS as the target domain to present the results of our method. Extensive experimental evaluation demonstrates that our proposed method outperforms the state-of-the-art active domain adaptation methods. The code and models are publicly available.
翻译:摘要:可见光谱数据集的开发已经导致深度网络表现出了显著的成功。然而,真实世界的任务包括低光照环境,这会对在大规模RGB图像数据集上训练的模型产生性能瓶颈。热红外相机对这种情况更加稳健,因此在真实应用中使用热成像技术可以非常有用。无监督领域自适应(UDA)允许将信息从源领域转移到完全未标记的目标领域。尽管在UDA方面取得了实质性的改进,但是UDA与其监督学习的对应物之间的性能差距仍然很大。通过选择少量的目标样本进行注释并在训练中使用它们,主动领域自适应试图以最小的注释费用解决这个问题。我们提出了一种主动领域自适应方法,以检验组合可见光谱和热成像模态的效率。当领域差距非常大,如可见光到热成像任务时,我们可以得出结论,没有明确的领域对齐方法无法发挥其全部潜力。为此,我们提出了一种基于频谱转换引导的主动领域自适应方法,在对齐源领域和目标领域的同时选择最具信息量的未标记目标样本。我们将大规模的可见光数据集MS-COCO作为源域,将热红外数据集FLIR ADAS作为目标域来展示我们方法的结果。广泛的实验评估表明,我们提出的方法优于现有的主动领域自适应方法。代码和模型已公开提供。