Current artificial neural networks mainly conduct the learning process in the spatial domain but neglect the frequency domain learning. However, the learning course performed in the frequency domain can be more efficient than that in the spatial domain. In this paper, we fully explore frequency domain learning and propose a joint learning paradigm of frequency and spatial domains. This paradigm can take full advantage of the preponderances of frequency learning and spatial learning; specifically, frequency and spatial domain learning can effectively capture global and local information, respectively. Exhaustive experiments on two dense prediction tasks, i.e., self-supervised depth estimation and semantic segmentation, demonstrate that the proposed joint learning paradigm can 1) achieve performance competitive to those of state-of-the-art methods in both depth estimation and semantic segmentation tasks, even without pretraining; and 2) significantly reduce the number of parameters compared to other state-of-the-art methods, which provides more chance to develop real-world applications. We hope that the proposed method can encourage more research in cross-domain learning.
翻译:目前人工神经网络主要在空间领域开展学习过程,但忽略了频域学习。然而,在频率领域开展的学习课程可能比空间领域的效率更高。在本文件中,我们充分探索频域学习,并提议一个频率和空间领域的联合学习模式。这一模式可以充分利用频率学习和空间学习的优势;具体地说,频率和空间领域学习可以分别有效地捕捉全球和地方信息。关于两种密集的预测任务,即自我监督的深度估计和语义分化的抽查实验表明,拟议的联合学习模式可以(1) 在深度估计和语义分解任务方面,即使没有培训前,都能实现与最先进的方法的竞争性;(2) 与其他最先进的方法相比,参数数量大为减少,这为开发现实世界应用提供了更多的机会。我们希望,拟议的方法可以鼓励在跨领域学习方面进行更多的研究。