Deep learning based fusion methods have been achieving promising performance in image fusion tasks. This is attributed to the network architecture that plays a very important role in the fusion process. However, in general, it is hard to specify a good fusion architecture, and consequently, the design of fusion networks is still a black art, rather than science. To address this problem, we formulate the fusion task mathematically, and establish a connection between its optimal solution and the network architecture that can implement it. This approach leads to a novel method proposed in the paper of constructing a lightweight fusion network. It avoids the time-consuming empirical network design by a trial-and-test strategy. In particular we adopt a learnable representation approach to the fusion task, in which the construction of the fusion network architecture is guided by the optimisation algorithm producing the learnable model. The low-rank representation (LRR) objective is the foundation of our learnable model. The matrix multiplications, which are at the heart of the solution are transformed into convolutional operations, and the iterative process of optimisation is replaced by a special feed-forward network. Based on this novel network architecture, an end-to-end lightweight fusion network is constructed to fuse infrared and visible light images. Its successful training is facilitated by a detail-to-semantic information loss function proposed to preserve the image details and to enhance the salient features of the source images. Our experiments show that the proposed fusion network exhibits better fusion performance than the state-of-the-art fusion methods on public datasets. Interestingly, our network requires a fewer training parameters than other existing methods.
翻译:深度学习融合方法在图像融合任务中取得了良好的性能,这是由于网络架构在融合过程中起着非常重要的作用。然而,通常很难指定一个好的融合架构,因此,融合网络的设计仍然是黑箱艺术而不是科学。为了解决这个问题,我们从数学上阐述了融合任务,并建立了其最优解与可以实现它的网络架构之间的关系。这种方法导致了一种新的方法,即通过优化算法产生可学习模型来指导融合任务的轻量级融合网络的构建。低秩表示(LRR)目标是我们可学习模型的基础。解决方案的矩阵乘法中心被转换为卷积操作,优化的迭代过程被一个特殊的前向网络取代。基于这种新颖的网络架构,构建了一个端到端的轻量级融合网络,用于融合红外和可见光图像。我们提出了一种细节到语义的信息损失函数,以保留图像细节和增强源图像的显著特征,从而促进了网络的成功训练。我们的实验表明,所提出的融合网络在公共数据集上的融合性能优于现有的融合方法。有趣的是,我们的网络需要比其他现有方法更少的训练参数。