The recent emergence of new algorithms for permuting models into functionally equivalent regions of the solution space has shed some light on the complexity of error surfaces, and some promising properties like mode connectivity. However, finding the right permutation is challenging, and current optimization techniques are not differentiable, which makes it difficult to integrate into a gradient-based optimization, and often leads to sub-optimal solutions. In this paper, we propose a Sinkhorn re-basin network with the ability to obtain the transportation plan that better suits a given objective. Unlike the current state-of-art, our method is differentiable and, therefore, easy to adapt to any task within the deep learning domain. Furthermore, we show the advantage of our re-basin method by proposing a new cost function that allows performing incremental learning by exploiting the linear mode connectivity property. The benefit of our method is compared against similar approaches from the literature, under several conditions for both optimal transport finding and linear mode connectivity. The effectiveness of our continual learning method based on re-basin is also shown for several common benchmark datasets, providing experimental results that are competitive with state-of-art results from the literature.
翻译:最近出现了将模型转换成功能等同的解决方案空间区域的新算法,这在一定程度上揭示了错误表面的复杂性,以及一些有希望的特性,如模式连接。然而,找到正确的变异是富有挑战性的,目前的优化技术是无法区分的,这使得难以融入基于梯度的优化,并常常导致亚优的解决方案。在本文件中,我们建议建立一个Sinkhorn再定位网络,该网络有能力获得更适合特定目标的运输计划。与目前的先进技术不同,我们的方法是不同的,因此很容易适应深层学习领域的任何任务。此外,我们通过提出新的成本功能,表明我们的再定位方法的优势,通过利用线性模式连接属性进行递增学习。我们的方法的好处与文献中的类似方法相比,在最佳运输发现和线性模式连接的若干条件下,我们基于再定位的持续学习方法的有效性也体现在几个共同的基准数据集上,提供了实验结果,与文献中的状态结果具有竞争力。