蝴蝶花:用蝴蝶矩阵建造可垂直图层 (ButterflyFlow: Building Invertible Layers with Butterfly Matrices)

Normalizing flows model complex probability distributions using maps obtained by composing invertible layers. Special linear layers such as masked and 1x1 convolutions play a key role in existing architectures because they increase expressive power while having tractable Jacobians and inverses. We propose a new family of invertible linear layers based on butterfly layers, which are known to theoretically capture complex linear structures including permutations and periodicity, yet can be inverted efficiently. This representational power is a key advantage of our approach, as such structures are common in many real-world datasets. Based on our invertible butterfly layers, we construct a new class of normalizing flow models called ButterflyFlow. Empirically, we demonstrate that ButterflyFlows not only achieve strong density estimation results on natural images such as MNIST, CIFAR-10, and ImageNet 32x32, but also obtain significantly better log-likelihoods on structured datasets such as galaxy images and MIMIC-III patient cohorts -- all while being more efficient in terms of memory and computation than relevant baselines.

翻译：利用由不可逆层组成的地图实现流流模型复杂概率分布的正常化。特殊线性层,例如蒙面和1x1卷变,在现有结构中发挥着关键作用,因为它们在增加显像力的同时,又具有可移动的雅各和反射力。我们建议以蝴蝶层为基础,形成一个新的不可逆线性层,这些层在理论上可以捕捉复杂的线性结构,包括变异和周期性,但也可以有效地反转。这种代表力是我们方法的主要优势,因为这种结构在许多真实世界的数据集中是常见的。根据我们不可逆的蝴蝶层,我们建造了一个新的正常流模型类别,叫做“蝴蝶花花”。我们巧妙地证明,蝴蝶Flows不仅在诸如MMIST、CIFAR-10和图像网络32x32等自然图像上取得了强烈的密度估计结果,而且还在诸如银河系图像和MIMIC-III病人群等结构数据集上获得了显著更好的日比值。在记忆和计算方面比相关基线更有效率。