用于Manidew 学习的矩形流 (Rectangular Flows for Manifold Learning)

Normalizing flows are invertible neural networks with tractable change-of-volume terms, which allows optimization of their parameters to be efficiently performed via maximum likelihood. However, data of interest is typically assumed to live in some (often unknown) low-dimensional manifold embedded in high-dimensional ambient space. The result is a modelling mismatch since -- by construction -- the invertibility requirement implies high-dimensional support of the learned distribution. Injective flows, mapping from low- to high-dimensional space, aim to fix this discrepancy by learning distributions on manifolds, but the resulting volume-change term becomes more challenging to evaluate. Current approaches either avoid computing this term entirely using various heuristics, or assume the manifold is known beforehand and therefore are not widely applicable. Instead, we propose two methods to tractably calculate the gradient of this term with respect to the parameters of the model, relying on careful use of automatic differentiation and techniques from numerical linear algebra. Both approaches perform end-to-end nonlinear manifold learning and density estimation for data projected onto this manifold. We study the trade-offs between our proposed methods, empirically verify that we outperform approaches ignoring the volume-change term by more accurately learning manifolds and the corresponding distributions on them, and show promising results on out-of-distribution detection.

翻译：流的正常化是不可逆的神经网络,具有可移动的体积变化条件,这使得能够以最大的可能性有效地实现参数的优化。然而,人们通常认为,感兴趣的数据存在于高维环境空间内的一些(通常不为人知的)低维元体中,结果是建模不匹配,因为通过建筑,不可逆要求意味着对所学分布的高度支持。从低维空间到高维空间的预测,目的是通过在多元上学习分布来纠正这一差异,但由此产生的量变化术语更难评估。目前的方法要么是避免完全使用各种超常方法计算这一术语,要么是假设该元是事先已知的,因此不广泛适用。相反,我们建议了两种方法,可以很容易地计算该术语相对于模型参数的梯度,依靠谨慎地使用数字线性升格的自动区分和技术。两种方法都通过对预测到这一多元数据进行端到端的非线性多重学习和密度估计,目的是要对数据进行权衡。我们研究我们提出的方法之间的权衡,从实验性角度核查我们所走的公式,我们是如何超越了方向,以更准确的公式式的方法来显示,从而显示对量变化的结果。