剪切瓦西斯坦距离 (Augmented Sliced Wasserstein Distances)

While theoretically appealing, the application of the Wasserstein distance to large-scale machine learning problems has been hampered by its prohibitive computational cost. The sliced Wasserstein distance and its variants improve the computational efficiency through the random projection, yet they suffer from low accuracy if the number of projections is not sufficiently large, because the majority of projections result in trivially small values. In this work, we propose a new family of distance metrics, called augmented sliced Wasserstein distances (ASWDs), constructed by first mapping samples to higher-dimensional hypersurfaces parameterized by neural networks. It is derived from a key observation that (random) linear projections of samples residing on these hypersurfaces would translate to much more flexible nonlinear projections in the original sample space, so they can capture complex structures of the data distribution. We show that the hypersurfaces can be optimized by gradient ascent efficiently. We provide the condition under which the ASWD is a valid metric and show that this can be obtained by an injective neural network architecture. Numerical results demonstrate that the ASWD significantly outperforms other Wasserstein variants for both synthetic and real-world problems.

翻译：虽然在理论上具有吸引力,但瓦森斯坦距离对于大规模机器学习问题的应用却因其令人望而生畏的计算成本而受到了阻碍。切片瓦森斯坦距离及其变体通过随机预测提高了计算效率,但如果预测数量不够大,则其精确度较低,因为大多数预测都产生微不足道的数值。在这项工作中,我们提议建立一个新的距离度量组,称为扩大切片瓦森斯坦距离(ASWDs),由首次测绘样品到由神经网络参数测量的较高维度超表层。它源于一项关键观察,即对居住在这些超表层的样品的(随机)线性预测将转化为更灵活的原样空间的非线性预测,以便它们能够捕捉到数据分布的复杂结构。我们表明,高表层可以通过梯度有效优化。我们提供了一个条件,即ASWD是一种有效的衡量标准,并表明可以通过直射神经网络结构获得这一参数。数字结果表明,ASWD明显超出合成和实际世界问题的其他瓦列斯特斯坦变体。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【干货书】数据科学基础，429页pdf，Foundations of Data Science

专知会员服务

65+阅读 · 2021年8月11日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ACML2020】张量网络机器学习:最近的进展和前沿，109页ppt

专知会员服务

55+阅读 · 2020年12月15日