Global pooling is one of the most significant operations in many machine learning models and tasks, which works for information fusion and structured data (like sets and graphs) representation. However, without solid mathematical fundamentals, its practical implementations often depend on empirical mechanisms and thus lead to sub-optimal, even unsatisfactory performance. In this work, we develop a novel and generalized global pooling framework through the lens of optimal transport. The proposed framework is interpretable from the perspective of expectation-maximization. Essentially, it aims at learning an optimal transport across sample indices and feature dimensions, making the corresponding pooling operation maximize the conditional expectation of input data. We demonstrate that most existing pooling methods are equivalent to solving a regularized optimal transport (ROT) problem with different specializations, and more sophisticated pooling operations can be implemented by hierarchically solving multiple ROT problems. Making the parameters of the ROT problem learnable, we develop a family of regularized optimal transport pooling (ROTP) layers. We implement the ROTP layers as a new kind of deep implicit layer. Their model architectures correspond to different optimization algorithms. We test our ROTP layers in several representative set-level machine learning scenarios, including multi-instance learning (MIL), graph classification, graph set representation, and image classification. Experimental results show that applying our ROTP layers can reduce the difficulty of the design and selection of global pooling -- our ROTP layers may either imitate some existing global pooling methods or lead to some new pooling layers fitting data better. The code is available at \url{https://github.com/SDS-Lab/ROT-Pooling}.
翻译:全球集合是许多机器学习模式和任务中最重要的行动之一,它用于信息聚合和结构化数据(如数据集和图表)代表,然而,如果没有坚实的数学基础,其实际实施往往取决于经验机制,从而导致业绩不尽理想,甚至不能令人满意。在这项工作中,我们通过最佳运输的视角,开发了一个新颖和普遍的全球集合框架。从期望最大化的角度,可以解释拟议的框架。基本上,它旨在学习一种最佳运输方式,跨越抽样指数和特征层面,使相应的集合行动最大限度地达到投入数据的有条件期望。我们证明,大多数现有的集合方法都相当于解决不同专业的正规化最佳运输(ROT)问题,而更为复杂的集合行动则可以通过分级解决多种ROT问题来实施。我们从最佳运输问题的角度,我们开发了一套正规化的最佳运输集合(ROTP)层。我们把ROTP层次作为新的深度隐含层,它们的模型结构与不同的优化算法一致。我们用几个具有代表性的ROTP层次 -- 采用具有代表性的成熟的成熟的IM-TP水平的IMA标准, 并且用多层次的模型来显示我们现有图表结构结构的分类。