应用机器学习的子模块组合信息措施 (Submodular Combinatorial Information Measures with Applications in Machine Learning)

Information-theoretic quantities like entropy and mutual information have found numerous uses in machine learning. It is well known that there is a strong connection between these entropic quantities and submodularity since entropy over a set of random variables is submodular. In this paper, we study combinatorial information measures that generalize independence, (conditional) entropy, (conditional) mutual information, and total correlation defined over sets of (not necessarily random) variables. These measures strictly generalize the corresponding entropic measures since they are all parameterized via submodular functions that themselves strictly generalize entropy. Critically, we show that, unlike entropic mutual information in general, the submodular mutual information is actually submodular in one argument, holding the other fixed, for a large class of submodular functions whose third-order partial derivatives satisfy a non-negativity property. This turns out to include a number of practically useful cases such as the facility location and set-cover functions. We study specific instantiations of the submodular information measures on these, as well as the probabilistic coverage, graph-cut, and saturated coverage functions, and see that they all have mathematically intuitive and practically useful expressions. Regarding applications, we connect the maximization of submodular (conditional) mutual information to problems such as mutual-information-based, query-based, and privacy-preserving summarization -- and we connect optimizing the multi-set submodular mutual information to clustering and robust partitioning.

翻译：在机器学习中,人们都知道,这些微量和亚模式性之间有着紧密的联系,因为对于一组随机变数,这些微量和亚模式性是亚式的。在本文中,我们研究组合信息措施,以概括性、(有条件)增量、(有条件)互通性、(有条件)互通性等信息,以及一系列(不一定随机的)变数所定义的总相关性。这些措施严格概括对应的微量测量,因为它们都是通过子模块性功能来参数化的,而这种功能本身又严格地普遍地概括。关键地说,我们表明,与一般的共量性相互信息不同,子模块性互通信息实际上在一种参数中是子模块化的,而另一个则是固定的,用于一大批子模块性功能,其三阶部分衍生物部分衍生物满足非强化性属性。这包括一些实际有用的案例,如设施定位和设定覆盖功能等。我们研究这些子模块性信息测量措施的具体即时值,以及它们之间的精确性、多版本性数据覆盖和精确性亚化亚化亚化数据,这些功能的连接性、直径化、直径化、直径化、直径化、直径化、直径化的基化、直径化、直径化、直径化、直径化、直径化的亚化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直化、自、自、直化、直径化、直径化、直图和同步性、直径化的亚化、直径化、直径化、直径化、直径化、直图和同步的亚化的基的基的基的基的基的基的基的基的基的基的基的基的基的基的基的基的基的基的基、直对式、直系、直系、直径化、直系、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、直径化、