Submodular function maximization is central in numerous data science applications, including data summarization, influence maximization, and recommendation. In many of these problems, our goal is to find a solution that maximizes the \emph{average} of the utilities for all users, each measured by a monotone submodular function. When the population of users is composed of several demographic groups, another critical problem is whether the utility is fairly distributed across groups. In the context of submodular optimization, we seek to improve the welfare of the \emph{least well-off} group, i.e., to maximize the minimum utility for any group, to ensure fairness. Although the \emph{utility} and \emph{fairness} objectives are both desirable, they might contradict each other, and, to our knowledge, little attention has been paid to optimizing them jointly. In this paper, we propose a novel problem called \emph{Bicriteria Submodular Maximization} (BSM) to strike a balance between utility and fairness. Specifically, it requires finding a fixed-size solution to maximize the utility function, subject to the value of the fairness function not being below a threshold. Since BSM is inapproximable within any constant factor in general, we propose efficient data-dependent approximation algorithms for BSM by converting it into other submodular optimization problems and utilizing existing algorithms for the converted problems to obtain solutions to BSM. Using real-world and synthetic datasets, we showcase applications of our framework in three submodular maximization problems, namely maximum coverage, influence maximization, and facility location.
翻译:子模块函数最大化是众多数据科学应用的核心, 包括数据总和、 影响最大化以及建议。 在很多这些问题中, 我们的目标是找到一个解决方案, 使所有用户的公用设施最大化, 每个用户都是以单调子模块函数衡量的。 当用户人口由几个人口组组成时, 另一个关键问题是该工具是否在各组间公平分配。 在亚模块优化的背景下, 我们寻求改善 \ emph{ 最小端端端点 组的福利, 也就是说, 最大限度地提高任何组的最小效用, 以确保公平性。 尽管 \ emph{ umplity} 和\ emph{ / 平均} 为所有用户找到一个解决方案, 使所有用户的公用设施最大化 \ emph{ / 平均端点 。 当用户的用户群群群群群群中, 我们提出一个新的问题, 叫做 emph{ 标准 子模块最大化, 以平衡 。 具体来说, 它需要找到一个固定的解决方案, 使任何组群群的最大用途解决方案, 确保最高值 的用户端端端值 的 将数据转换为 Blorimalalalal 值 值 。