具有理论保证的多目标进化等级的多样化 (Result Diversification by Multi-objective Evolutionary Algorithms with Theoretical Guarantees)

Given a ground set of items, the result diversification problem aims to select a subset with high "quality" and "diversity" while satisfying some constraints. It arises in various real-world artificial intelligence applications, such as web-based search, document summarization and feature selection, and also has applications in other areas, e.g., computational geometry, databases, finance and operations research. Previous algorithms are mainly based on greedy or local search. In this paper, we propose to reformulate the result diversification problem as a bi-objective maximization problem, and solve it by a multi-objective evolutionary algorithm (EA), i.e., the GSEMO. We theoretically prove that the GSEMO can achieve the (asymptotically) optimal theoretical guarantees under both static and dynamic environments. For cardinality constraints, the GSEMO can achieve the optimal polynomial-time approximation ratio, $1/2$. For more general matroid constraints, the GSEMO can achieve the asymptotically optimal polynomial-time approximation ratio, $1/2-\epsilon/(4n)$. Furthermore, when the objective function (i.e., a linear combination of quality and diversity) changes dynamically, the GSEMO can maintain this approximation ratio in polynomial running time, addressing the open question proposed by Borodin et al. This also theoretically shows the superiority of EAs over local search for solving dynamic optimization problems for the first time, and discloses the robustness of the mutation operator of EAs against dynamic changes. Experiments on the applications of web-based search, multi-label feature selection and document summarization show the superior performance of the GSEMO over the state-of-the-art algorithms (i.e., the greedy algorithm and local search) under both static and dynamic environments.

翻译：根据一组地面项目,结果多样化问题旨在选择一个“质量”和“多样性”高“质量”和“多样性”的子集,同时满足一些限制。它出现在各种现实世界人工智能应用中,例如基于网络的搜索、文档总和和和特征选择,并且还在其他领域也有应用,例如计算几何、数据库、财务和业务研究。以前的算法主要基于贪婪或本地搜索。在本文中,我们提议重新将结果多样化问题作为一个双目标最大化问题,并通过多目标进化算法(EA),即GSEMO。我们理论上证明,在静态和动态环境中,GSEMO可以实现(暂时)最佳的理论保障。对于基离强的运行率,GEMO(I) 和(i) 快速的运行率(IFIO ), 快速的运行质量, 和动态的运行率(IFIO), 运行的运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中, 运行中,运行中,运行中, 运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中。。。。。。。。。。。。。,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行中,运行