高效第一阶预测器-更正器多目标优化,以公平识别错误信息 (Efficient first-order predictor-corrector multiple objective optimization for fair misinformation detection)

Multiple-objective optimization (MOO) aims to simultaneously optimize multiple conflicting objectives and has found important applications in machine learning, such as minimizing classification loss and discrepancy in treating different populations for fairness. At optimality, further optimizing one objective will necessarily harm at least another objective, and decision-makers need to comprehensively explore multiple optima (called Pareto front) to pinpoint one final solution. We address the efficiency of finding the Pareto front. First, finding the front from scratch using stochastic multi-gradient descent (SMGD) is expensive with large neural networks and datasets. We propose to explore the Pareto front as a manifold from a few initial optima, based on a predictor-corrector method. Second, for each exploration step, the predictor solves a large-scale linear system that scales quadratically in the number of model parameters and requires one backpropagation to evaluate a second-order Hessian-vector product per iteration of the solver. We propose a Gauss-Newton approximation that only scales linearly, and that requires only first-order inner-product per iteration. This also allows for a choice between the MINRES and conjugate gradient methods when approximately solving the linear system. The innovations make predictor-corrector possible for large networks. Experiments on multi-objective (fairness and accuracy) misinformation detection tasks show that 1) the predictor-corrector method can find Pareto fronts better than or similar to SMGD with less time; and 2) the proposed first-order method does not harm the quality of the Pareto front identified by the second-order method, while further reduce running time.

翻译：多重目标优化 (MOO) 旨在同时优化多重冲突目标, 并在机器学习中发现重要应用, 例如将分类损失和差异降低到最低程度, 以便公平对待不同人群。在最佳性方面, 进一步优化一个目标将必然至少伤害另一个目标, 决策者需要全面探索多个opima( 称为 Pareto Front) 来定位一个最终解决方案。我们处理找到 Pareto 前端的效率。首先, 使用随机多级下层( SMGD) 找到前端非常昂贵, 使用大型神经网络和数据集。我们提议将Pareto前端作为从几个初始选项中的一个元件进行探索, 以预测者- 校准方法为基础, 以预测者- 优化方法为基础, 预选第一阶前端前端的内产值, 且在前端的直径直径分析中只需要第一阶前端的内产值运行时间。此预测方法还可以选择大型直径直径网络的直径直度,, 并选择大型直径直径的直径直路路路路路。。此方法可以选择大型直径系统,,, 并选择大型直径直径对大型直径直。