学习层次分组：一种提高分类公平性而不失准确性的多层次优化框架 (Learning by Grouping: A Multilevel Optimization Framework for Improving Fairness in Classification without Losing Accuracy)

The integration of machine learning models in various real-world applications is becoming more prevalent to assist humans in their daily decision-making tasks as a result of recent advancements in this field. However, it has been discovered that there is a tradeoff between the accuracy and fairness of these decision-making tasks. In some cases, these AI systems can be unfair by exhibiting bias or discrimination against certain social groups, which can have severe consequences in real life. Inspired by one of the most well-known human learning skills called grouping, we address this issue by proposing a novel machine learning framework where the ML model learns to group a diverse set of problems into distinct subgroups to solve each subgroup using its specific sub-model. Our proposed framework involves three stages of learning, which are formulated as a three-level optimization problem: (i) learning to group problems into different subgroups; (ii) learning group-specific sub-models for problem-solving; and (iii) updating group assignments of training examples by minimizing the validation loss. These three learning stages are performed end-to-end in a joint manner using gradient descent. To improve fairness and accuracy, we develop an efficient optimization algorithm to solve this three-level optimization problem. To further reduce the risk of overfitting in small datasets, we incorporate domain adaptation techniques in the second stage of training. We further apply our method to neural architecture search. Extensive experiments on various datasets demonstrate our method's effectiveness and performance improvements in both fairness and accuracy. Our proposed Learning by Grouping can reduce overfitting and achieve state-of-the-art performances with fixed human-designed network architectures and searchable network architectures on various datasets.

翻译：近年来，随着人工智能领域的不断发展，机器学习模型在各种实际应用中的融入越来越普遍，以协助人类进行日常决策任务。然而，分类准确性和公平性之间存在权衡取舍。在某些情况下，这些AI系统可能是不公平但又具有偏见或歧视，这可能会在现实生活中产生严重后果。受人类最为著名的学习技能——分组的启示，我们提出了一种新颖的机器学习框架，在该框架中，ML模型学习将多样化的问题分为不同的子组，使用特定的子模型解决每个子组的问题。我们的提议涉及三个学习阶段，被公式化为三层优化问题：（i）学习将问题分组为不同的子组；（ii）学习针对问题解决的特定子模型;和（iii）通过最小化验证损失更新训练示例的群组分配。这三个学习阶段通过梯度下降以联合方式端对端执行。为了提高公平性和准确性，我们开发了一种有效的优化算法来解决这三层优化问题。在第二阶段的培训中，引入了领域自适应技术，以进一步降低小型数据集过拟合风险。我们进一步应用了这种方法进行神经架构搜索。对各种数据集进行的广泛实验证明了我们方法的有效性和性能改进，既能在公平性上取得进一步进步，又能在准确性上取得state-of-the-art的性能表现。我们提出的层级分组学习可以在各种数据集上减少过拟合并采用定制的人类设计网络体系结构和可搜索的网络体系结构达到最先进的性能表现。