公平需要的只是你所需要的 (Distraction is All You Need for Fairness)

Bias in training datasets must be managed for various groups in classification tasks to ensure parity or equal treatment. With the recent growth in artificial intelligence models and their expanding role in automated decision-making, ensuring that these models are not biased is vital. There is an abundance of evidence suggesting that these models could contain or even amplify the bias present in the data on which they are trained, inherent to their objective function and learning algorithms; Many researchers direct their attention to this issue in different directions, namely, changing data to be statistically independent, adversarial training for restricting the capabilities of a particular competitor who aims to maximize parity, etc. These methods result in information loss and do not provide a suitable balance between accuracy and fairness or do not ensure limiting the biases in training. To this end, we propose a powerful strategy for training deep learning models called the Distraction module, which can be theoretically proven effective in controlling bias from affecting the classification results. This method can be utilized with different data types (e.g., Tabular, images, graphs, etc.). We demonstrate the potency of the proposed method by testing it on UCI Adult and Heritage Health datasets (tabular), POKEC-Z, POKEC-N and NBA datasets (graph), and CelebA dataset (vision). Using state-of-the-art methods proposed in the fairness literature for each dataset, we exhibit our model is superior to these proposed methods in minimizing bias and maintaining accuracy.

翻译：培训数据集中的偏见必须针对不同群体进行分类任务管理,以确保平等或平等待遇。随着最近人工智能模型的发展及其在自动化决策中日益扩大的作用,确保这些模型没有偏向性至关重要。有大量证据表明,这些模型可以包含甚至扩大所培训数据中存在的偏见,这是它们客观功能和学习算法所固有的; 许多研究人员将注意力转向不同方向,即改变数据,使其在统计上独立,进行对抗性培训,以限制特定竞争者的能力,而该竞争者的目标是最大限度地实现均等等。这些方法导致信息丢失,不能在准确性和公平性之间提供适当平衡,或者不能确保限制培训中的偏见。为此目的,我们提出一个强有力的培训深层次学习模型的策略,称为困症模块,从理论上可以有效地控制影响分类结果的偏差。这种方法可以用不同的数据类型来使用(例如模型、图像、图表等)。我们通过测试UCI的成人和遗产健康文献的准确性和公正性,而不是确保适当平衡。我们提出的方法,在每一个拟议的国家-健康数据库中使用了我们提议的高级数据集(Ably-deal-deal-degraphal- data-degraphal-de the the the the State data-degraphet the west data-magistration the data-magistration), 和Cal