Recent research suggests that predictions made by machine-learning models can amplify biases present in the training data. When a model amplifies bias, it makes certain predictions at a higher rate for some groups than expected based on training-data statistics. Mitigating such bias amplification requires a deep understanding of the mechanics in modern machine learning that give rise to that amplification. We perform the first systematic, controlled study into when and how bias amplification occurs. To enable this study, we design a simple image-classification problem in which we can tightly control (synthetic) biases. Our study of this problem reveals that the strength of bias amplification is correlated to measures such as model accuracy, model capacity, model overconfidence, and amount of training data. We also find that bias amplification can vary greatly during training. Finally, we find that bias amplification may depend on the difficulty of the classification task relative to the difficulty of recognizing group membership: bias amplification appears to occur primarily when it is easier to recognize group membership than class membership. Our results suggest best practices for training machine-learning models that we hope will help pave the way for the development of better mitigation strategies. Code can be found at https://github.com/facebookresearch/cv_bias_amplification.
翻译:最近的研究表明,机器学习模型所作的预测可以扩大培训数据中存在的偏见。当模型放大偏差时,它会根据培训数据统计数据对某些群体作出比预期的要高的预测。缩小这种偏差放大需要深入了解现代机器学习中导致这种放大的机械学力。我们首先对何时和如何发生偏差放大进行系统、受控制的研究。为了能够进行这项研究,我们设计了一个简单的图像分类问题,我们可以在其中严格控制(合成)偏见。我们对这个问题的研究显示,偏差放大的强度与诸如模型精确度、模型能力、模型过度自信和数量培训数据等衡量标准相关。我们还发现,偏差放大在培训期间可能有很大差异。最后,我们发现偏差放大可能取决于分类任务相对于承认群体成员资格的困难:偏差放大似乎主要发生在我们比较容易地承认群体成员而不是阶级成员的时候。我们发现在培训机器学习模型方面的最佳做法,我们希望这将有助于为改进的缓解战略的发展铺平道路。 httpsco 将找到最佳做法。