The performance of machine learning algorithms can be considerably improved when trained over larger datasets. In many domains, such as medicine and finance, larger datasets can be obtained if several parties, each having access to limited amounts of data, collaborate and share their data. However, such data sharing introduces significant privacy challenges. While multiple recent studies have investigated methods for private collaborative machine learning, the fairness of such collaborative algorithms was overlooked. In this work we suggest a feasible privacy-preserving pre-process mechanism for enhancing fairness of collaborative machine learning algorithms. Our experimentation with the proposed method shows that it is able to enhance fairness considerably with only a minor compromise in accuracy.
翻译:机器学习算法的性能如果经过关于较大数据集的培训,就可以大大改进。在许多领域,如医学和金融领域,如果几个方面,每个方面都能够获得数量有限的数据、合作和分享数据,就可以获得更大的数据集。然而,这种数据共享带来了重大的隐私挑战。虽然最近多项研究调查了私人合作机器学习的方法,但这种合作算法的公正性被忽视了。在这项工作中,我们建议建立一个可行的隐私保护预处理机制,以增强协作机器学习算法的公平性。我们对拟议方法的实验表明,它能够大大加强公平性,但准确性只有很小的妥协。