在可信赖的机器学习中的职能构成:实施选择、深入观察和问题 (Function Composition in Trustworthy Machine Learning: Implementation Choices, Insights, and Questions)

Ensuring trustworthiness in machine learning (ML) models is a multi-dimensional task. In addition to the traditional notion of predictive performance, other notions such as privacy, fairness, robustness to distribution shift, adversarial robustness, interpretability, explainability, and uncertainty quantification are important considerations to evaluate and improve (if deficient). However, these sub-disciplines or 'pillars' of trustworthiness have largely developed independently, which has limited us from understanding their interactions in real-world ML pipelines. In this paper, focusing specifically on compositions of functions arising from the different pillars, we aim to reduce this gap, develop new insights for trustworthy ML, and answer questions such as the following. Does the composition of multiple fairness interventions result in a fairer model compared to a single intervention? How do bias mitigation algorithms for fairness affect local post-hoc explanations? Does a defense algorithm for untargeted adversarial attacks continue to be effective when composed with a privacy transformation? Toward this end, we report initial empirical results and new insights from 9 different compositions of functions (or pipelines) on 7 real-world datasets along two trustworthy dimensions - fairness and explainability. We also report progress, and implementation choices, on an extensible composer tool to encourage the combination of functionalities from multiple pillars. To-date, the tool supports bias mitigation algorithms for fairness and post-hoc explainability methods. We hope this line of work encourages the thoughtful consideration of multiple pillars when attempting to formulate and resolve a trustworthiness problem.

翻译：确保机器学习(ML)模式的可信度是一项多层面的任务。除了预测性业绩的传统概念外,其他概念,如隐私、公平、可靠和分配转换、对抗性强、可解释性、可解释性和不确定性量化等,也是评估和改进(如果不足的话)的重要考虑因素。然而,这些次级纪律或“支柱”的可信度在很大程度上是独立发展起来的,这限制了我们理解其在真实世界ML管道中的互动。在本文件中,我们特别侧重于不同支柱产生的职能构成,我们的目标是缩小这一差距,为可靠的ML开发新的洞察力,并回答以下问题。多重公平干预的构成是否导致一种较公平的模式?公平性方面的减少偏差算法如何影响当地后的解释?在进行隐私改革时,这些非有针对性的对抗性攻击的防御算法是否继续有效?为此目的,我们报告从9种不同职能构成(或管道)的初步经验结果和新的洞察力,以及7种真实世界数据设置在两个可信赖的多维维度层面――公平和解释性方面,我们的报告,还支持从可比较性、可理解性改革后的工具中推算。