Speech-centric machine learning systems have revolutionized many leading domains ranging from transportation and healthcare to education and defense, profoundly changing how people live, work, and interact with each other. However, recent studies have demonstrated that many speech-centric ML systems may need to be considered more trustworthy for broader deployment. Specifically, concerns over privacy breaches, discriminating performance, and vulnerability to adversarial attacks have all been discovered in ML research fields. In order to address the above challenges and risks, a significant number of efforts have been made to ensure these ML systems are trustworthy, especially private, safe, and fair. In this paper, we conduct the first comprehensive survey on speech-centric trustworthy ML topics related to privacy, safety, and fairness. In addition to serving as a summary report for the research community, we point out several promising future research directions to inspire the researchers who wish to explore further in this area.
翻译:以言语为中心的机器学习系统使许多主要领域发生了革命性的变化,从交通和保健到教育和防御,深刻地改变了人们的生活、工作和互动方式。然而,最近的研究表明,许多以言语为中心的ML系统可能需要被认为更值得信赖,以便更广泛地部署。具体地说,在ML研究领域都发现了对隐私侵犯、歧视性表现和易受对抗性攻击的担忧。为了应对上述挑战和风险,为确保这些ML系统值得信赖,特别是私人、安全和公平,作出了大量努力。我们在本文件中对与隐私、安全和公平有关的以言语为中心的值得信赖的ML专题进行了第一次全面调查。除了作为研究界的简要报告外,我们还指出了若干有希望的未来研究方向,以激励希望在这一领域进一步探索的研究人员。