Speech-centric machine learning systems have revolutionized many leading domains ranging from transportation and healthcare to education and defense, profoundly changing how people live, work, and interact with each other. However, recent studies have demonstrated that many speech-centric ML systems may need to be considered more trustworthy for broader deployment. Specifically, concerns over privacy breaches, discriminating performance, and vulnerability to adversarial attacks have all been discovered in ML research fields. In order to address the above challenges and risks, a significant number of efforts have been made to ensure these ML systems are trustworthy, especially private, safe, and fair. In this paper, we conduct the first comprehensive survey on speech-centric trustworthy ML topics related to privacy, safety, and fairness. In addition to serving as a summary report for the research community, we point out several promising future research directions to inspire the researchers who wish to explore further in this area.
翻译:语音中心的机器学习系统已经在许多领域领导,从交通和医疗保健到教育和国防,深刻改变了人们生活和互动方式。然而,最近的研究表明,许多语音中心的机器学习系统可能需要更加可靠才能进行更广泛的部署。具体而言,在ML研究领域发现了隐私泄露、歧视性能和容易受到对抗性攻击的问题。为了解决上述挑战和风险,已经进行了大量的工作,以确保这些ML系统是值得信赖的,特别是保持隐私、安全和公平性。在本文中,我们进行了第一次关于语音中心的值得信赖的机器学习主题的全面调查,这些主题涉及到隐私、安全和公平性。除了为研究社区提供摘要报告外,我们还指出了几个有前途的未来研究方向,以激励希望在这一领域继续探索的研究人员。