With advances seen in deep learning, voice-based applications are burgeoning, ranging from personal assistants, affective computing, to remote disease diagnostics. As the voice contains both linguistic and paralinguistic information (e.g., vocal pitch, intonation, speech rate, loudness), there is growing interest in voice anonymization to preserve speaker privacy and identity. Voice privacy challenges have emerged over the last few years and focus has been placed on removing speaker identity while keeping linguistic content intact. For affective computing and disease monitoring applications, however, the paralinguistic content may be more critical. Unfortunately, the effects that anonymization may have on these systems are still largely unknown. In this paper, we fill this gap and focus on one particular health monitoring application: speech-based COVID-19 diagnosis. We test two popular anonymization methods and their impact on five different state-of-the-art COVID-19 diagnostic systems using three public datasets. We validate the effectiveness of the anonymization methods, compare their computational complexity, and quantify the impact across different testing scenarios for both within- and across-dataset conditions. Lastly, we show the benefits of anonymization as a data augmentation tool to help recover some of the COVID-19 diagnostic accuracy loss seen with anonymized data.
翻译:随着深度学习技术的不断进步,基于语音的应用正在蓬勃发展,包括个人助手、情感计算以及远程疾病诊断等。由于声音包含语言和语调等抑扬顿挫的信息,因此匿名化技术逐渐引起了人们的关注,以维护讲话者的隐私和身份。然而,对于情感计算和疾病监测等应用程序而言,语调等信息可能更为关键。不幸的是,目前匿名化技术对这些系统的影响仍然不明确。本文针对其中一个特殊的健康监测应用程序:基于语音的COVID-19诊断,测试了两种流行的匿名化方法及其对五种不同的COVID-19诊断系统,使用三个公共数据集,并验证了匿名化方法的有效性,比较了它们的计算复杂性,并在不同的测试场景下量化了匿名化对系统的影响。最后,我们展示了匿名化作为数据增强工具的优势,以帮助恢复部分受匿名化数据影响的COVID-19诊断准确性丢失。