Speech emotion sensing in communication networks has a wide range of applications in real life. In these applications, voice data are transmitted from the user to the central server for storage, processing, and decision making. However, speech data contain vulnerable information that can be used maliciously without the user's consent by an eavesdropping adversary. In this work, we present a privacy-enhanced emotion communication system for preserving the user personal information in emotion-sensing applications. We propose the use of an adversarial learning framework that can be deployed at the edge to unlearn the users' private information in the speech representations. These privacy-enhanced representations can be transmitted to the central server for decision making. We evaluate the proposed model on multiple speech emotion datasets and show that the proposed model can hide users' specific demographic information and improve the robustness of emotion identification without significantly impacting performance. To the best of our knowledge, this is the first work on a privacy-preserving framework for emotion sensing in the communication network.
翻译:在通信网络中,语音情感感知在现实生活中有着广泛的应用。在这些应用中,语音数据从用户传送到中央服务器,供存储、处理和决策之用。然而,语音数据包含易感信息,未经用户同意,无需窃听对手同意,即可恶意使用。在这项工作中,我们提出了一个增强隐私的情感通信系统,用于在情感感应应用中保护用户个人信息。我们建议使用一个对抗性学习框架,在语言演示中将用户的私人信息解密。这些增强隐私的表达方式可以传送到中央服务器,供决策使用。我们评估了多语种情绪数据集的拟议模型,并表明拟议的模型可以隐藏用户的具体人口信息,提高情绪识别的稳健性,而不会对性能产生重大影响。据我们所知,这是在通信网络中为情感感知而建立隐私保护框架的首项工作。