Using our voices to access, and interact with, online services raises concerns about the trade-offs between convenience, privacy, and security. The conflict between maintaining privacy and ensuring input authenticity has often been hindered by the need to share raw data, which contains all the paralinguistic information required to infer a variety of sensitive characteristics. Users of voice assistants put their trust in service providers; however, this trust is potentially misplaced considering the emergence of first-party 'honest-but-curious' or 'semi-honest' threats. A further security risk is presented by imposters gaining access to systems by pretending to be the user leveraging replay or 'deepfake' attacks. Our objective is to design and develop a new voice input-based system that offers the following specifications: local authentication to reduce the need for sharing raw voice data, local privacy preservation based on user preferences, allowing more flexibility in integrating such a system given target applications privacy constraints, and achieving good performance in these targeted applications. The key idea is to locally derive token-based credentials based on unique-identifying attributes obtained from the user's voice and offer selective sensitive information filtering before transmitting raw data. Our system consists of (i) 'VoiceID', boosted with a liveness detection technology to thwart replay attacks; (ii) a flexible privacy filter that allows users to select the level of privacy protection they prefer for their data. The system yields 98.68% accuracy in verifying legitimate users with cross-validation and runs in tens of milliseconds on a CPU and single-core ARM processor without specialized hardware. Our system demonstrates the feasibility of filtering raw voice input closer to users, in accordance with their privacy preferences, while maintaining their authenticity.
翻译:使用我们的声音访问在线服务,并与之互动,这引起了人们对方便、隐私和安全之间的权衡问题的关切。维护隐私和确保投入真实性之间的冲突往往由于需要共享原始数据而受阻。原始数据包含为推断各种敏感特征所需的所有语言信息。语音助理用户信任服务提供商;然而,考虑到第一当事方的“诚实但可靠”或“半诚实”威胁的出现,这种信任可能错失。假冒者通过假冒用户利用重播或“深藏”攻击来获取系统,从而带来进一步的安全风险。我们的目标是设计和开发一个新的语音输入系统,提供以下规格:当地认证,以减少共享原始语音数据的需求,根据用户的偏好度保护当地隐私;根据用户的“诚实但有欺骗性”或“半诚实”威胁。关键思想是,根据用户声音的独特识别特征,在当地获取基于象征性的认证信息,并在传输准确度前提供有选择性的敏感信息过滤系统,以更精确的准确度为我们用户提供更精确的数据检测。我们系统的系统在进行更精确的测试之前,通过更精确的系统进行更精确的测试,从而进行更灵活地评估。</s>