推出一个实验性扭曲-容忍性语音加密计划,用于安全语音通信 (Introducing an experimental distortion-tolerant speech encryption scheme for secure voice communication)

The current increasing need for privacy-preserving voice communications is leading to new ideas for securing voice transmission. This paper refers to a relatively new concept of sending encrypted speech as pseudo-speech in the audio domain over digital voice communication infrastructures, like 3G cellular network and VoIP. This work presents a novel distortion-tolerant speech encryption scheme for secure voice communications over voice channels that combines the robustness of analog speech scrambling and elevated security offered by digital ciphers like AES-CTR. The system scrambles vocal parameters of a speech signal (loudness, pitch, timbre) using distance-preserving pseudo-random translations and rotations on a hypersphere of parameters. Next, scrambled parameters are encoded to a pseudo-speech signal adapted to transmission over digital voice channels equipped with voice activity detection. Upon reception of this pseudo-speech signal, the legitimate receiver restores distorted copies of the initial vocal parameters. Despite some deciphering errors, an integrated neural-based vocoder based on the LPCNet architecture reconstructs an intelligible speech. The experimental implementation of this speech encryption scheme has been tested by simulations and sending an encrypted signal over FaceTime between two iPhones 6 connected to the same WiFi network. Moreover, speech excerpts restored from encrypted signals were evaluated by a speech quality assessment on a group of about 40 participants. The experiments demonstrated that the proposed scheme produces intelligible speech with a gracefully progressive quality degradation depending on the channel noise. Finally, the preliminary computational analysis suggested that the presented setting may operate on high-end portable devices in nearly real-time.

翻译：目前对隐私保护语音通信的需求日益增加,这正在导致确保语音传输的新想法。本文提到在数字语音通信基础设施( 如3G蜂窝网络和VoIP)的音频域中,将加密语音作为假话发送到音频通信基础设施上, 这是一种相对较新的概念。这项工作为语音频道的安全语音通信提供了一种新的扭曲容忍语音加密计划, 将模拟语音拼接和像AES- CTR这样的数字密码器提供的强化安全结合起来。系统使用远程保存假兰言翻译和旋转在高频参数上进行远程保存的假话翻译和旋转, 这是一种相对较新的概念。调的参数被编码为假话音信号传递到配有语音活动检测的数码语音频道上。收到模拟语音信号后, 合法接收器恢复了最初语音参数的扭曲副本。尽管存在一些错误, 但基于 LPCNet 架构的基于移动式语音代码的综合音频调调参数参数, 重建了超前40级语音语言语言的递增语言分析。试验性语音定位系统运行了一个模拟系统, 模拟演示了SIMLILLLLLLLL 的升级系统, 模拟的升级系统运行了一个模拟版本。