Privacy in speech and audio has many facets. A particularly under-developed area of privacy in this domain concerns information related to content and context. Speech content can include words and their meaning, as well as stylistic markers, pathological speech, intonation patterns, or emotion. More generally, audio captured in the wild may contain background speech or reveal contextual information such as markers of location, room characteristics, paralinguistic sounds, or other audible events. Audio recording devices and speech technologies are becoming increasingly commonplace in everyday life. At the same time, commercialised speech and audio technologies do not provide consumers with a range of privacy choices. Even where privacy is regulated or protected by law, technical solutions for privacy assurance and enforcement fall short. This position paper introduces three important and timely research challenges for content privacy in speech and audio. We highlight current gaps and opportunities, and identify focus areas that could have significant implications for developing ethical and safer speech technologies.