In this paper the current status and open challenges of synthetic speech detection are addressed. The work comprises an initial analysis of available open datasets and of existing detection methods, a description of the requirements for new research datasets compliant with regulations and better representing real-case scenarios, and a discussion of the desired characteristics of future trustworthy detection methods in terms of both functional and non-functional requirements. Compared to other works, based on specific detection solutions or presenting single dataset of synthetic speeches, our paper is meant to orient future state-of-the-art research in the domain, to quickly lessen the current gap between synthesis and detection approaches.
翻译:本文讨论了合成语音探测的现状和公开挑战,包括初步分析现有开放数据集和现有探测方法,说明符合规章的新研究数据集的要求,更好地反映实际情况,从功能要求和非功能要求的角度讨论未来可信赖的探测方法的预期特点。与其他工作相比,根据具体的探测解决办法或提出合成演讲的单一数据集,我们的文件旨在指导未来该领域的最新研究,以迅速缩小目前合成和探测方法之间的差距。