Baselines and Protocols for Household Speaker Recognition

Speaker recognition on household devices, such as smart speakers, poses several challenges: (i) robustness across a vast number of heterogeneous domains (households), (ii) short utterances, (iii) possibly absent speaker labels for the enrollment data (passive enrollment), and (iv) the presence of unknown persons (guests). While many commercial products exist, there is less published research, and there are no publicly available evaluation protocols or open-source baselines. Our work bridges this gap by providing an accessible evaluation benchmark derived from public resources (VoxCeleb and ASVspoof 2019 data), along with a preliminary pool of open-source baselines. The pool includes four algorithms for active enrollment (speaker labels available) and one algorithm for passive enrollment.
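To make the active-enrollment setting with guests concrete, here is a minimal sketch of open-set identification within one household. It is not one of the baselines from the paper. It assumes that fixed-length speaker embeddings have already been extracted by some speaker encoder, that scoring uses cosine similarity against per-speaker averages, and that a hypothetical threshold of 0.5 separates household members from guests. The names `enroll`, `identify`, and `guest_threshold` are illustrative only.

```python
import numpy as np


def enroll(embeddings_by_speaker):
    """Build one unit-length profile per labeled household member.

    Assumption: each value is a list of precomputed embedding vectors.
    """
    profiles = {}
    for name, embeddings in embeddings_by_speaker.items():
        centroid = np.mean(np.asarray(embeddings, dtype=float), axis=0)
        profiles[name] = centroid / np.linalg.norm(centroid)
    return profiles


def identify(test_embedding, profiles, guest_threshold=0.5):
    """Return (label, scores); label is "guest" if no score reaches the threshold.

    The threshold value is a hypothetical placeholder, not a tuned setting.
    """
    test = np.asarray(test_embedding, dtype=float)
    test = test / np.linalg.norm(test)
    # Cosine similarity reduces to a dot product for unit-length vectors.
    scores = {name: float(np.dot(test, profile)) for name, profile in profiles.items()}
    best = max(scores, key=scores.get)
    if scores[best] < guest_threshold:
        return "guest", scores
    return best, scores


# Toy 3-dimensional vectors standing in for real speaker embeddings.
household = {
    "alice": [[1.0, 0.1, 0.0], [0.9, 0.0, 0.1]],
    "bob": [[0.0, 1.0, 0.1], [0.1, 0.9, 0.0]],
}
profiles = enroll(household)
known_label, _ = identify([0.95, 0.05, 0.0], profiles)
guest_label, _ = identify([0.0, 0.0, 1.0], profiles)
print(known_label, guest_label)
```

In passive enrollment the speaker labels in `household` would be unavailable, so the enrollment utterances would first have to be grouped by some unsupervised step before profiles like these could be built.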


Voiceprint Recognition

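Speaker recognition, also called voiceprint recognition (VPR), is a biometric technology that automatically determines a speaker's identity from the speaker-specific information contained in speech, using computers and modern information recognition techniques. The goal of speaker recognition research is to extract features from speech that characterize the speaker, build effective models and systems, and achieve automatic, accurate speaker identification.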
