Recently, ad-hoc microphone array has been widely studied. Unlike traditional microphone array settings, the spatial arrangement and number of microphones of ad-hoc microphone arrays are not known in advance, which hinders the adaptation of traditional speaker verification technologies to ad-hoc microphone arrays. To overcome this weakness, in this paper, we propose attention-based multi-channel speaker verification with ad-hoc microphone arrays. Specifically, we add an inter-channel processing layer and a global fusion layer after the pooling layer of a single-channel speaker verification system. The inter-channel processing layer applies a so-called residual self-attention along the channel dimension for allocating weights to different microphones. The global fusion layer integrates all channels in a way that is independent to the number of the input channels. We further replace the softmax operator in the residual self-attention with sparsemax, which forces the channel weights of very noisy channels to zero. Experimental results with ad-hoc microphone arrays of over 30 channels demonstrate the effectiveness of the proposed methods. For example, the multi-channel speaker verification with sparsemax achieves an equal error rate (EER) of over 20% lower than oracle one-best system on semi-real data sets, and over 30% lower on simulation data sets, in test scenarios with both matched and mismatched channel numbers.
翻译:与传统的麦克风阵列设置不同,专用麦克风阵列的空间安排和麦克风的麦克风数量事先并不为人所知,这妨碍了传统扬声器核查技术适应临时麦克风阵列。为了克服这一弱点,我们在本文件中提议用临时麦克风阵列进行多声道声带校验。具体地说,我们在一个单一通道扬声器核查系统的集合层之后,增加一个通道间处理层和一个全球聚变层。在频道间处理层沿频道设置应用所谓的剩余自留自留装置,将重量分配给不同的麦克风。全球聚变层将所有频道都结合到与输入通道数目独立的方式。为了克服这一弱点,我们在本文件中提议用特别热声麦克风阵列进行基于关注的多声道声器校验。我们进一步将剩余自留声式麦克风声阵列中的软式传声器操作器操作员替换为零。在30多个频道扬声器的自动麦克风阵列中展示了拟议方法的有效性。例如,多声带自留音器的自留自留自留自留自留自留,在对不同麦克声器的声器上进行对各声器的自留式自留式自留式自留置式自留式自留式自留式自留式自留至不同麦克式自留式自留置,对各频道内,对各频道内,对各频道的模拟,对准率率率都为20,对等的低速率数据,对准率数据,对等的模拟,对准数据,对调率率率率率率率数据,对准率率次,对准数据,对准率率率率率率率率在20,对准数据,对准率在20,对准率率率在20,对准率上,对准率次,对准率,对准数据,对准数据,对准率率率率率率在20,对准率均,对准率,对准率,对准率,对准率均,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对低,对