PickNet: 特别麦克风阵列实时频道选择 (PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays)

This paper proposes PickNet, a neural network model for real-time channel selection for an ad hoc microphone array consisting of multiple recording devices like cell phones. Assuming at most one person to be vocally active at each time point, PickNet identifies the device that is spatially closest to the active person for each time frame by using a short spectral patch of just hundreds of milliseconds. The model is applied to every time frame, and the short time frame signals from the selected microphones are concatenated across the frames to produce an output signal. As the personal devices are usually held close to their owners, the output signal is expected to have higher signal-to-noise and direct-to-reverberation ratios on average than the input signals. Since PickNet utilizes only limited acoustic context at each time frame, the system using the proposed model works in real time and is robust to changes in acoustic conditions. Speech recognition-based evaluation was carried out by using real conversational recordings obtained with various smartphones. The proposed model yielded significant gains in word error rate with limited computational cost over systems using a block-online beamformer and a single distant microphone.

翻译：本文提议PickNet, 这是一种由手机等多个录音装置组成的临时麦克风阵列实时频道选择的神经网络模型。假设大多数人在每次时点上都能发出声音, PickNet通过使用短频谱谱谱谱段,确定每个时点上与活动人员空间最接近的设备, 模型适用于每个时点, 选定的麦克风的短时段信号在框架之间相互连接, 以产生输出信号。由于个人设备通常紧贴其所有者, 预计输出信号平均比输入信号使用的比例更高。由于PickNet在每一时点上只使用有限的声学环境, 使用拟议模型的系统可以实时工作, 并且对音响条件的变化非常有力。语音识别评价是通过使用与各种智能手机获得的实时对话记录进行的。提议的模型在单线上和单远程麦克风的系统计算成本有限的情况下,在文字错误率上取得了显著的增益。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【简明书】数学，统计和机器学习的动手入门，57页pdf，A Hands-On Introduction to Math, Stats, and Machine Learning

专知会员服务

43+阅读 · 2022年2月26日