根据真实音源和第一顺序反射的本地化 (Room geometry blind inference based on the localization of real sound source and first order reflections)

The conventional room geometry blind inference techniques with acoustic signals are conducted based on the prior knowledge of the environment, such as the room impulse response (RIR) or the sound source position, which will limit its application under known scenarios. To solve this problem, we have proposed a room geometry reconstruction method in this paper by using the geometric relation between the direct signal and first-order reflections. In addition to the information of the compact microphone array itself, this method does not need any precognition of the environmental parameters. Besides, the learning-based DNN models are designed and used to improve the accuracy and integrity of the localization results of the direct source and first-order reflections. The direction of arrival (DOA) and time difference of arrival (TDOA) information of the direct and reflected signals are firstly estimated using the proposed DCNN and TD-CNN models, which have higher sensitivity and accuracy than the conventional methods. Then the position of the sound source is inferred by integrating the DOA, TDOA and array height using the proposed DNN model. After that, the positions of image sources and corresponding boundaries are derived based on the geometric relation. Experimental results of both simulations and real measurements verify the effectiveness and accuracy of the proposed techniques compared with the conventional methods under different reverberant environments.

翻译：传统的室内失明推断技术,加上音频信号,是根据事先的环境知识,如室脉冲反应(RIR)或健全的源位置,进行常规室间失明推断技术,这将限制其在已知假设情景下的应用。为了解决这一问题,我们建议本文件采用直接信号和一级反射之间的几何关系,在直接信号和一级反射之间采用一个室间几何重建方法。除了压缩麦克风阵列本身的信息外,这一方法不需要对环境参数有任何认识。此外,基于学习的DNNN模型的设计和使用是为了提高直接源和一级反射的本地化结果的准确性和完整性。直接和反映信号的到达方向和到达时间差(TDOA)信息首先使用拟议的DCNNN和TD-CNN模型进行估计,这些模型比常规方法更敏感和准确。然后,通过将DA、TDOA和阵列高度与拟议DNN模型结合起来,推断出声音来源的位置。此后,图像来源的位置和相应的边界根据拟议实际测算法的精确度,根据不同的常规实验环境,根据不同的测算结果和测算结果得出。