The conventional room geometry blind inference techniques with acoustic signals are conducted based on the prior knowledge of the environment, such as the room impulse response (RIR) or the sound source position, which will limit its application under unknown scenarios. To solve this problem, we have proposed a room geometry reconstruction method in this paper by using the geometric relation between the direct signal and first-order reflections. In addition to the information of the compact microphone array itself, this method does not need any precognition of the environmental parameters. Besides, the learning-based DNN models are designed and used to improve the accuracy and integrity of the localization results of the direct source and first-order reflections. The direction of arrival (DOA) and time difference of arrival (TDOA) information of the direct and reflected signals are firstly estimated using the proposed DCNN and TD-CNN models, which have higher sensitivity and accuracy than the conventional methods. Then the position of the sound source is inferred by integrating the DOA, TDOA and array height using the proposed DNN model. After that, the positions of image sources and corresponding boundaries are derived based on the geometric relation. Experimental results of both simulations and real measurements verify the effectiveness and accuracy of the proposed techniques compared with the conventional methods under different reverberant environments.
翻译:传统的室内失明推断技术,加上音频信号,是根据事先的环境知识,如室脉冲反应(RIR)或健全的源位置,进行常规室间失明推断技术,这将限制其在未知情景下的应用。为解决这一问题,我们建议本文件采用直接信号和一阶反射之间的几何关系,在直接信号和一阶反射之间的几何关系中采用室间几何重建方法。除了紧凑麦克风阵列本身的信息外,这一方法不需要对环境参数作任何确认。此外,基于学习的DNNN模型的设计和使用是为了提高直接源和一阶反射的本地化结果的准确性和完整性。直接信号和反射信号的到达方向和到达时间差(TDOA)信息首先使用拟议的DCNNN和TD-CNN模型进行估计,这些模型比常规方法的灵敏度和精确度更高。然后,通过将DA、TDOA和阵列高度与拟议DNN模型加以推断,然后,图像来源的位置和相应的边界根据拟议常规测量方法的精确度,根据拟议的实际测算结果,根据不同的实验环境得出。