In the current method for the sound field translation tasks based on spherical harmonic (SH) analysis, the solution based on the additive theorem usually faces the problem of singular values caused by large matrix condition numbers. The influence of different distances and frequencies of the spherical radial function on the stability of the translation matrix will affect the accuracy of the SH coefficients at the selected point. Due to the problems mentioned above, we propose a neural network scheme based on the dual-path transformer. More specifically, the dual-path network is constructed by the self-attention module along the two dimensions of the frequency and order axes. The transform-average-concatenate layer and upscaling layer are introduced in the network, which provides solutions for multiple sampling points and upscaling. Numerical simulation results indicate that both the working frequency range and the distance range of the translation are extended. More accurate higher-order SH coefficients are obtained with the proposed dual-path network.
翻译:在目前基于球体口音(SH)分析的音频实地翻译任务方法中,基于添加式定理的解决方案通常面临由大矩阵条件数造成的单值问题。球形射线函数不同距离和频率对翻译矩阵稳定性的影响将影响选定点的SH系数的准确性。由于上述问题,我们提议基于双向变压器的神经网络计划。更具体地说,双管网络是由自控模块沿频率和顺序轴的两个维度建造的。在网络中引入变换平均相交错层和升降层,为多个取样点和升降尺度提供解决方案。数字模拟结果显示,翻译的工作频率范围和距离范围都扩大了。拟议的双向网络获得了更精确的更高级SH系数。