相似性和独立- 软件- 软件- 软件- 软件- 软件- 与使用参考参考目标源采掘的迭代播种和推进启动目标源采掘 (Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference)

Target source extraction is significant for improving human speech intelligibility and the speech recognition performance of computers. This study describes a method for target source extraction, called the similarity-and-independence-aware beamformer (SIBF). The SIBF extracts the target source using a rough magnitude spectrogram as the reference signal. The advantage of the SIBF is that it can obtain a more accurate signal than the spectrogram generated by target-enhancing methods such as speech enhancement based on deep neural networks. For the extraction, we extend the framework of deflationary independent component analysis (ICA) by considering the similarities between the reference and extracted target sources, in addition to the mutual independence of all the potential sources. To solve the extraction problem by maximum-likelihood estimation, we introduce three source models that can reflect the similarities. The major contributions of this study are as follows. First, the extraction performance is improved using two methods, namely boost start for faster convergence and iterative casting for generating a more accurate reference. The effectiveness of these methods is verified through experiments using the CHiME3 dataset. Second, a concept of a fixed point pertaining to accuracy is developed. This concept facilitates understanding the relationship between the reference and SIBF output in terms of accuracy. Third, a unified formulation of the SIBF and mask-based beamformer is realized to apply the expertise of conventional BFs to the SIBF. The findings of this study can also improve the performance of the SIBF and promote research on ICA and conventional beamformers. Index Terms: beamformer, independent component analysis, source separation, speech enhancement, target source extraction

翻译：目标源的提取对于提高人的言语感知度和计算机语音识别性能意义重大。本研究描述了一种目标源提取方法,称为“相似和独立意识光谱仪(SIBF)” 。SIMF利用粗微规模光谱谱作为参考信号提取目标源。SIMF的优点是,它可以获得比基于深层神经网络的增强言词增强声调等增强目标方法产生的光谱学更准确的信号。关于提取,我们通过考虑参考和提取目标源之间的相似性,以及所有潜在源的相互独立性,扩展通缩独立组成部分分析框架。为了通过最大相似性估计解决提取问题,我们采用了三种来源模型,可以反映相似性。本研究的主要贡献如下。首先,利用两种方法,即加速启动更快的趋同和迭接生成更准确性,这些方法的有效性通过使用CHIME3数据集的实验得到验证。其次,一个与准确性相关的固定点概念是所有潜在来源。为了通过最大似异性估算性估算,SIMF的SI-BF的精确性分析,这个概念可以促进SIMF的提高IMF的常规源的精确性研究。