Finite-state transducers (FSTs) are frequently used in speech recognition. Transducer composition is an essential operation for combining different sources of information at different granularities. However, composition is also one of the more computationally expensive operations. Due to the heterogeneous structure of FSTs, parallel algorithms for composition are suboptimal in efficiency, generality, or both. We propose an algorithm for parallel composition and implement it on graphics processing units. We benchmark our parallel algorithm on the composition of random graphs and the composition of graphs commonly used in speech recognition. The parallel composition scales better with the size of the input graphs and for large graphs can be as much as 10 to 30 times faster than a sequential CPU algorithm.
翻译:在语音识别中经常使用极不稳定状态传感器(FSTs) 。 转换器构成是将不同颗粒的不同信息来源组合起来的基本操作。 但是,构成也是计算成本更高的操作之一。 由于FSTs的复杂结构, 组合的平行算法在效率、 普遍性或两者上都低于最优化水平。 我们为平行构成提出算法, 并在图形处理器上实施。 我们以随机图的构成和语音识别中常用的图表的构成作为我们的平行算法的基准。 与输入图和大图表的大小相比,平行构成比例可能比顺序的 CPU 算法要快10至30倍 。