Sequence alignment forms an important backbone in many sequencing applications. A commonly used strategy for sequence alignment is an approximate string matching with a two-dimensional dynamic programming approach. Although some prior work has been conducted on GPU acceleration of a sequence alignment, we identify several shortcomings that limit exploiting the full computational capability of modern GPUs. This paper presents SaLoBa, a GPU-accelerated sequence alignment library focused on seed extension. Based on the analysis of previous work with real-world sequencing data, we propose techniques to exploit the data locality and improve workload balancing. The experimental results reveal that SaLoBa significantly improves the seed extension kernel compared to state-of-the-art GPU-based methods.
翻译:序列对齐是许多测序应用程序的重要支柱。 序列对齐的常用策略是近似字符串,与二维动态编程方法相匹配。 尽管先前已经就GPU加速序列对齐问题开展了一些工作,但我们发现了一些限制利用现代GPU全部计算能力的缺陷。本文介绍了以种子扩展为重点的GPU加速序列对齐库SaLoBa。根据对以往与现实世界测序数据所做工作的分析,我们提出了利用数据地点和改善工作量平衡的技术。实验结果表明,SaLOBA显著改进了种子扩展内核,而不是以最先进的GPU为基础的方法。