NeurST is an open-source toolkit for neural speech translation. The toolkit mainly focuses on end-to-end speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at facilitating the speech translation research for NLP researchers and building reliable benchmarks for this field. It provides step-by-step recipes for feature extraction, data preprocessing, distributed training, and evaluation. In this paper, we will introduce the framework design of NeurST and show experimental results for different benchmark datasets, which can be regarded as reliable baselines for future research. The toolkit is publicly available at https://github.com/bytedance/neurst/ and we will continuously update the performance of NeurST with other counterparts and studies at https://st-benchmark.github.io/.
翻译:NeurST是神经语音翻译的开放源码工具包,主要侧重于终端到终端语音翻译,易于使用、修改和扩展至先进的语音翻译研究和产品;NeurST旨在便利国家语言方案研究人员的语音翻译研究,并为该领域建立可靠的基准;它为特征提取、数据预处理、分发培训和评估提供逐步配方;在本文件中,我们将介绍NeurST的框架设计,并展示不同基准数据集的实验结果,这些数据集可被视为未来研究的可靠基线;工具包可在https://github.com/byedance/neurst/上公开查阅,我们将与其他对应方一起不断更新NeurST的绩效,并在https://st-benchmark.github.io/上不断更新。