In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task. To meet the latency constraints posed by the different application scenarios, multiple dedicated SimulST models are usually trained and maintained, generating high computational costs. In this paper, motivated by the increased social and environmental impact caused by these costs, we investigate whether a single model trained offline can serve not only the offline but also the simultaneous task without the need for any additional training or adaptation. Experiments on en->{de, es} indicate that, aside from facilitating the adoption of well-established offline techniques and architectures without affecting latency, the offline solution achieves similar or better translation quality compared to the same model trained in simultaneous settings, as well as being competitive with the SimulST state of the art.
翻译:在同时的语音翻译(SimulST)中,找到高翻译质量和低延迟之间的最佳平衡是一项具有挑战性的任务。为了应对不同应用情景造成的延迟限制,通常对多个专门的SimulST模型进行培训和维护,从而产生高昂的计算成本。在本文中,由于这些费用造成的社会和环境影响增加,我们调查一个经过培训的单一模型离线是否不仅能够为离线工作服务,而且能够同时工作,而无需任何额外的培训或适应。关于en- ⁇ de, es}的实验表明,除了促进采用成熟的离线技术和结构而不影响延时技术之外,离线解决方案还实现了类似或更好的翻译质量,与同时培训的同一模型相比,并与SimulST的艺术状态具有竞争力。