Recent advances in spoken language understanding benefited from Self-Supervised models trained on large speech corpora. For French, the LeBenchmark project has made such models available and has led to impressive progress on several tasks including spoken language understanding. These advances have a non-negligible cost in terms of computation time and energy consumption. In this paper, we compare several learning strategies aiming at reducing such cost while keeping competitive performances. The experiments are performed on the MEDIA corpus, and show that it is possible to reduce the learning cost while maintaining state-of-the-art performances.
翻译:最近在口语理解方面取得的进展得益于在大型语言公司培训的自我监督模式,在法语方面,LeBenchmark项目提供了这种模式,并导致在包括口语理解在内的若干任务方面取得令人印象深刻的进展,这些进步在计算时间和能源消耗方面成本不可忽略,在本文件中,我们比较了旨在降低这种成本同时保持竞争性业绩的若干学习战略,实验是在MEDIA系统进行,并表明在保持最新业绩的同时降低学习成本是可能的。