This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.
翻译:本文介绍关于通用语音控制使用案例通用语音控制的英语口语指令的新的开放源数据集Timers and Such。我们描述了现有口语理解数据集中的空白,这些数据集是定时器和这种数据库填充的,数据集的设计与创建,以及一些基于ASR和端到端基线模型的实验,该模型的代码已作为SpeopleBrain工具包的一部分提供。