By using a computer keyboard as a finger recording device, we construct the largest existing dataset for gesture recognition via surface electromyography (sEMG), and use deep learning to achieve over 90% character-level accuracy on reconstructing typed text entirely from measured muscle potentials. We prioritize the temporal structure of the EMG signal instead of the spatial structure of the electrode layout, using network architectures inspired by those used for real-time spoken language transcription. Our architecture recognizes the rapid movements of natural computer typing, which occur at irregular intervals and often overlap in time. The extensive size of our dataset also allows us to study gesture recognition after synthetically downgrading the spatial or temporal resolution, showing the system capabilities necessary for real-time gesture recognition.
翻译:通过使用计算机键盘作为手指记录装置,我们构建了通过表面电感学(SEMG)进行手势识别的最大现有数据集,并利用深层学习实现90%以上的字符级精确度,完全根据测量的肌肉潜力对打字文本进行重建。我们利用实时口语笔录所用的网络结构,将环境管理小组信号的时间结构而不是电极布局的空间结构列为优先事项。我们的架构承认自然计算机打字的快速移动,这种移动间隔不定期,而且往往在时间上重叠。我们数据集的广度也使我们能够在合成地降级空间或时间分辨率后研究手势识别,显示实时手势识别所需的系统能力。