Detection of Infant Crying in Real-World Home Environments Using Deep Learning

In the domain of social signal processing, audio event detection is a promising avenue for accessing daily behaviors that contribute to health and well-being. However, despite advances in mobile computing and machine learning, audio behavior detection models are largely constrained to data collected in controlled settings, such as call centers. This is a problem because their performance is unlikely to generalize to real-world applications. In this paper, we present a novel dataset of infant distress vocalizations compiled from over 780 hours of real-world audio data, collected via recorders worn by infants. We develop a model that combines deep spectrum and acoustic features to detect and classify infant distress vocalizations, which dramatically outperforms models trained on equivalent real-world data (F1 score of 0.597 vs 0.166). We end by discussing how dataset size can facilitate such gains in accuracy, which is critical when working with noisy and complex naturalistic data.
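
For readers unfamiliar with the metric quoted above, F1 is the harmonic mean of precision and recall. The following is a minimal sketch of that definition only. The function name `f1_score` and the counts are hypothetical and are not taken from the paper; it says nothing about how the authors segmented or scored their audio.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1, the harmonic mean of precision and recall.

    tp, fp, fn are counts of true positives, false positives and
    false negatives. Returns 0.0 when there are no true positives,
    since precision and recall are then zero or undefined.
    """
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only (not results from the paper).
example = f1_score(tp=60, fp=40, fn=40)
print(round(example, 3))
```

Because F1 ignores true negatives, it suits detection tasks where the event of interest is rare relative to background audio.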
