Recent advances have enabled automatic sound recognition systems for deaf and hard of hearing (DHH) users on mobile devices. However, these tools use pre-trained, generic sound recognition models, which do not meet the diverse needs of DHH users. We introduce ProtoSound, an interactive system for customizing sound recognition models by recording a few examples, thereby enabling personalized and fine-grained categories. ProtoSound is motivated by prior work examining sound awareness needs of DHH people and by a survey we conducted with 472 DHH participants. To evaluate ProtoSound, we characterized performance on two real-world sound datasets, showing significant improvement over state-of-the-art (e.g., +9.7% accuracy on the first dataset). We then deployed ProtoSound's end-user training and real-time recognition through a mobile application and recruited 19 hearing participants who listened to the real-world sounds and rated the accuracy across 56 locations (e.g., homes, restaurants, parks). Results show that ProtoSound personalized the model on-device in real-time and accurately learned sounds across diverse acoustic contexts. We close by discussing open challenges in personalizable sound recognition, including the need for better recording interfaces and algorithmic improvements.
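The abstract does not spell out the recognition algorithm, but the core idea of "customizing sound recognition models by recording a few examples" can be illustrated with a prototype-based (nearest-class-mean) few-shot classifier over audio embeddings. The sketch below is a minimal, hypothetical illustration of that general technique, not ProtoSound's implementation; `embed_audio`, the 128-dimensional embedding size, and all other names are assumptions.

```python
# Minimal sketch of one way few-shot sound personalization could work:
# average the embeddings of a user's few recordings per class into a
# "prototype", then label new audio by its nearest prototype.
# NOTE: illustrative assumption only, not ProtoSound's actual code;
# `embed_audio` stands in for any pretrained audio embedding model.
import numpy as np


def embed_audio(waveform: np.ndarray) -> np.ndarray:
    """Placeholder for a pretrained audio embedding (hypothetical)."""
    # In practice this would be, e.g., a log-mel front end plus a frozen
    # CNN; here we return a deterministic pseudo-random vector instead.
    seed = abs(hash(waveform.tobytes())) % (2**32)
    return np.random.default_rng(seed).standard_normal(128)


def build_prototypes(support: dict[str, list[np.ndarray]]) -> dict[str, np.ndarray]:
    """Average each class's few example embeddings into one prototype."""
    return {label: np.mean([embed_audio(w) for w in waves], axis=0)
            for label, waves in support.items()}


def classify(query: np.ndarray, prototypes: dict[str, np.ndarray]) -> str:
    """Assign the query clip to the nearest prototype (Euclidean distance)."""
    q = embed_audio(query)
    return min(prototypes, key=lambda label: np.linalg.norm(q - prototypes[label]))


# Usage: a user records a handful of short clips per personalized category.
support_set = {
    "my_doorbell": [np.random.randn(16000) for _ in range(5)],
    "kettle_whistle": [np.random.randn(16000) for _ in range(5)],
}
protos = build_prototypes(support_set)
print(classify(np.random.randn(16000), protos))
```

Because the prototypes are just averaged embedding vectors, personalization of this kind can run on-device without retraining the underlying network, which is consistent with the real-time, on-device behavior the abstract reports.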