ERSAM: 用于高效能和实时社交氛围测量的神经架构搜索 (ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement) - 专知论文

会员服务 ·

0

模型评估 · INTERACT · state-of-the-art · CC · 错误率 ·

2023 年 3 月 19 日

ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement

翻译：ERSAM: 用于高效能和实时社交氛围测量的神经架构搜索

Chaojian Li,Wenwan Chen,Jiayi Yuan, Yingyan, Lin,Ashutosh Sabharwal

from arxiv, Accepted by ICASSP'23

Social ambiance describes the context in which social interactions happen, and can be measured using speech audio by counting the number of concurrent speakers. This measurement has enabled various mental health tracking and human-centric IoT applications. While on-device Socal Ambiance Measure (SAM) is highly desirable to ensure user privacy and thus facilitate wide adoption of the aforementioned applications, the required computational complexity of state-of-the-art deep neural networks (DNNs) powered SAM solutions stands at odds with the often constrained resources on mobile devices. Furthermore, only limited labeled data is available or practical when it comes to SAM under clinical settings due to various privacy constraints and the required human effort, further challenging the achievable accuracy of on-device SAM solutions. To this end, we propose a dedicated neural architecture search framework for Energy-efficient and Real-time SAM (ERSAM). Specifically, our ERSAM framework can automatically search for DNNs that push forward the achievable accuracy vs. hardware efficiency frontier of mobile SAM solutions. For example, ERSAM-delivered DNNs only consume 40 mW x 12 h energy and 0.05 seconds processing latency for a 5 seconds audio segment on a Pixel 3 phone, while only achieving an error rate of 14.3% on a social ambiance dataset generated by LibriSpeech. We can expect that our ERSAM framework can pave the way for ubiquitous on-device SAM solutions which are in growing demand.

翻译：社交氛围描述了社交互动发生的上下文，可以使用语音音频通过计算同时发言者的数量来测量。这种测量启用了各种精神健康跟踪和以人为中心的IoT应用。虽然在设备上使用社交氛围测量（SAM）非常理想，以确保用户隐私，从而促进上述应用的广泛采用，但基于深度神经网络（DNN）的现代SAM解决方案所需的计算复杂度与移动设备上的通常受限资源存在矛盾。此外，由于各种隐私约束和所需的人力投入，在临床环境下进行SAM时只有有限的标记数据可用或实用，这进一步挑战了在设备上实现的SAM解决方案的可实现准确性。为此，我们提出了一种专门的神经架构搜索框架，用于能源高效和实时SAM（ERSAM）。具体而言，我们的ERSAM框架可以自动搜索推进移动SAM解决方案的可实现准确性与硬件效率的前沿的DNN。例如，我们提供的ERSAM-DNN仅在Pixel 3手机上对5秒音频片段消耗40 mW x 12 h的能量和0.05秒的处理延迟，同时仅在通过LibriSpeech生成的社交氛围数据集上达到14.3％的误差率。我们可以预期，我们的ERSAM框架可以为普及的在设备上使用的SAM解决方案铺平道路，这在需求不断增长。

0

相关内容

模型评估

机器学习系统设计系统评估标准

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

专知会员服务

41+阅读 · 2020年7月14日

【2020论文翻译】基于SARSA的深度强化学习的移动边缘计算任务分流和资源分配

【2020论文翻译】基于SARSA的深度强化学习的移动边缘计算任务分流和资源分配

专知会员服务

21+阅读 · 2020年5月20日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【论文】边缘计算:对当前计划的全面调查和可持续边缘计算发展的路线图（Edge Computing: A Comprehensive Surveyof Current Initiativesand a Roadmap for a Sustainable Edge Computing Development）

【论文】边缘计算:对当前计划的全面调查和可持续边缘计算发展的路线图（Edge Computing: A Comprehensive Surveyof Current Initiativesand a Roadmap for a Sustainable Edge Computing Development）

专知会员服务

29+阅读 · 2019年12月19日

【Svitlana博士论文以及答辩slides】基于知识的对话搜索（Knowledge-based Conversational Search），附145页pdf论文，55页ppt

【Svitlana博士论文以及答辩slides】基于知识的对话搜索（Knowledge-based Conversational Search），附145页pdf论文，55页ppt

专知会员服务

48+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

机器学习研究会

11+阅读 · 2017年12月5日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

云计算环境下属性基密码及其应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

多租户数据管理关键技术研究

国家自然科学基金

6+阅读 · 2015年12月31日

人毛囊真皮乳头细胞的体外分化与毛囊重建

国家自然科学基金

0+阅读 · 2014年12月31日

可并行化计算雷达网信号级融合检测算法研究

国家自然科学基金

2+阅读 · 2013年12月31日

SO2暴露诱导肺癌肿瘤转移效应及其上皮间质转化（EMT）调控机制

国家自然科学基金

0+阅读 · 2013年12月31日

面向移动用户的Web数据集成技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

强激光驱动气体产生的强太赫兹辐射源的理论和数值模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

时频重叠信号脉内特征提取与协同分选

国家自然科学基金

0+阅读 · 2012年12月31日

不确定环境下集装箱码头物流运作能力仿真建模与动态评估

国家自然科学基金

0+阅读 · 2011年12月31日

灌注微生物反应器扩增培养ADSCs在缺血性脑梗死动物模型中的功能性神经网络构建

国家自然科学基金

0+阅读 · 2011年12月31日

Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks

Arxiv

0+阅读 · 2023年5月8日

Machine Learning Systems are Bloated and Vulnerable

Arxiv

1+阅读 · 2023年5月8日

Portfolio-Based Incentive Mechanism Design for Cross-Device Federated Learning

Arxiv

0+阅读 · 2023年5月6日

The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

Arxiv

0+阅读 · 2023年5月4日

Trust in Human-AI Interaction: Scoping Out Models, Measures, and Methods

Arxiv

22+阅读 · 2022年4月30日

Graph Neural Networks in IoT: A Survey

Arxiv

22+阅读 · 2022年3月31日

Artificial Intelligence for the Metaverse: A Survey

Arxiv

31+阅读 · 2022年2月15日

A Survey of Machine Learning for Computer Architecture and Systems

Arxiv

18+阅读 · 2021年2月16日

Graph Neural Network for Traffic Forecasting: A Survey

Arxiv

35+阅读 · 2021年1月27日

Neural Architecture Search: A Survey

Arxiv

12+阅读 · 2018年9月5日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

专知会员服务

41+阅读 · 2020年7月14日

【2020论文翻译】基于SARSA的深度强化学习的移动边缘计算任务分流和资源分配

【2020论文翻译】基于SARSA的深度强化学习的移动边缘计算任务分流和资源分配

专知会员服务

21+阅读 · 2020年5月20日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【论文】边缘计算:对当前计划的全面调查和可持续边缘计算发展的路线图（Edge Computing: A Comprehensive Surveyof Current Initiativesand a Roadmap for a Sustainable Edge Computing Development）

【论文】边缘计算:对当前计划的全面调查和可持续边缘计算发展的路线图（Edge Computing: A Comprehensive Surveyof Current Initiativesand a Roadmap for a Sustainable Edge Computing Development）

专知会员服务

29+阅读 · 2019年12月19日

【Svitlana博士论文以及答辩slides】基于知识的对话搜索（Knowledge-based Conversational Search），附145页pdf论文，55页ppt

【Svitlana博士论文以及答辩slides】基于知识的对话搜索（Knowledge-based Conversational Search），附145页pdf论文，55页ppt

专知会员服务

48+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维与高维空间中对潜在表征的分析、建模与变换

《美军使用大语言模型技术生成领域特定文档》2025最新379页

【NeurIPS 2025】以语言为中心的全模态表征学习的可扩展性研究

智能体化多模态大语言模型综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

机器学习研究会

11+阅读 · 2017年12月5日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

相关论文

Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks

Arxiv

0+阅读 · 2023年5月8日

Machine Learning Systems are Bloated and Vulnerable

Arxiv

1+阅读 · 2023年5月8日

Portfolio-Based Incentive Mechanism Design for Cross-Device Federated Learning

Arxiv

0+阅读 · 2023年5月6日

The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

Arxiv

0+阅读 · 2023年5月4日

Trust in Human-AI Interaction: Scoping Out Models, Measures, and Methods

Arxiv

22+阅读 · 2022年4月30日

Graph Neural Networks in IoT: A Survey

Arxiv

22+阅读 · 2022年3月31日

Artificial Intelligence for the Metaverse: A Survey

Arxiv

31+阅读 · 2022年2月15日

A Survey of Machine Learning for Computer Architecture and Systems

Arxiv

18+阅读 · 2021年2月16日

Graph Neural Network for Traffic Forecasting: A Survey

Arxiv

35+阅读 · 2021年1月27日

Neural Architecture Search: A Survey

Arxiv

12+阅读 · 2018年9月5日

相关基金

云计算环境下属性基密码及其应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

多租户数据管理关键技术研究

国家自然科学基金

6+阅读 · 2015年12月31日

人毛囊真皮乳头细胞的体外分化与毛囊重建

国家自然科学基金

0+阅读 · 2014年12月31日

可并行化计算雷达网信号级融合检测算法研究

国家自然科学基金

2+阅读 · 2013年12月31日

SO2暴露诱导肺癌肿瘤转移效应及其上皮间质转化（EMT）调控机制

国家自然科学基金

0+阅读 · 2013年12月31日

面向移动用户的Web数据集成技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

强激光驱动气体产生的强太赫兹辐射源的理论和数值模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

时频重叠信号脉内特征提取与协同分选

国家自然科学基金

0+阅读 · 2012年12月31日

不确定环境下集装箱码头物流运作能力仿真建模与动态评估

国家自然科学基金

0+阅读 · 2011年12月31日

灌注微生物反应器扩增培养ADSCs在缺血性脑梗死动物模型中的功能性神经网络构建

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员