Billions of distributed, heterogeneous and resource-constrained smart consumer devices deploy on-device machine learning (ML) to deliver private, fast and offline inference on personal data. On-device ML systems are highly context dependent and sensitive to user, usage, hardware and environmental attributes. Despite this sensitivity and the propensity towards bias in ML, bias in on-device ML has not been studied. This paper studies the propagation of bias through design choices in on-device ML development workflows. We position reliability bias, which arises from disparate device failures across demographic groups, as a source of unfairness in on-device ML settings and define metrics to quantify it. We then identify complex and interacting technical design choices in the on-device ML workflow that can lead to disparate performance across user groups, and thus to reliability bias. Finally, we show with an empirical case study that seemingly innocuous design choices, such as the data sample rate, the pre-processing parameters used to construct input features, and pruning hyperparameters, propagate reliability bias through an audio keyword spotting development workflow. We leverage our insights to suggest strategies for developers to build fairer on-device ML.
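To make the notion of reliability bias concrete, the sketch below computes per-group performance of a classifier and a simple disparity score over those group results. This is a minimal, hypothetical illustration: the group labels, toy data, and the exact aggregation (sum of absolute log-ratios between each group's accuracy and the mean accuracy) are assumptions for exposition, not necessarily the paper's definitive formulation.

```python
# Minimal sketch of a group-disparity ("reliability bias") style score.
# Group names, toy data, and the aggregation below are illustrative assumptions.
import numpy as np

def group_accuracies(y_true, y_pred, groups):
    """Per-group accuracy of a classifier's predictions."""
    accs = {}
    for g in np.unique(groups):
        mask = groups == g
        accs[g] = float(np.mean(y_true[mask] == y_pred[mask]))
    return accs

def reliability_bias(accs):
    """Sum of absolute log-ratios between each group's accuracy and the
    mean accuracy across groups; 0 means all groups perform equally."""
    mean_acc = np.mean(list(accs.values()))
    return float(sum(abs(np.log(a / mean_acc)) for a in accs.values()))

# Toy example: keyword-spotting predictions for two hypothetical speaker groups.
y_true = np.array([1, 0, 1, 1, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0, 0, 1, 1, 0])
groups = np.array(["female", "female", "female", "female",
                   "male", "male", "male", "male"])

accs = group_accuracies(y_true, y_pred, groups)
print(accs)                    # {'female': 0.75, 'male': 0.75}
print(reliability_bias(accs))  # 0.0 when all groups perform identically
```

Under this kind of score, a developer can re-evaluate the same trained model after each design decision (e.g. changing the sample rate or applying pruning) and observe whether the disparity between user groups grows, which is the workflow-level propagation effect the paper studies.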