数据科学家和主题物专家之间示范性绩效通信可视化准则 (Visualization Guidelines for Model Performance Communication Between Data Scientists and Subject Matter Experts) - 专知论文

会员服务 ·

0

Performer · 模型性能 · 讲稿 · MoDELS · 可理解性 ·

2022 年 5 月 11 日

Visualization Guidelines for Model Performance Communication Between Data Scientists and Subject Matter Experts

翻译：数据科学家和主题物专家之间示范性绩效通信可视化准则

Ashley Suh,Gabriel Appleby,Erik W. Anderson,Luca Finelli,Remco Chang,Dylan Cashman

Presenting the complexities of a model's performance is a communication bottleneck that threatens collaborations between data scientists and subject matter experts. Accuracy and error metrics alone fail to tell the whole story of a model - its risks, strengths, and limitations - making it difficult for subject matter experts to feel confident in deciding to use a model. As a result, models may fail in unexpected ways if their weaknesses are not clearly understood. Alternatively, models may go unused, as subject matter experts disregard poorly presented models in favor of familiar, yet arguably substandard methods. In this paper, we propose effective use of visualization as a medium for communication between data scientists and subject matter experts. Our research addresses the gap between common practices in model performance communication and the understanding of subject matter experts and decision makers. We derive a set of communication guidelines and recommended visualizations for communicating model performance based on interviews of both data scientists and subject matter experts at the same organization. We conduct a follow-up study with subject matter experts to evaluate the efficacy of our guidelines in presentations of model performance with and without our recommendations. We find that our proposed guidelines made subject matter experts more aware of the tradeoffs of the presented model. Participants realized that current communication methods left them without a robust understanding of the model's performance, potentially giving them misplaced confidence in the use of the model.

翻译：提出模型性能的复杂性是一个通信瓶颈,威胁到数据科学家和主题事项专家之间的合作; 精确度和误差度单靠精确度和误差度无法说明模型的整个故事——其风险、长处和局限性,使主题事项专家难以感到有信心决定使用模型; 结果,模型的弱点如果不被清楚理解,就可能以出乎意料的方式失败; 或者,模型可能没有使用,因为主题事项专家忽视了模型性能的模型,而忽略了模型性能的模型,而偏好于熟悉的、但可以说不合标准的方法; 在本文件中,我们建议有效利用可视化作为数据科学家和主题事项专家之间交流的媒介; 我们的研究解决了模型性能交流的共同做法与专题专家和决策者理解之间的差距; 我们制定了一套通信准则,用以根据对数据科学家和同一组织的专题专家的访谈,交流模型性能; 我们与主题事项专家进行后续研究,以评价我们在介绍模型性能和没有我们的建议时的指导方针的有效性。我们发现,我们提出的准则使主题事项专家更了解模型的利弊,而没有很好地理解模型。

0

相关内容

Performer

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Chemerin通过调节p38MAPK通路参与动脉粥样硬化分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

类石墨型氮化碳/铋基复合氧化物异质结光催化剂的制备及降解酚类内分泌干扰物废水研究

国家自然科学基金

0+阅读 · 2013年12月31日

POSS封口介孔二氧化硅/聚合物复合材料的介电性能与微观结构调控

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

SLC22A3-Histamin-LDL途径介导冠心病的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

多孔金属有机物骨架材料储氢性能改进的理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

生物可降解性多模态纳米微粒构建与TIMP-2、Endostatin联合靶向转运抑制动脉粥样硬化易损斑块血管发生的研究

国家自然科学基金

0+阅读 · 2009年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

AUDITEM: Toward an Automated and Efficient Data Integrity Verification Model Using Blockchain

Arxiv

0+阅读 · 2022年7月1日

CEDAR: Communication Efficient Distributed Analysis for Regressions

Arxiv

0+阅读 · 2022年7月1日

Hire the Experts: Combinatorial Auction Based Scheme for Experts Selection in E-Healthcare

Arxiv

0+阅读 · 2022年6月30日

Performative Reinforcement Learning

Arxiv

0+阅读 · 2022年6月30日

Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

Arxiv

0+阅读 · 2022年6月30日

An Intermediate-level Attack Framework on The Basis of Linear Regression

Arxiv

0+阅读 · 2022年6月30日

Learnable Model-Driven Performance Prediction and Optimization for Imperfect MIMO System: Framework and Application

Arxiv

0+阅读 · 2022年6月30日

SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice

Arxiv

0+阅读 · 2022年6月29日

MATCH: Metadata-Aware Text Classification in A Large Hierarchy

Arxiv

12+阅读 · 2021年2月15日

Multi-Pointer Co-Attention Networks for Recommendation

Arxiv

12+阅读 · 2018年1月28日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025】迈向鲁棒的零样本强化学习

一种基于视觉算法生成三维场景重建的多任务系统 | 2025最新200页

【普林斯顿博士论文】量化、评估与缓解现代机器学习系统中的风险

遥感中基于深度学习的领域自适应方法：全面综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

AUDITEM: Toward an Automated and Efficient Data Integrity Verification Model Using Blockchain

Arxiv

0+阅读 · 2022年7月1日

CEDAR: Communication Efficient Distributed Analysis for Regressions

Arxiv

0+阅读 · 2022年7月1日

Hire the Experts: Combinatorial Auction Based Scheme for Experts Selection in E-Healthcare

Arxiv

0+阅读 · 2022年6月30日

Performative Reinforcement Learning

Arxiv

0+阅读 · 2022年6月30日

Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

Arxiv

0+阅读 · 2022年6月30日

An Intermediate-level Attack Framework on The Basis of Linear Regression

Arxiv

0+阅读 · 2022年6月30日

Learnable Model-Driven Performance Prediction and Optimization for Imperfect MIMO System: Framework and Application

Arxiv

0+阅读 · 2022年6月30日

SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice

Arxiv

0+阅读 · 2022年6月29日

MATCH: Metadata-Aware Text Classification in A Large Hierarchy

Arxiv

12+阅读 · 2021年2月15日

Multi-Pointer Co-Attention Networks for Recommendation

Arxiv

12+阅读 · 2018年1月28日

相关基金

Chemerin通过调节p38MAPK通路参与动脉粥样硬化分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

类石墨型氮化碳/铋基复合氧化物异质结光催化剂的制备及降解酚类内分泌干扰物废水研究

国家自然科学基金

0+阅读 · 2013年12月31日

POSS封口介孔二氧化硅/聚合物复合材料的介电性能与微观结构调控

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

SLC22A3-Histamin-LDL途径介导冠心病的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

多孔金属有机物骨架材料储氢性能改进的理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

生物可降解性多模态纳米微粒构建与TIMP-2、Endostatin联合靶向转运抑制动脉粥样硬化易损斑块血管发生的研究

国家自然科学基金

0+阅读 · 2009年12月31日

磁性Pickering乳液界面流变学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员