设计对AI系统:选择、考虑和取舍的分类评价 (Designing Disaggregated Evaluations of AI Systems: Choices, Considerations, and Tradeoffs) - 专知论文

会员服务 ·

0

Performer · Better · SimPLe · 可理解性 · 设计 ·

2021 年 12 月 1 日

Designing Disaggregated Evaluations of AI Systems: Choices, Considerations, and Tradeoffs

翻译：设计对AI系统:选择、考虑和取舍的分类评价

Solon Barocas,Anhong Guo,Ece Kamar,Jacquelyn Krones,Meredith Ringel Morris,Jennifer Wortman Vaughan,Duncan Wadsworth,Hanna Wallach

Disaggregated evaluations of AI systems, in which system performance is assessed and reported separately for different groups of people, are conceptually simple. However, their design involves a variety of choices. Some of these choices influence the results that will be obtained, and thus the conclusions that can be drawn; others influence the impacts -- both beneficial and harmful -- that a disaggregated evaluation will have on people, including the people whose data is used to conduct the evaluation. We argue that a deeper understanding of these choices will enable researchers and practitioners to design careful and conclusive disaggregated evaluations. We also argue that better documentation of these choices, along with the underlying considerations and tradeoffs that have been made, will help others when interpreting an evaluation's results and conclusions.

翻译：在概念上,对不同人群的系统进行系统性能评估和单独报告的独立评价系统进行分类评价,从概念上讲,是简单的,但是,其设计涉及各种选择,其中一些选择影响到将获得的结果,从而影响可以得出的结论;另一些选择影响分门别类的评价对人的影响,既有益又有害,包括利用数据进行评价的人。我们争辩说,更深入地了解这些选择将使研究人员和从业人员能够设计仔细和结论性分门别类的评价。我们还认为,更好地记录这些选择以及基本考虑和权衡,将有助于他人解释评价结果和结论。

0

相关内容

Performer

深度神经网络模型的个体差异，Individual differences among deep neural network models

深度神经网络模型的个体差异，Individual differences among deep neural network models

专知会员服务

9+阅读 · 2020年1月11日

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

专知会员服务

7+阅读 · 2019年11月18日

【Freddy Lecue博士】Thales嵌入式可解释AI：关键系统中AI的采用（Thales Embedded Explainable AI: Towards the Adoption of AI in Critical Systems.），AI Accelerator Summit 2019

【Freddy Lecue博士】Thales嵌入式可解释AI：关键系统中AI的采用（Thales Embedded Explainable AI: Towards the Adoption of AI in Critical Systems.），AI Accelerator Summit 2019

专知会员服务

20+阅读 · 2019年11月11日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

47+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

33+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

174+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

81+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

103+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

52+阅读 · 2019年9月29日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

9+阅读 · 2019年1月29日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Arxiv

0+阅读 · 2022年2月5日

Perspectives of Visualization Onboarding and Guidance in VA

Arxiv

0+阅读 · 2022年2月4日

Algorithmic Fairness Datasets: the Story so Far

Arxiv

0+阅读 · 2022年2月3日

Understanding the Role of Context in Creating Enjoyable Co-Located Interactions

Arxiv

0+阅读 · 2022年2月3日

Measuring Disparate Outcomes of Content Recommendation Algorithms with Distributional Inequality Metrics

Measuring Disparate Outcomes of Content Recommendation Algorithms with Distributional Inequality Metrics

Arxiv

0+阅读 · 2022年2月3日

Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms

Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms

Arxiv

0+阅读 · 2022年2月2日

FuxiCTR: An Open Benchmark for Click-Through Rate Prediction

Arxiv

7+阅读 · 2020年9月12日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

6+阅读 · 2019年11月5日

Context in Neural Machine Translation: A Review of Models and Evaluations

Arxiv

5+阅读 · 2019年1月25日

Physical Primitive Decomposition

Physical Primitive Decomposition

Arxiv

4+阅读 · 2018年9月13日

VIP会员

文章信息

相关主题

相关VIP内容

深度神经网络模型的个体差异，Individual differences among deep neural network models

深度神经网络模型的个体差异，Individual differences among deep neural network models

专知会员服务

9+阅读 · 2020年1月11日

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

专知会员服务

7+阅读 · 2019年11月18日

【Freddy Lecue博士】Thales嵌入式可解释AI：关键系统中AI的采用（Thales Embedded Explainable AI: Towards the Adoption of AI in Critical Systems.），AI Accelerator Summit 2019

【Freddy Lecue博士】Thales嵌入式可解释AI：关键系统中AI的采用（Thales Embedded Explainable AI: Towards the Adoption of AI in Critical Systems.），AI Accelerator Summit 2019

专知会员服务

20+阅读 · 2019年11月11日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

47+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

33+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

174+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

81+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

103+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

52+阅读 · 2019年9月29日

热门VIP内容

相关资讯

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

9+阅读 · 2019年1月29日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Arxiv

0+阅读 · 2022年2月5日

Perspectives of Visualization Onboarding and Guidance in VA

Arxiv

0+阅读 · 2022年2月4日

Algorithmic Fairness Datasets: the Story so Far

Arxiv

0+阅读 · 2022年2月3日

Understanding the Role of Context in Creating Enjoyable Co-Located Interactions

Arxiv

0+阅读 · 2022年2月3日

Measuring Disparate Outcomes of Content Recommendation Algorithms with Distributional Inequality Metrics

Measuring Disparate Outcomes of Content Recommendation Algorithms with Distributional Inequality Metrics

Arxiv

0+阅读 · 2022年2月3日

Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms

Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms

Arxiv

0+阅读 · 2022年2月2日

FuxiCTR: An Open Benchmark for Click-Through Rate Prediction

Arxiv

7+阅读 · 2020年9月12日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

6+阅读 · 2019年11月5日

Context in Neural Machine Translation: A Review of Models and Evaluations

Arxiv

5+阅读 · 2019年1月25日

Physical Primitive Decomposition

Physical Primitive Decomposition

Arxiv

4+阅读 · 2018年9月13日

微信扫码咨询专知VIP会员