The feature subset selection problem aims at selecting the relevant subset of features to improve the performance of a Machine Learning (ML) algorithm on training data. Some features in data can be inherently noisy, costly to compute, improperly scaled, or correlated with other features, and they can adversely affect the accuracy, cost, and complexity of the induced algorithm. The goal of traditional feature selection approaches has been to remove such irrelevant features. In recent years, ML has been making a noticeable impact on the decision-making processes of our everyday lives. We want to ensure that these decisions do not reflect biased behavior towards certain groups or individuals based on protected attributes such as age, sex, or race. In this paper, we present a feature subset selection approach that jointly optimizes fairness and accuracy objectives and computes Pareto-optimal solutions using the NSGA-II algorithm. We use statistical disparity as the fairness metric and the F1-score as the model-performance metric. Our experiments on the most commonly used fairness benchmark datasets with three different machine learning algorithms show that the evolutionary algorithm can effectively explore the trade-off between fairness and accuracy.
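To make the setup concrete, the following is a minimal sketch (not the authors' code) of what NSGA-II-based fair feature selection can look like: each individual is a binary feature mask, and the two minimized objectives are (1 - F1-score) and statistical disparity. It assumes the `pymoo` and `scikit-learn` libraries, synthetic placeholder data, and logistic regression as a stand-in classifier; the paper's actual datasets, classifiers, and hyperparameters are not specified here.

```python
# Sketch of Pareto-optimal feature subset selection with NSGA-II.
# Assumptions: pymoo >= 0.6, scikit-learn; data and classifier are placeholders.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from pymoo.algorithms.moo.nsga2 import NSGA2
from pymoo.core.problem import ElementwiseProblem
from pymoo.operators.sampling.rnd import BinaryRandomSampling
from pymoo.operators.crossover.pntx import TwoPointCrossover
from pymoo.operators.mutation.bitflip import BitflipMutation
from pymoo.optimize import minimize


def statistical_disparity(y_pred, protected):
    """|P(y_hat = 1 | protected = 0) - P(y_hat = 1 | protected = 1)|."""
    return abs(y_pred[protected == 0].mean() - y_pred[protected == 1].mean())


class FairFeatureSelection(ElementwiseProblem):
    """Binary mask over features; two minimized objectives: (1 - F1), disparity."""

    def __init__(self, X_tr, X_te, y_tr, y_te, prot_te):
        super().__init__(n_var=X_tr.shape[1], n_obj=2, xl=0, xu=1, vtype=bool)
        self.X_tr, self.X_te = X_tr, X_te
        self.y_tr, self.y_te = y_tr, y_te
        self.prot_te = prot_te

    def _evaluate(self, mask, out, *args, **kwargs):
        if not mask.any():  # empty subset: assign worst values for both objectives
            out["F"] = [1.0, 1.0]
            return
        clf = LogisticRegression(max_iter=1000)
        clf.fit(self.X_tr[:, mask], self.y_tr)
        y_pred = clf.predict(self.X_te[:, mask])
        out["F"] = [1.0 - f1_score(self.y_te, y_pred),
                    statistical_disparity(y_pred, self.prot_te)]


# Hypothetical data: X (features), y (labels), prot (binary protected attribute).
rng = np.random.default_rng(0)
X, y, prot = rng.normal(size=(500, 20)), rng.integers(0, 2, 500), rng.integers(0, 2, 500)
X_tr, X_te, y_tr, y_te, _, prot_te = train_test_split(X, y, prot, random_state=0)

problem = FairFeatureSelection(X_tr, X_te, y_tr, y_te, prot_te)
algorithm = NSGA2(pop_size=50,
                  sampling=BinaryRandomSampling(),
                  crossover=TwoPointCrossover(),
                  mutation=BitflipMutation(),
                  eliminate_duplicates=True)
res = minimize(problem, algorithm, ("n_gen", 30), seed=1, verbose=False)
# res.X holds the Pareto-optimal feature masks; res.F the corresponding
# [(1 - F1), disparity] pairs, i.e. the fairness-accuracy trade-off front.
```

Because NSGA-II returns a whole non-dominated front rather than a single model, a practitioner can pick the feature subset whose fairness-accuracy trade-off best matches the application's constraints.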