We study the sequential decision-making problem of allocating a limited resource to agents that reveal their stochastic demands on arrival over a finite horizon. Our goal is to design fair allocation algorithms that exhaust the available resource budget. This is challenging in sequential settings, where information about future demands is unavailable at decision time. We formulate the problem as a discrete-time Markov decision process (MDP). We propose a new algorithm, SAFFE, that makes allocations that are fair with respect to the entire set of demands revealed over the horizon by accounting for expected future demands at each arrival time. The algorithm introduces regularization that allows currently revealed demands to be prioritized over potential future demands, depending on the uncertainty in agents' future demands. Using the MDP formulation, we show that SAFFE optimizes allocations based on an upper bound on the Nash Social Welfare fairness objective, and we bound its gap to optimality using concentration bounds on total future demands. On synthetic and real data, we compare SAFFE against existing approaches and against a reinforcement learning policy trained on the MDP. We show that SAFFE yields fairer and more efficient allocations and achieves close-to-optimal performance in settings with dense arrivals.
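To give a rough sense of the underlying objective (this is an illustrative sketch, not the paper's exact SAFFE update), the snippet below allocates a remaining budget by water-filling, capping each agent at its currently revealed demand plus an estimate of its expected future demand; water-filling is the closed-form maximizer of the sum of log-allocations under such per-agent caps, which mirrors the Nash Social Welfare upper bound described above. The function name `nsw_waterfill` and its interface are our own hypothetical choices for illustration.

```python
import numpy as np

def nsw_waterfill(revealed, expected_future, budget):
    """Illustrative water-filling allocation (not the paper's exact algorithm).

    Each agent i is capped at revealed[i] + expected_future[i], i.e. its
    currently revealed demand plus an estimate of its future demand. Among
    agents with positive anticipated demand, the result maximizes
    sum_i log(x_i) subject to x_i <= cap_i and sum_i x_i <= budget.
    """
    caps = np.asarray(revealed, dtype=float) + np.asarray(expected_future, dtype=float)
    alloc = np.zeros_like(caps)
    active = caps > 0                      # agents that can still receive resource
    remaining = float(budget)
    while remaining > 1e-12 and active.any():
        share = remaining / active.sum()   # tentative equal split among active agents
        residual = caps - alloc
        capped = active & (residual <= share)
        if capped.any():
            # Agents whose cap is below the equal share receive their full cap.
            remaining -= residual[capped].sum()
            alloc[capped] = caps[capped]
            active &= ~capped
        else:
            # Every remaining agent can absorb the equal share; budget is spent.
            alloc[active] += share
            remaining = 0.0
    return alloc

# Example: 3 agents, budget 10, revealed demands plus expected future demands as caps.
print(nsw_waterfill(revealed=[2.0, 5.0, 1.0], expected_future=[1.0, 4.0, 0.5], budget=10.0))
```

The equal-split-then-cap loop recovers the capped log-utility optimum because the optimal allocation has the form x_i = min(cap_i, v) for a common water level v; agents capped in earlier rounds necessarily have caps below the final level, so the loop terminates after at most one round per agent.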