A Reinforcement Learning-assisted Genetic Programming Algorithm for Team Formation Problem Considering Person-Job Matching


Programming · Surrogate model · Algorithm · Reinforcement learning · Planning model

April 8, 2023


Yangyang Guo, Hao Wang, Lei He, Witold Pedrycz, P. N. Suganthan, Yanjie Song

From arXiv, 16 pages

An efficient team is essential for a company to complete new projects successfully. To solve the team formation problem considering person-job matching (TFP-PJM), a 0-1 integer programming model is constructed that accounts for the effects of both person-job matching and team members' willingness to communicate on team efficiency, with the person-job matching score calculated using intuitionistic fuzzy numbers. A reinforcement learning-assisted genetic programming algorithm (RL-GP) is then proposed to improve solution quality. RL-GP adopts an ensemble population strategy: before the population evolves in each generation, the agent selects one of four population search modes according to the information obtained, balancing exploration and exploitation. In addition, surrogate models are used to evaluate the formation plans generated by individuals, which speeds up the learning process. A series of comparison experiments is conducted to verify the overall performance of RL-GP and the effectiveness of the improved strategies within the algorithm. The hyper-heuristic rules obtained through efficient learning can serve as decision-making aids when forming project teams. This study reveals the advantages of applying reinforcement learning methods, ensemble strategies, and surrogate models to the GP framework. The diversity and intelligent selection of search patterns, along with fast adaptation evaluation, are distinct features that enable RL-GP to be deployed in real-world enterprise environments.
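The per-generation choice among four search modes can be illustrated with a small sketch. This is not the authors' RL-GP: the abstract does not name the search modes, the agent's state, or the reward, so the code below swaps in a plain epsilon-greedy value learner (a bandit-style simplification). The mode labels, the `mean_gain` payoffs, and the reward defined as a noisy fitness gain are all assumptions made for illustration.

```python
import random

# Hypothetical labels for the four population search modes; the abstract
# does not name them, so these are placeholders only.
SEARCH_MODES = ["mode_a", "mode_b", "mode_c", "mode_d"]


class ModeSelector:
    """Epsilon-greedy value learner over search modes (an assumed design)."""

    def __init__(self, n_modes, epsilon=0.1, alpha=0.5, seed=0):
        self.q = [0.0] * n_modes      # estimated value of each mode
        self.epsilon = epsilon        # probability of exploring
        self.alpha = alpha            # learning rate
        self.rng = random.Random(seed)

    def select(self):
        # Explore with probability epsilon, otherwise take the best-valued mode.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.q))
        return max(range(len(self.q)), key=lambda i: self.q[i])

    def update(self, mode, reward):
        # Move the estimate for the chosen mode toward the observed reward.
        self.q[mode] += self.alpha * (reward - self.q[mode])


def run(generations=2000, seed=0):
    """Toy loop: each mode returns a noisy gain around a fixed, made-up mean."""
    env_rng = random.Random(seed + 1)
    mean_gain = [0.1, 0.3, 0.8, 0.2]  # invented payoffs, not from the paper
    agent = ModeSelector(len(SEARCH_MODES), seed=seed)
    counts = [0] * len(SEARCH_MODES)
    for _ in range(generations):
        mode = agent.select()         # pick a mode before the population evolves
        # Stand-in for the fitness improvement observed after one generation.
        reward = mean_gain[mode] + env_rng.uniform(-0.05, 0.05)
        agent.update(mode, reward)
        counts[mode] += 1
    return agent.q, counts


if __name__ == "__main__":
    q_values, usage = run()
    for name, q, n in zip(SEARCH_MODES, q_values, usage):
        print(f"{name}: value={q:.3f}, selected {n} times")
```

In this toy setting the selector drifts toward the mode with the highest assumed payoff while still sampling the others occasionally, which is the exploration and exploitation balance the abstract refers to. A fuller treatment would condition the choice on population statistics rather than treating every generation identically.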


