An efficient team is essential for the company to successfully complete new projects. To solve the team formation problem considering person-job matching (TFP-PJM), a 0-1 integer programming model is constructed, which considers both person-job matching and team members' willingness to communicate on team efficiency, with the person-job matching score calculated using intuitionistic fuzzy numbers. Then, a reinforcement learning-assisted genetic programming algorithm (RL-GP) is proposed to enhance the quality of solutions. The RL-GP adopts the ensemble population strategies. Before the population evolution at each generation, the agent selects one from four population search modes according to the information obtained, thus realizing a sound balance of exploration and exploitation. In addition, surrogate models are used in the algorithm to evaluate the formation plans generated by individuals, which speeds up the algorithm learning process. Afterward, a series of comparison experiments are conducted to verify the overall performance of RL-GP and the effectiveness of the improved strategies within the algorithm. The hyper-heuristic rules obtained through efficient learning can be utilized as decision-making aids when forming project teams. This study reveals the advantages of reinforcement learning methods, ensemble strategies, and the surrogate model applied to the GP framework. The diversity and intelligent selection of search patterns along with fast adaptation evaluation, are distinct features that enable RL-GP to be deployed in real-world enterprise environments.
翻译:为了解决考虑人-职业匹配的团队组建问题(TFP-PJM),本文构建了一个0-1整数规划模型,该模型考虑了人-职业匹配以及团队成员愿意交流对团队效率的影响,其中人-职业匹配得分采用直觉模糊数计算。基于强化学习的基因程序设计算法(RL-GP)被提出来增强解的质量。RL-GP采用集成种群策略,在每个生成的种群进化之前,代理根据获得的信息从四种搜索模式中选择一种,从而实现探索和开发的平衡。此外,算法中使用代理模型来评估个体生成的组建方案,从而加速算法学习过程。然后,进行了一系列比较实验来验证RL-GP的整体性能和算法中改进策略的有效性。通过高效学习获得的超启发式规则可用作组建项目团队时的决策辅助工具。本研究揭示了强化学习方法、集成策略、代理模型应用于GP框架的优势。探索和智能选择搜索模式的多样性以及快速适应性评估是RL-GP的独特特点,使其可以部署在实际企业环境中。