In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender System (RS) applications. However, current MTL-based recommendation models tend to disregard the session-wise patterns of user-item interactions because they are predominantly constructed on item-wise datasets. Moreover, balancing multiple objectives has always been a challenge in this field, which is typically side-stepped via linear estimations in existing works. To address these issues, in this paper, we propose a Reinforcement Learning (RL) enhanced MTL framework, namely RMTL, which combines the losses of different recommendation tasks using dynamic weights. Specifically, the RMTL structure addresses the two aforementioned issues by (i) constructing an MTL environment from session-wise interactions, (ii) training a multi-task actor-critic network structure that is compatible with most existing MTL-based recommendation models, and (iii) optimizing and fine-tuning the MTL loss function using the weights generated by the critic networks. Experiments on two real-world public datasets demonstrate the effectiveness of RMTL, which achieves higher AUC than state-of-the-art MTL-based recommendation models. We further evaluate and validate RMTL's compatibility and transferability across various MTL models.
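To make the core idea concrete, the following is a minimal sketch of the dynamically weighted loss combination the abstract describes; the exact weighting scheme is specified in the paper body, so the form below is an illustrative assumption in which $\omega_k$ denotes the weight produced by the $k$-th critic network for the loss $\mathcal{L}_k$ of task $k$:

$$\mathcal{L}_{\text{RMTL}} = \sum_{k=1}^{K} \omega_k \, \mathcal{L}_k,$$

where, unlike the fixed linear combinations used in prior work, each $\omega_k$ is updated throughout training based on the critic's evaluation of session-wise interactions.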