可解释的神经加法模型强化学习在库存管理中的应用 (Interpretable Reinforcement Learning via Neural Additive Models for Inventory Management) - 专知论文

会员服务 ·

0

Learning · 强化学习 · MoDELS · COVID-19 · AIM ·

2023 年 3 月 22 日

Interpretable Reinforcement Learning via Neural Additive Models for Inventory Management

翻译：可解释的神经加法模型强化学习在库存管理中的应用

Julien Siems,Maximilian Schambach,Sebastian Schulze,Johannes S. Otterbach

The COVID-19 pandemic has highlighted the importance of supply chains and the role of digital management to react to dynamic changes in the environment. In this work, we focus on developing dynamic inventory ordering policies for a multi-echelon, i.e. multi-stage, supply chain. Traditional inventory optimization methods aim to determine a static reordering policy. Thus, these policies are not able to adjust to dynamic changes such as those observed during the COVID-19 crisis. On the other hand, conventional strategies offer the advantage of being interpretable, which is a crucial feature for supply chain managers in order to communicate decisions to their stakeholders. To address this limitation, we propose an interpretable reinforcement learning approach that aims to be as interpretable as the traditional static policies while being as flexible and environment-agnostic as other deep learning-based reinforcement learning solutions. We propose to use Neural Additive Models as an interpretable dynamic policy of a reinforcement learning agent, showing that this approach is competitive with a standard full connected policy. Finally, we use the interpretability property to gain insights into a complex ordering strategy for a simple, linear three-echelon inventory supply chain.

翻译：COVID-19 大流行突显了供应链的重要性和数字化管理对于应对环境动态变化的作用。本文关注于为多阶段供应链（即多个阶段）开发动态库存订购策略。传统库存优化方法旨在确定一种静态的重新订购策略。因此，这些策略无法调整到诸如 COVID-19 危机期间观察到的动态变化。另一方面，传统策略具有可解释性的优点，这是供应链经理交流决策给利益相关者的重要特征。为了解决这一限制，我们提出了一种可解释的强化学习方法，旨在与传统静态策略一样具有可解释性，同时与其他深度强化学习解决方案一样具有灵活性和对环境不可知性。我们建议使用神经加法模型作为强化学习代理的可解释动态策略，并展示了这种方法在与标准全连接策略相竞争中是有竞争力的。最后，我们利用可解释性属性，以线性三阶段库存供应链的复杂订购策略为例，获得深刻见解。

0

相关内容

Learning

不可错过！华盛顿大学最新《可解释人工智能》课程，系统讲述XAI最新进展

不可错过！华盛顿大学最新《可解释人工智能》课程，系统讲述XAI最新进展

专知会员服务

70+阅读 · 2022年9月14日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

专知会员服务

46+阅读 · 2020年3月13日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

专知会员服务

152+阅读 · 2019年10月27日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

华盛顿大学医学院放射系助理教授朱成成团队招募磁共振成像/分析博士后

华盛顿大学医学院放射系助理教授朱成成团队招募磁共振成像/分析博士后

机器之心

1+阅读 · 2022年7月11日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

Heisenberg群与Minkowski空间中的非线性椭圆方程

国家自然科学基金

0+阅读 · 2014年12月31日

随机利率与随机波动率模型下保险公司最优投资与再保险问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

微分方程的不可积性与动力学行为

国家自然科学基金

0+阅读 · 2012年12月31日

考虑有限理性的供应链中断风险管理模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

金融风险模型的最优投资策略与风险管理研究

国家自然科学基金

5+阅读 · 2012年12月31日

基于数值方法的非线性随机系统的动力学行为及在供应链系统中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

复杂因素下金融风险度量与风险传染建模与风险管理

国家自然科学基金

0+阅读 · 2012年12月31日

供应链突发风险“情景-应对”型应急决策研究

国家自然科学基金

1+阅读 · 2011年12月31日

销售渠道及促销行为与供应链管理策略的交互影响研究

国家自然科学基金

0+阅读 · 2009年12月31日

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Arxiv

0+阅读 · 2023年5月12日

Deep Reinforcement Learning for Interference Management in UAV-based 3D Networks: Potentials and Challenges

Arxiv

0+阅读 · 2023年5月11日

Deep Reinforcement Learning for Multi-Agent Interaction

Arxiv

45+阅读 · 2022年8月2日

Explainable Deep Learning Methods in Medical Diagnosis: A Survey

Arxiv

35+阅读 · 2022年5月10日

A Survey on Trajectory Data Management, Analytics, and Learning

A Survey on Trajectory Data Management, Analytics, and Learning

Arxiv

16+阅读 · 2020年3月25日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

19+阅读 · 2019年1月14日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Video Captioning via Hierarchical Reinforcement Learning

Arxiv

20+阅读 · 2018年3月29日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！华盛顿大学最新《可解释人工智能》课程，系统讲述XAI最新进展

不可错过！华盛顿大学最新《可解释人工智能》课程，系统讲述XAI最新进展

专知会员服务

70+阅读 · 2022年9月14日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

专知会员服务

46+阅读 · 2020年3月13日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

专知会员服务

152+阅读 · 2019年10月27日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

华盛顿大学医学院放射系助理教授朱成成团队招募磁共振成像/分析博士后

华盛顿大学医学院放射系助理教授朱成成团队招募磁共振成像/分析博士后

机器之心

1+阅读 · 2022年7月11日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Arxiv

0+阅读 · 2023年5月12日

Deep Reinforcement Learning for Interference Management in UAV-based 3D Networks: Potentials and Challenges

Arxiv

0+阅读 · 2023年5月11日

Deep Reinforcement Learning for Multi-Agent Interaction

Arxiv

45+阅读 · 2022年8月2日

Explainable Deep Learning Methods in Medical Diagnosis: A Survey

Arxiv

35+阅读 · 2022年5月10日

A Survey on Trajectory Data Management, Analytics, and Learning

A Survey on Trajectory Data Management, Analytics, and Learning

Arxiv

16+阅读 · 2020年3月25日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

19+阅读 · 2019年1月14日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Video Captioning via Hierarchical Reinforcement Learning

Arxiv

20+阅读 · 2018年3月29日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

相关基金

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

Heisenberg群与Minkowski空间中的非线性椭圆方程

国家自然科学基金

0+阅读 · 2014年12月31日

随机利率与随机波动率模型下保险公司最优投资与再保险问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

微分方程的不可积性与动力学行为

国家自然科学基金

0+阅读 · 2012年12月31日

考虑有限理性的供应链中断风险管理模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

金融风险模型的最优投资策略与风险管理研究

国家自然科学基金

5+阅读 · 2012年12月31日

基于数值方法的非线性随机系统的动力学行为及在供应链系统中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

复杂因素下金融风险度量与风险传染建模与风险管理

国家自然科学基金

0+阅读 · 2012年12月31日

供应链突发风险“情景-应对”型应急决策研究

国家自然科学基金

1+阅读 · 2011年12月31日

销售渠道及促销行为与供应链管理策略的交互影响研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员