Distributional reinforcement learning for inventory management in multi-echelon supply chains
File(s)1-s2.0-S2772508122000643-main.pdf (3.52 MB)
Published version
Author(s)
Wu, Guoquan
Servia, Miguel Angel de Carvalho
Mowbray, Max
Type
Journal Article
Abstract
Reinforcement Learning (RL) is an effective method to solve stochastic sequential decision-making problems. This is a problem description common to supply chain operations, however, most RL algorithms are tailored for game-based benchmarks. Here, we propose a deep RL method tailored for supply chain problems. The proposed algorithm deploys a derivative free approach to balance exploration and exploitation of the neural policy’s parameter space, providing means to avoid low quality local optima. Furthermore, the method allows consideration of risk-sensitive formulations to learn a policy that optimizes, for example, the conditional value-at-risk. The capabilities of our algorithm are tested on a multi-echelon supply chain problem, and several combinatorial optimization problems. The results empirically demonstrate the method’s improved sample efficiency compared to the benchmark algorithm proximal policy optimization, and superior performance to shrinking horizon mixed integer formulations. Additionally, its risk-sensitive policy can offer protection from low probability, high severity scenarios. Finally, we provide a sensitivity analysis for technical intuition.
Date Issued
2023-03
Date Acceptance
2022-12-08
Citation
Digital Chemical Engineering, 2023, 6
Publisher
Elsevier
Journal / Book Title
Digital Chemical Engineering
Volume
6
Copyright Statement
© 2022 The Author(s). Published by Elsevier Ltd on behalf of Institution of Chemical Engineers (IChemE). This is an open access article under the CC
BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Identifier
https://www.sciencedirect.com/science/article/pii/S2772508122000643
Subjects
Distributional reinforcement learning
Engineering
Engineering, Chemical
Inventory management
Machine learning
Multi-echelon supply chains
Optimal control
PARTICLE SWARM OPTIMIZATION
Science & Technology
Technology
Publication Status
Published
Article Number
100073
Date Publish Online
2022-12-14