Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Chemical Engineering
  4. Chemical Engineering
  5. Distributional reinforcement learning for inventory management in multi-echelon supply chains
 
  • Details
Distributional reinforcement learning for inventory management in multi-echelon supply chains
File(s)
1-s2.0-S2772508122000643-main.pdf (3.52 MB)
Published version
Author(s)
Wu, Guoquan
Servia, Miguel Angel de Carvalho
Mowbray, Max
Type
Journal Article
Abstract
Reinforcement Learning (RL) is an effective method to solve stochastic sequential decision-making problems. This is a problem description common to supply chain operations, however, most RL algorithms are tailored for game-based benchmarks. Here, we propose a deep RL method tailored for supply chain problems. The proposed algorithm deploys a derivative free approach to balance exploration and exploitation of the neural policy’s parameter space, providing means to avoid low quality local optima. Furthermore, the method allows consideration of risk-sensitive formulations to learn a policy that optimizes, for example, the conditional value-at-risk. The capabilities of our algorithm are tested on a multi-echelon supply chain problem, and several combinatorial optimization problems. The results empirically demonstrate the method’s improved sample efficiency compared to the benchmark algorithm proximal policy optimization, and superior performance to shrinking horizon mixed integer formulations. Additionally, its risk-sensitive policy can offer protection from low probability, high severity scenarios. Finally, we provide a sensitivity analysis for technical intuition.
Date Issued
2023-03
Date Acceptance
2022-12-08
Citation
Digital Chemical Engineering, 2023, 6
URI
http://hdl.handle.net/10044/1/112964
URL
https://www.sciencedirect.com/science/article/pii/S2772508122000643
DOI
https://www.dx.doi.org/10.1016/j.dche.2022.100073
Publisher
Elsevier
Journal / Book Title
Digital Chemical Engineering
Volume
6
Copyright Statement
© 2022 The Author(s). Published by Elsevier Ltd on behalf of Institution of Chemical Engineers (IChemE). This is an open access article under the CC
BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
License URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
Identifier
https://www.sciencedirect.com/science/article/pii/S2772508122000643
Subjects
Distributional reinforcement learning
Engineering
Engineering, Chemical
Inventory management
Machine learning
Multi-echelon supply chains
Optimal control
PARTICLE SWARM OPTIMIZATION
Science & Technology
Technology
Publication Status
Published
Article Number
100073
Date Publish Online
2022-12-14
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback