Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Computing
  4. Computing
  5. Distributional constrained reinforcement learning for supply chain optimization
 
  • Details
Distributional constrained reinforcement learning for supply chain optimization
OA Location
https://doi.org/10.48550/arXiv.2302.01727
Author(s)
Bermúdez, Jaime Sabal
del Rio Chanona, Antonio
Tsay, Calvin
Type
Chapter
Abstract
This work studies reinforcement learning (RL) in the context of multi-period supply chains subject to constraints, e.g., on inventory. We introduce Distributional Constrained Policy Optimization (DCPO), a novel approach for reliable constraint satisfaction in RL. Our approach is based on Constrained Policy Optimization (CPO), which is subject to approximation errors that in practice lead it to converge to infeasible policies. We address this issue by incorporating aspects of distributional RL. Using a supply chain case study, we show that DCPO improves the rate at which the RL policy converges and ensures reliable constraint satisfaction by the end of training. The proposed method also greatly reduces the variance of returns between runs; this result is significant in the context of policy gradient methods, which intrinsically introduce high variance during training.
Date Issued
2023
Citation
Computer Aided Chemical Engineering, 2023, pp.1649-1654
URI
http://hdl.handle.net/10044/1/110860
URL
http://dx.doi.org/10.1016/b978-0-443-15274-0.50262-6
DOI
https://www.dx.doi.org/10.1016/b978-0-443-15274-0.50262-6
ISBN
9780443152740
Publisher
Elsevier
Start Page
1649
End Page
1654
Journal / Book Title
Computer Aided Chemical Engineering
Copyright Statement
Copyright © 2023 Elsevier B.V. All rights reserved.
Identifier
http://dx.doi.org/10.1016/b978-0-443-15274-0.50262-6
Publication Status
Published
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback