RMLGym: a formal reward machine framework for reinforcement learning
File(s)paper1.pdf (2.02 MB)
Published version
Author(s)
Unniyankal, H
Belardinelli, F
Ferrando, A
Malvone, V
Type
Conference Paper
Abstract
Reinforcement learning (RL) is a powerful technique for learning optimal policies from trial and error. However, designing a reward function that captures the desired behavior of an agent is often a challenging and tedious task, especially when the agent has to deal with complex and multi-objective problems. To address this issue, researchers have proposed to use higher-level languages, such as Signal Temporal Logic (STL), to specify reward functions in a declarative and expressive way, and then automatically compile them into lower-level functions that can be used by standard RL algorithms. In this paper, we present RMLGym, a tool that integrates RML, a runtime verification tool, with OpenAI Gym, a popular framework for developing and comparing RL algorithms. RMLGym allows users to define reward functions using RML specifications and then generates reward monitors that evaluate the agent’s performance and provide feedback at each step. We demonstrate the usefulness and flexibility of RMLGym by applying it to a famous benchmark problem from OpenAI Gym, and we analyze the strengths and limitations of our approach.
Date Issued
2023
Date Acceptance
2023-11-06
Citation
CEUR Workshop Proceedings, 2023, 3579, pp.1-16
ISSN
1613-0073
Publisher
CEUR Workshop Proceedings
Start Page
1
End Page
16
Journal / Book Title
CEUR Workshop Proceedings
Volume
3579
Copyright Statement
© 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
License URL
Identifier
https://ceur-ws.org/Vol-3579/
Source
WOA 2023 24th Workshop "From Objects to Agents"
Publication Status
Published
Start Date
2023-11-06
Finish Date
2023-11-08
Coverage Spatial
Roma, Italy
Date Publish Online
2023