Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Computing
  4. Computing
  5. Skill-conditioned policy optimization with successor features representations
 
  • Details
Skill-conditioned policy optimization with successor features representations
File(s)
scopa_accepted.pdf (6.52 MB)
Accepted version
Author(s)
Faldor, Maxence
Grillotti, Luca
Gonzalez Leon, Borja
Cully, Antoine
Type
Conference Paper
Abstract
A key aspect of intelligence is the ability to exhibit a wide range of behaviors to adapt to unforeseen situations. Designing artificial agents that are capable of showcasing a broad spectrum of skills is a long-standing challenge in Artificial Intelligence. In the last decade, progress in deep reinforcement learning has enabled to solve complex tasks with high-dimensional, continuous state and action spaces. However, most approaches return only one highly-specialized solution to a single problem. We introduce a Skill-Conditioned OPtimal Agent (SCOPA) that leverages successor features representations to learn a continuous range of skills that solve a task. We extend the generalized policy iteration framework with a policy skill improvement update based on successor features that is analogous to the classic policy improvement update. This novel skill improvement update enables to efficiently learn executing skills. From this result, we develop an algorithm that seamlessly unifies value function and successor features policy iteration with constrained optimization to (1) maximize performance, while (2) executing the desired skills. Compared with other skill-conditioned reinforcement learning methods, SCOPA reaches significantly higher performance and skill space coverage on challenging continuous control locomotion tasks with various types of skills. We also demonstrate that the diversity of skills is useful in five downstream adaptation tasks. Videos of our results are available at: https://bit.ly/scopa.
Date Issued
2023-10-28
Date Acceptance
2023-10-28
Citation
2023, pp.1-34
URI
http://hdl.handle.net/10044/1/115697
Start Page
1
End Page
34
Copyright Statement
© 2023 The Author(s).
Identifier
https://maxencefaldor.github.io/
Source
Workshop: Agent Learning in Open-Endedness Workshop at NeurIPS
Publication Status
Published
Start Date
2023-12-15
Coverage Spatial
New Orleans, LA, United States
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback