Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • About
  • Communities & Collections
  • Advanced Search
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Bioengineering
  4. Bioengineering
  5. Episodic self-imitation learning with hindsight
 
  • Details
Episodic self-imitation learning with hindsight
File(s)
electronics-09-01742-v2.pdf (2.18 MB)
Published version
Author(s)
Dai, Tianhong
Liu, Hengyan
Bharath, Anil
Type
Journal Article
Abstract
Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm, which samples good state–action pairs from the experience replay buffer, our agent leverages entire episodes with hindsight to aid self-imitation learning. A selection module is introduced to filter uninformative samples from each episode of the update. The proposed method overcomes the limitations of the standard self-imitation learning algorithm, a transitions-based method which performs poorly in handling continuous control environments with sparse rewards. From the experiments, episodic self-imitation learning is shown to perform better than baseline on-policy algorithms, achieving comparable performance to state-of-the-art off-policy algorithms in several simulated robot control tasks. The trajectory selection module is shown to prevent the agent learning undesirable hindsight experiences. With the capability of solving sparse reward problems in continuous control settings, episodic self-imitation learning has the potential to be applied to real-world problems that have continuous action spaces, such as robot guidance and manipulation.
Date Issued
2020-10-21
Date Acceptance
2020-10-16
Citation
Electronics (Basel), 2020, 9 (10)
URI
http://hdl.handle.net/10044/1/84854
DOI
https://www.dx.doi.org/10.3390/electronics9101742
ISSN
2079-9292
Publisher
MDPI AG
Journal / Book Title
Electronics (Basel)
Volume
9
Issue
10
Copyright Statement
©2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open accessarticle distributed under the terms and conditions of the Creative Commons Attribution(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
License URL
http://creativecommons.org/licenses/by/4.0/
Subjects
cs.AI
cs.AI
cs.RO
0906 Electrical and Electronic Engineering
Publication Status
Published
Article Number
ARTN 1742
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback