Cultivating Desired Behaviour: Policy Teaching Via Environment-Dynamics Tweaks
OA Location
http://eprints.soton.ac.uk/268470/
Author(s)
Rabinovich, Zinovi
Dufton, Lachlan
Larson, Kate
Jennings, Nick
Type
Conference Paper
Abstract
In this paper we study, for the first time explicitly, the implications of endowing an interested party (i.e. a teacher) with the ability to modify the underlying dynamics of the environment in order to encourage an agent to learn to follow a specific policy. We introduce a cost function that the teacher can use to balance the modifications it makes to the underlying environment dynamics against the learner’s performance relative to some ideal, desired policy. We formulate the teacher’s problem of determining optimal environment changes as a planning and control problem, and empirically validate the effectiveness of our model.
Date Issued
2010-05
Citation
2010, pp. 1097-1104
URI
http://hdl.handle.net/10044/1/37024
URL
http://eprints.soton.ac.uk/268470/
Start Page
1097
End Page
1104
Identifier
http://eprints.soton.ac.uk/268470/
Source
The 9th International Conference on Autonomous Agents and Multiagent Systems
Notes
keywords: Teacher-learner, control theory, Kullback-Leibler Rate
Publication Status
Unpublished