2
IRUS TotalDownloads
Altmetric
Optimizing minimal agents through abstraction
File | Description | Size | Format | |
---|---|---|---|---|
DTR06-4.pdf | Published version | 243.42 kB | Adobe PDF | View/Open |
Title: | Optimizing minimal agents through abstraction |
Authors: | Broda, K Hogger, C |
Item Type: | Report |
Abstract: | Abstraction is a valuable tool for dealing with scalability in large state space contexts. This paper addresses the design, using abstraction, of good policies for minimal autonomous agents applied within a situation-graph-framework. In this framework an agent’s policy is some function that maps perceptual inputs to actions deterministically. A good policy disposes the agent towards achieving one or more designated goal situations, and the design process aims to identify such policies. The agents to which the framework applies are assumed to have only partial observability, and in particular may not be able to perceive fully a goal situation. A further assumption is that the environment may influence an agent’s situation by unpredictable exogenous events, so that a policy cannot take advantage, of a reliable history of previous actions. The Bellman discount measure provides a means of evaluating situations and hence the overall value of a policy. When abstraction is used, the accuracy of the method can be significantly improved by modifying the standard Bellman equations. This paper describes the modification and demonstrates its power through comparison with simulation results. |
Issue Date: | 1-Jan-2006 |
URI: | http://hdl.handle.net/10044/1/95422 |
DOI: | https://doi.org/10.25561/95422 |
Publisher: | Department of Computing, Imperial College London |
Start Page: | 1 |
End Page: | 12 |
Journal / Book Title: | Departmental Technical Report: 06/4 |
Copyright Statement: | © 2006 The Author(s). This report is available open access under a CC-BY-NC-ND (https://creativecommons.org/licenses/by-nc-nd/4.0/) |
Publication Status: | Published |
Article Number: | 06/4 |
Appears in Collections: | Computing Computing Technical Reports |
This item is licensed under a Creative Commons License