Learning neural search policies for classical planning
File(s)icaps0307CameraReady.pdf (380.35 KB)
Accepted version
Author(s)
Gomoluch, Pawel
Alrajeh, Dalal
Russo, Alessandra
Bucchiarone, A
Type
Conference Paper
Abstract
Heuristic forward search is currently the dominant paradigmin classical planning. Forward search algorithms typicallyrely on a single, relatively simple variation of best-first searchand remain fixed throughout the process of solving a plan-ning problem. Existing work combining multiple search tech-niques usually aims at supporting best-first search with anadditional exploratory mechanism, triggered using a hand-crafted criterion. A notable exception is very recent workwhich combines various search techniques using a trainablepolicy. That approach, however, is confined to a discrete ac-tion space comprising several fixed subroutines.In this paper, we introduce a parametrized search algorithmtemplate which combines various search techniques withina single routine. The template’s parameter space defines aninfinite space of search algorithms, including, among others,BFS, local and random search. We then propose a neural ar-chitecture for designating the values of the search parametersgiven the state of the search. This enables expressing neuralsearch policies that change the values of the parameters asthe search progresses. The policies can be learned automat-ically, with the objective of maximizing the planner’s per-formance on a given distribution of planning problems. Weconsider a training setting based on a stochastic optimizationalgorithm known as thecross-entropy method(CEM). Exper-imental evaluation of our approach shows that it is capable offinding effective distribution-specific search policies, outper-forming the relevant baselines.
Date Issued
2020-05-29
Date Acceptance
2020-01-10
Citation
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020, 30, pp.522-530
ISSN
2334-0835
Publisher
AAAI
Start Page
522
End Page
530
Journal / Book Title
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling
Volume
30
Copyright Statement
2020, Association for the Advancement of ArtificialIntelligence (www.aaai.org). All rights reserved.
Identifier
https://ojs.aaai.org/index.php/ICAPS/issue/view/263
Source
International Conference on Automated Planning and Scheduling (ICAPS) 2020
Publication Status
Published
Start Date
2020-10-26
Finish Date
2020-10-30
Coverage Spatial
Nancy, France