liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning and Exploiting Subgoal Structures in Classical Planning: Towards Reliable and Transparent Intelligent Agents that Learn to Plan on Multiple Levels
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-1350-2144
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Classical planning aims to find a plan that is a sequence of actions allowing an intelligent agent to move from its current situation to one that satisfies the goal. Finding a plan is computationally challenging. Agents in the real world often encounter structurally similar problems with varying objects but the same predicates, actions, and related goals. Generalized planning aims to find a general plan that compactly encodes efficiently obtainable plans for each problem in an infinitely large class of structurally similar problems. Thus, we can query a general plan to efficiently obtain a plan for any problem in the class. A general plan may encode behavior on different levels of abstraction. High-level abstractions include subgoal structures that encode stepping stones towards the goal. Subgoal structures play a central role in human problem-solving, enabling reasoning at a higher level before working out the details of a plan. Learning simple, compact, meaningful, and efficient subgoal structures and their hierarchies without human intervention is an open challenge.

This thesis introduces a method for learning subgoal structures with a crisp characterization; they decompose problems into subproblems of controllable polynomial complexity. We represent subgoal structures using the recently introduced policy sketches language, whose simple syntax and semantics build the theoretical foundation of our work. We extend our method to address the long-standing problem of learning hierarchical policies. Our extended method iteratively decomposes classes of problems into classes of subproblems with strictly smaller polynomial complexity, resulting in effective hierarchical decompositions. Our methods learn from small example problems using combinatorial optimization. They seek the syntactically simplest solution, enabling interpretability and allowing us often to establish their correctness for an entire problem class. When learning methods fail, it often results from limited scalability or a lack of language expressivity. We develop two methods to address these limitations. First, we develop symmetry-based abstractions to reduce redundancy in training data and improve learning efficiency. Second, we develop a method for testing the language expressivity requirement of benchmark sets using first-order logic. Moreover, we take steps toward developing a scalable planning framework that avoids an exponential preprocessing step known as grounding, which is often unnecessary in generalized planning. Our framework supports expressive language features such as conditional effects and derived predicates that cannot concisely be compiled away, enabling researchers to model and address more complex planning problems.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2025. , p. 77
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2439
National Category
Artificial Intelligence
Identifiers
URN: urn:nbn:se:liu:diva-211903DOI: 10.3384/9789181180206ISBN: 9789181180190 (print)ISBN: 9789181180206 (electronic)OAI: oai:DiVA.org:liu-211903DiVA, id: diva2:1940902
Public defence
2025-03-24, Ada Lovelace, B Building, Campus Valla, Linköping, 09:15 (English)
Opponent
Supervisors
Available from: 2025-02-27 Created: 2025-02-27 Last updated: 2025-02-27Bibliographically approved
List of papers
1. Expressing and Exploiting Subgoal Structure in Classical Planning Using Sketches
Open this publication in new window or tab >>Expressing and Exploiting Subgoal Structure in Classical Planning Using Sketches
2024 (English)In: The journal of artificial intelligence research, ISSN 1076-9757, E-ISSN 1943-5037, Vol. 80, p. 171-208Article in journal (Refereed) Published
Abstract [en]

Width-based planning methods deal with conjunctive goals by decomposing problems into subproblems of low width. Algorithms like SIW thus fail when the goal is not easily serializable in this way or when some of the subproblems have a high width. In this work, we address these limitations by using a simple but powerful language for expressing finer problem decompositions introduced recently by Bonet and Geffner, called policy sketches. A policy sketch R over a set of Boolean and numerical features is a set of sketch rules C -> E that express how the values of these features are supposed to change. Like general policies, policy sketches are domain general, but unlike policies, the changes captured by sketch rules do not need to be achieved in a single step. We show that many planning domains that cannot be solved by SIW are provably solvable in low polynomial time with the SIWR algorithm, the version of SIW that employs user-provided policy sketches. Policy sketches are thus shown to be a powerful language for expressing domain-specific knowledge in a simple and compact way and a convenient alternative to languages such as HTNs or temporal logics. Furthermore, they make it easy to express general problem decompositions and prove key properties of them like their width and complexity.

Place, publisher, year, edition, pages
AI ACCESS FOUNDATION, 2024
Keywords
classical planning, subgoal structure, knowledge representation languages
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-204189 (URN)10.1613/jair.1.15821 (DOI)001237982500001 ()2-s2.0-85196110676 (Scopus ID)
Funder
Knut and Alice Wallenberg FoundationSwedish National Infrastructure for Computing (SNIC), 2022-06725Swedish National Infrastructure for Computing (SNIC), 2018-05973EU, Horizon 2020, 952215European Commission, 885107
Note

Funding Agencies|ERC Advanced Grant [885107]; EU [952215]; Wallenberg AI, Autonomous Systems and Software Program (WASP) - Knut and Alice Wallenberg Foundation; Wallenberg Guest Professor at Linkoping University, Sweden - Swedish Research Council [2022-06725, 2018-05973]

Available from: 2024-06-05 Created: 2024-06-05 Last updated: 2025-08-13
2. Learning Sketches for Decomposing Planning Problems into Subproblems of Bounded Width
Open this publication in new window or tab >>Learning Sketches for Decomposing Planning Problems into Subproblems of Bounded Width
2022 (English)In: Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS2022), Palo Alto, California USA, 2022, p. 62-70Conference paper, Published paper (Refereed)
Abstract [en]

Recently, sketches have been introduced as a general language for representing the subgoal structure of instances drawn from the same domain. Sketches are collections of rules of the form C -> E over a given set of features where C expresses Boolean conditions and E expresses qualitative changes. Each sketch rule defines a subproblem: going from a state that satisfies C to a state that achieves the change expressed by E or a goal state. Sketches can encode simple goal serializations, general policies, or decompositions of bounded width that can be solved greedily, in polynomial time, by the SIW_R variant of the SIW algorithm. Previous work has shown the computational value of sketches over benchmark domains that, while tractable, are challenging for domain-independent planners. In this work, we address the problem of learning sketches automatically given a planning domain, some instances of the target class of problems, and the desired bound on the sketch width. We present a logical formulation of the problem, an implementation using the ASP solver Clingo, and experimental results. The sketch learner and the SIW_R planner yield a domain-independent planner that learns and exploits domain structure in a crisp and explicit form.

Place, publisher, year, edition, pages
Palo Alto, California USA: , 2022
Series
Proceedings of the International Conference on Automated Planning and Scheduling, ISSN 2334-0835, E-ISSN 2334-0843
Keywords
Classical planning, Serialized Iterated Width, Learning the Subgoal Structure, Policy Sketches, Combinatorial Optimization Approach
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-184354 (URN)10.1609/icaps.v32i1.19786 (DOI)2-s2.0-85134742351 (Scopus ID)9781577358749 (ISBN)
Conference
32nd International Conference on Automated Planning and Scheduling, Singapore (Virtual), June 13-24, 2022
Funder
Knut and Alice Wallenberg FoundationEU, Horizon 2020, 952215European Commission, 885107Swedish National Infrastructure for Computing (SNIC), 2018-05973
Available from: 2022-04-14 Created: 2022-04-14 Last updated: 2025-11-18Bibliographically approved
3. Learning Hierarchical Policies by Iteratively Reducing the Width of Sketch Rules
Open this publication in new window or tab >>Learning Hierarchical Policies by Iteratively Reducing the Width of Sketch Rules
2023 (English)In: 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023, International Joint Conferences on Artificial Intelligence , 2023, p. 208-218Conference paper, Published paper (Refereed)
Abstract [en]

Hierarchical policies are a key ingredient of intelligent behavior, expressing the different levels of abstraction involved in the solution of a problem. Learning hierarchical policies, however, remains a challenge, as no general learning principles have been identified for this purpose, despite the broad interest and vast literature in both model-free reinforcement learning and model-based planning. In this work, we introduce a principled method for learning hierarchical policies over classical planning domains, with no supervision from small instances. The method is based on learning to decompose problems into subproblems so that the subproblems have a lower complexity as measured by their width. Problems and subproblems are captured by means of sketch rules, and the scheme for reducing the width of sketch rules is applied iteratively until the final sketch rules have zero width and encode a general policy. We evaluate the learning method on a number of classical planning domains, analyze the resulting hierarchical policies, and prove their properties. We also show that learning hierarchical policies by learning and refining sketches iteratively is often more efficient than learning flat general policies in one shot.

Place, publisher, year, edition, pages
International Joint Conferences on Artificial Intelligence, 2023
Series
Proceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning, ISSN 2334-1033
Keywords
classical planning, learning hierarchical policies, policy sketches language, planning width
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-196015 (URN)10.24963/kr.2023/21 (DOI)978-1-956792-02-7 (ISBN)
Conference
20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023
Funder
Swedish National Infrastructure for Computing (SNIC), 2018-05973, 2022-06725National Supercomputer Centre (NSC), SwedenWallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon 2020, 952215EU, European Research Council, 885107
Available from: 2023-06-30 Created: 2023-06-30 Last updated: 2025-11-13
4. Symmetries and Expressive Requirements for Learning General Policies
Open this publication in new window or tab >>Symmetries and Expressive Requirements for Learning General Policies
2024 (English)In: Proceedings of the 21st International Conference on Principles of Knowledge Representation and Reasoning (KR2024), 2024Conference paper, Published paper (Refereed)
Abstract [en]

State symmetries play an important role in planning and generalized planning. In the first case, state symmetries can be used to reduce the size of the search; in the second, to reduce the size of the training set. In the case of general planning, however, it is also critical to distinguish non-symmetric states, i.e., states that represent non-isomorphic relational structures. However, while the language of first-order logic distinguishes non-symmetric states, the languages and architectures used to represent and learn general policies do not. In particular, recent approaches for learning general policies use state features derived from description logics or learned via graph neural networks (GNNs) that are known to be limited by the expressive power of C2, first-order logic with two variables and counting. In this work, we address the problem of detecting symmetries in planning and generalized planning and use the results to assess the expressive requirements for learning general policies over various planning domains. For this, we map planning states to plain graphs, run off-the-shelf algorithms to determine whether two states are isomorphic with respect to the goal, and run coloring algorithms to determine if C2 features computed logically or via GNNs distinguish non-isomorphic states. Symmetry detection results in more effective learning, while the failure to detect non-symmetries prevents general policies from being learned at all in certain domains.

National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-207393 (URN)
Conference
21st International Conference on Principles of Knowledge Representation and Reasoning, Hanoi, Vietnam, November 2-8, 2024
Funder
Knut and Alice Wallenberg FoundationEU, European Research Council, 885107
Available from: 2024-09-07 Created: 2024-09-07 Last updated: 2025-02-27

Open Access in DiVA

fulltext(2472 kB)364 downloads
File information
File name FULLTEXT01.pdfFile size 2472 kBChecksum SHA-512
9c491466068f85fa5a7c2a2766b9e8326554180a14430584b0ce0ad814670dd60145bda83965afd2a58e93f0d596a9dc75ede8e58b4a3c822e79f20e4c959b3c
Type fulltextMimetype application/pdf
Order online >>

Other links

Publisher's full text

Authority records

Drexler, Dominik

Search in DiVA

By author/editor
Drexler, Dominik
By organisation
Artificial Intelligence and Integrated Computer SystemsFaculty of Science & Engineering
Artificial Intelligence

Search outside of DiVA

GoogleGoogle Scholar
Total: 364 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 3653 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf