liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Symmetries and Expressive Requirements for Learning General Policies
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering. (AIICS)ORCID iD: 0000-0002-1350-2144
RWTH Aachen University, Germany.ORCID iD: 0000-0002-4092-8175
Universitat Pompeu Fabra, Spain.ORCID iD: 0000-0001-8212-7408
RWTH Aachen University, Germany.ORCID iD: 0000-0001-9851-8219
2024 (English)In: Proceedings of the 21st International Conference on Principles of Knowledge Representation and Reasoning (KR2024), 2024Conference paper, Published paper (Refereed)
Abstract [en]

State symmetries play an important role in planning and generalized planning. In the first case, state symmetries can be used to reduce the size of the search; in the second, to reduce the size of the training set. In the case of general planning, however, it is also critical to distinguish non-symmetric states, i.e., states that represent non-isomorphic relational structures. However, while the language of first-order logic distinguishes non-symmetric states, the languages and architectures used to represent and learn general policies do not. In particular, recent approaches for learning general policies use state features derived from description logics or learned via graph neural networks (GNNs) that are known to be limited by the expressive power of C2, first-order logic with two variables and counting. In this work, we address the problem of detecting symmetries in planning and generalized planning and use the results to assess the expressive requirements for learning general policies over various planning domains. For this, we map planning states to plain graphs, run off-the-shelf algorithms to determine whether two states are isomorphic with respect to the goal, and run coloring algorithms to determine if C2 features computed logically or via GNNs distinguish non-isomorphic states. Symmetry detection results in more effective learning, while the failure to detect non-symmetries prevents general policies from being learned at all in certain domains.

Place, publisher, year, edition, pages
2024.
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:liu:diva-207393OAI: oai:DiVA.org:liu-207393DiVA, id: diva2:1895860
Conference
21st International Conference on Principles of Knowledge Representation and Reasoning, Hanoi, Vietnam, November 2-8, 2024
Funder
Knut and Alice Wallenberg FoundationEU, European Research Council, 885107Available from: 2024-09-07 Created: 2024-09-07 Last updated: 2025-02-27
In thesis
1. Learning and Exploiting Subgoal Structures in Classical Planning: Towards Reliable and Transparent Intelligent Agents that Learn to Plan on Multiple Levels
Open this publication in new window or tab >>Learning and Exploiting Subgoal Structures in Classical Planning: Towards Reliable and Transparent Intelligent Agents that Learn to Plan on Multiple Levels
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Classical planning aims to find a plan that is a sequence of actions allowing an intelligent agent to move from its current situation to one that satisfies the goal. Finding a plan is computationally challenging. Agents in the real world often encounter structurally similar problems with varying objects but the same predicates, actions, and related goals. Generalized planning aims to find a general plan that compactly encodes efficiently obtainable plans for each problem in an infinitely large class of structurally similar problems. Thus, we can query a general plan to efficiently obtain a plan for any problem in the class. A general plan may encode behavior on different levels of abstraction. High-level abstractions include subgoal structures that encode stepping stones towards the goal. Subgoal structures play a central role in human problem-solving, enabling reasoning at a higher level before working out the details of a plan. Learning simple, compact, meaningful, and efficient subgoal structures and their hierarchies without human intervention is an open challenge.

This thesis introduces a method for learning subgoal structures with a crisp characterization; they decompose problems into subproblems of controllable polynomial complexity. We represent subgoal structures using the recently introduced policy sketches language, whose simple syntax and semantics build the theoretical foundation of our work. We extend our method to address the long-standing problem of learning hierarchical policies. Our extended method iteratively decomposes classes of problems into classes of subproblems with strictly smaller polynomial complexity, resulting in effective hierarchical decompositions. Our methods learn from small example problems using combinatorial optimization. They seek the syntactically simplest solution, enabling interpretability and allowing us often to establish their correctness for an entire problem class. When learning methods fail, it often results from limited scalability or a lack of language expressivity. We develop two methods to address these limitations. First, we develop symmetry-based abstractions to reduce redundancy in training data and improve learning efficiency. Second, we develop a method for testing the language expressivity requirement of benchmark sets using first-order logic. Moreover, we take steps toward developing a scalable planning framework that avoids an exponential preprocessing step known as grounding, which is often unnecessary in generalized planning. Our framework supports expressive language features such as conditional effects and derived predicates that cannot concisely be compiled away, enabling researchers to model and address more complex planning problems.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2025. p. 77
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2439
National Category
Artificial Intelligence
Identifiers
urn:nbn:se:liu:diva-211903 (URN)10.3384/9789181180206 (DOI)9789181180190 (ISBN)9789181180206 (ISBN)
Public defence
2025-03-24, Ada Lovelace, B Building, Campus Valla, Linköping, 09:15 (English)
Opponent
Supervisors
Available from: 2025-02-27 Created: 2025-02-27 Last updated: 2025-02-27Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Drexler, DominikStåhlberg, SimonGeffner, Hector

Search in DiVA

By author/editor
Drexler, DominikStåhlberg, SimonBonet, BlaiGeffner, Hector
By organisation
Artificial Intelligence and Integrated Computer SystemsFaculty of Science & Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 263 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf