liu.seSearch for publications in DiVA
Change search
Link to record
Permanent link

Direct link
Geffner, Hector, ProfessorORCID iD iconorcid.org/0000-0001-9851-8219
Publications (10 of 13) Show all publications
Drexler, D., Ståhlberg, S., Bonet, B. & Geffner, H. (2024). Equivalence-Based Abstractions for Learning General Policies. In: Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning: . Paper presented at 34th International Conference on Automated Planning and Scheduling.
Open this publication in new window or tab >>Equivalence-Based Abstractions for Learning General Policies
2024 (English)In: Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning, 2024Conference paper, Published paper (Refereed)
Abstract [en]

Identifying state symmetries plays a crucial role in minimiz-ing the number of states explored during search, yet identify-ing precisely all symmetries is computationally hard. In thecontext of learning general policies that solve instances ofarbitrary size from small instances, however, this computa-tional bottleneck is not a problem. In this paper, we addressthe task of identifying all state symmetries through the lensof the graph isomorphism problem. To accomplish this, werepresent states as undirected, labeled graphs that reflect therelational structure of states and the goal. We then use off-the-shelf graph isomorphism algorithms to determine whethertwo states are isomorphic with respect to the goal. The iso-morphism relationship forms equivalent classes that result inan abstract state space that can be used instead of the origi-nal one to learn general policies more efficiently. While thisabstract state space can be used for many different learningtasks, we focus on learning symbolic general policies wherewe show that the proposed approach can lead to significantspeedups.

National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-215814 (URN)
Conference
34th International Conference on Automated Planning and Scheduling
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon 2020, 952215European Commission, 885107
Available from: 2025-06-29 Created: 2025-06-29 Last updated: 2025-08-13
Drexler, D., Seipp, J. & Geffner, H. (2024). Expressing and Exploiting Subgoal Structure in Classical Planning Using Sketches. The journal of artificial intelligence research, 80, 171-208
Open this publication in new window or tab >>Expressing and Exploiting Subgoal Structure in Classical Planning Using Sketches
2024 (English)In: The journal of artificial intelligence research, ISSN 1076-9757, E-ISSN 1943-5037, Vol. 80, p. 171-208Article in journal (Refereed) Published
Abstract [en]

Width-based planning methods deal with conjunctive goals by decomposing problems into subproblems of low width. Algorithms like SIW thus fail when the goal is not easily serializable in this way or when some of the subproblems have a high width. In this work, we address these limitations by using a simple but powerful language for expressing finer problem decompositions introduced recently by Bonet and Geffner, called policy sketches. A policy sketch R over a set of Boolean and numerical features is a set of sketch rules C -> E that express how the values of these features are supposed to change. Like general policies, policy sketches are domain general, but unlike policies, the changes captured by sketch rules do not need to be achieved in a single step. We show that many planning domains that cannot be solved by SIW are provably solvable in low polynomial time with the SIWR algorithm, the version of SIW that employs user-provided policy sketches. Policy sketches are thus shown to be a powerful language for expressing domain-specific knowledge in a simple and compact way and a convenient alternative to languages such as HTNs or temporal logics. Furthermore, they make it easy to express general problem decompositions and prove key properties of them like their width and complexity.

Place, publisher, year, edition, pages
AI ACCESS FOUNDATION, 2024
Keywords
classical planning, subgoal structure, knowledge representation languages
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-204189 (URN)10.1613/jair.1.15821 (DOI)001237982500001 ()2-s2.0-85196110676 (Scopus ID)
Funder
Knut and Alice Wallenberg FoundationSwedish National Infrastructure for Computing (SNIC), 2022-06725Swedish National Infrastructure for Computing (SNIC), 2018-05973EU, Horizon 2020, 952215European Commission, 885107
Note

Funding Agencies|ERC Advanced Grant [885107]; EU [952215]; Wallenberg AI, Autonomous Systems and Software Program (WASP) - Knut and Alice Wallenberg Foundation; Wallenberg Guest Professor at Linkoping University, Sweden - Swedish Research Council [2022-06725, 2018-05973]

Available from: 2024-06-05 Created: 2024-06-05 Last updated: 2025-08-13
Funkquist, M., Ståhlberg, S. & Geffner, H. (2024). Learning to Ground Existentially Quantified Goals. In: Proceedings of the Twenty-First International Conference on Principles of Knowledge Representation and Reasoning (KR 2024): . Paper presented at Conference on Principles of Knowledge Representation and Reasoning (KR 2024).
Open this publication in new window or tab >>Learning to Ground Existentially Quantified Goals
2024 (English)In: Proceedings of the Twenty-First International Conference on Principles of Knowledge Representation and Reasoning (KR 2024), 2024Conference paper, Published paper (Refereed)
Abstract [en]

Goal instructions for autonomous AI agents cannot assume that objects have unique names. Instead, objects in goals must be referred to by providing suitable descriptions. However, this raises problems in both classical planning and generalized planning. The standard approach to handling existentially quantified goals in classical planning involves compiling them into a DNF formula that encodes all possible variable bindings and adding dummy actions to map each DNF term into the new, dummy goal. This preprocessing is exponential in the number of variables. In generalized planning, the problem is different: even if general policies can deal with any initial situation and goal, executing a general policy requires the goal to be grounded to define a value for the policy features. The problem of grounding goals, namely finding the objects to bind the goal variables, is subtle: it is a generalization of classical planning, which is a special case when there are no goal variables to bind, and constraint reasoning, which is a special case when there are no actions. In this work, we address the goal grounding problem with a novel supervised learning approach. A GNN architecture, trained to predict the cost of partially quantified goals over small domain instances is tested on larger instances involving more objects and different quantified goals. The proposed architecture is evaluated experimentally over several planning domains where generalization is tested along several dimensions including the number of goal variables and objects that can bind such variables. The scope of the approach is also discussed in light of the known relationship between GNNs and C₂ logics.

Series
arXiv.org ; 2409.20259
Keywords
classical planning, gnn, deep learning, artificial intelligence
National Category
Artificial Intelligence
Identifiers
urn:nbn:se:liu:diva-214384 (URN)10.24963/kr.2024/80 (DOI)
Conference
Conference on Principles of Knowledge Representation and Reasoning (KR 2024)
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2025-06-06 Created: 2025-06-06 Last updated: 2025-06-06
Bonet, B., Drexler, D. & Geffner, H. (2024). On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies. In: Sara Bernardini and Christian Muise (Ed.), Proceedings of the Thirty-Fourth International Conference on Automated Planning and Scheduling (ICAPS 2024): . Paper presented at Thirty-Fourth International Conference on Automated Planning and Scheduling (ICAPS 2024), Banff, Alberta, Canada, June 1-6, 2024.
Open this publication in new window or tab >>On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies
2024 (English)In: Proceedings of the Thirty-Fourth International Conference on Automated Planning and Scheduling (ICAPS 2024) / [ed] Sara Bernardini and Christian Muise, 2024Conference paper, Published paper (Refereed)
Abstract [en]

Recently,a simple but powerful language for expressing and learning general policies and problem decompositions (sketches) has been introduced in terms of rules defined over a set of Boolean and numerical features. In this work, we consider three extensions of this language aimed at making policies and sketches more flexible and reusable: internal memory states, as in finite state controllers; indexical features, whose values are a function of the state and a number of internal registers that can be loaded with objects; and modules that wrap up policies and sketches and allow them to call each other by passing parameters. In addition, unlike general policies that select state transitions rather than ground actions, the new language allows for the selection of such actions. The expressive power of the resulting language for policies and sketches is illustrated through a number of examples.

Keywords
planning, knowledge representation
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-204032 (URN)
Conference
Thirty-Fourth International Conference on Automated Planning and Scheduling (ICAPS 2024), Banff, Alberta, Canada, June 1-6, 2024
Funder
EU, European Research Council, 885107EU, Horizon 2020, 952215Knut and Alice Wallenberg FoundationSwedish National Infrastructure for Computing (SNIC), 2022-06725Swedish National Infrastructure for Computing (SNIC), 2018-05973
Available from: 2024-06-01 Created: 2024-06-01 Last updated: 2024-06-28Bibliographically approved
Drexler, D., Ståhlberg, S., Bonet, B. & Geffner, H. (2024). Symmetries and Expressive Requirements for Learning General Policies. In: Proceedings of the 21st International Conference on Principles of Knowledge Representation and Reasoning (KR2024): . Paper presented at 21st International Conference on Principles of Knowledge Representation and Reasoning, Hanoi, Vietnam, November 2-8, 2024.
Open this publication in new window or tab >>Symmetries and Expressive Requirements for Learning General Policies
2024 (English)In: Proceedings of the 21st International Conference on Principles of Knowledge Representation and Reasoning (KR2024), 2024Conference paper, Published paper (Refereed)
Abstract [en]

State symmetries play an important role in planning and generalized planning. In the first case, state symmetries can be used to reduce the size of the search; in the second, to reduce the size of the training set. In the case of general planning, however, it is also critical to distinguish non-symmetric states, i.e., states that represent non-isomorphic relational structures. However, while the language of first-order logic distinguishes non-symmetric states, the languages and architectures used to represent and learn general policies do not. In particular, recent approaches for learning general policies use state features derived from description logics or learned via graph neural networks (GNNs) that are known to be limited by the expressive power of C2, first-order logic with two variables and counting. In this work, we address the problem of detecting symmetries in planning and generalized planning and use the results to assess the expressive requirements for learning general policies over various planning domains. For this, we map planning states to plain graphs, run off-the-shelf algorithms to determine whether two states are isomorphic with respect to the goal, and run coloring algorithms to determine if C2 features computed logically or via GNNs distinguish non-isomorphic states. Symmetry detection results in more effective learning, while the failure to detect non-symmetries prevents general policies from being learned at all in certain domains.

National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-207393 (URN)
Conference
21st International Conference on Principles of Knowledge Representation and Reasoning, Hanoi, Vietnam, November 2-8, 2024
Funder
Knut and Alice Wallenberg FoundationEU, European Research Council, 885107
Available from: 2024-09-07 Created: 2024-09-07 Last updated: 2025-02-27
Bonet, B., Drexler, D. & Geffner, H. (2023). General and Reusable Indexical Policies and Sketches. In: NeurIPS 2023 Workshop on Generalization in Planning (GenPlan 2023): . Paper presented at Conference on Neural Information Processing Systems.
Open this publication in new window or tab >>General and Reusable Indexical Policies and Sketches
2023 (English)In: NeurIPS 2023 Workshop on Generalization in Planning (GenPlan 2023), 2023Conference paper, Published paper (Refereed)
Abstract [en]

Recently, a simple but powerful language for expressing and learning generalpolicies and problem decompositions (sketches) has been introduced, which isbased on collections of rules defined on a set of Boolean and numerical features.In this work, we consider extensions of this basic language aimed at makingpolicies and sketches more flexible and reusable. For this, three basic extensionsare considered: 1) internal memory states, as in finite state controllers, 2) indexicalfeatures, whose values are a function of the state and a number of internal registersthat can be loaded with objects, and 3) modules that wrap up policies and sketchesand allow them to call each other by passing parameters. In addition, unlikepreviously defined policies that select actions indirectly by the selection of statetransitions, the new language allows for the selection of actions directly. Theexpressive power of the resulting language for recombining policies and sketchesis illustrated through examples. The problem of learning policies and sketches inthe new language is left for future work.

National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-215812 (URN)
Conference
Conference on Neural Information Processing Systems
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon 2020, 952215European Commission, 885107
Available from: 2025-06-29 Created: 2025-06-29 Last updated: 2025-08-13
Ståhlberg, S., Bonet, B. & Geffner, H. (2023). Learning General Policies with Policy Gradient Methods. In: Pierre Marquis, Tran Cao Son, Gabriele Kern-Isberner (Ed.), Proceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning: . Paper presented at 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, KR 2023 September 2-8, 2023 (pp. 647-657). International Joint Conferences on Artificial Intelligence
Open this publication in new window or tab >>Learning General Policies with Policy Gradient Methods
2023 (English)In: Proceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning / [ed] Pierre Marquis, Tran Cao Son, Gabriele Kern-Isberner, International Joint Conferences on Artificial Intelligence , 2023, p. 647-657Conference paper, Published paper (Refereed)
Abstract [en]

While reinforcement learning methods have delivered remarkable results in a number of settings, generalization, i.e., the ability to produce policies that generalize in a reliable and systematic way, has remained a challenge. The problem of generalization has been addressed formally in classical planning where provable correct policies that generalize over all instances of a given domain have been learned using combinatorial methods. The aim of this work is to bring these two research threads together to illuminate the conditions under which (deep) reinforcement learning approaches, and in particular, policy optimization methods, can be used to learn policies that generalize like combinatorial methods do. We draw on lessons learned from previous combinatorial and deep learning approaches, and extend them in a convenient way. From the former, we model policies as state transition classifiers, as (ground) actions are not general and change from instance to instance. From the latter, we use graph neural networks (GNNs) adapted to deal with relational structures for representing value functions over planning states, and in our case, policies. With these ingredients in place, we find that actor-critic methods can be used to learn policies that generalize almost as well as those obtained using combinatorial approaches while avoiding the scalability bottleneck and the use of feature pools. Moreover, the limitations of the DRL methods on the benchmarks considered have little to do with deep learning or reinforcement learning algorithms, and result from the well-understood expressive limitations of GNNs, and the tradeoff between optimality and generalization (general policies cannot be optimal in some domains). Both of these limitations are addressed without changing the basic DRL methods by adding derived predicates and an alternative cost structure to optimize.

Place, publisher, year, edition, pages
International Joint Conferences on Artificial Intelligence, 2023
Series
Proceedings of the Conference on Principles of Knowledge Representation and Reasoning, ISSN 2334-1033
Keywords
Reasoning about actions and change, action languages, Learning action theories, Symbolic reinforcement learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198747 (URN)10.24963/kr.2023/63 (DOI)9781956792027 (ISBN)
Conference
20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, KR 2023 September 2-8, 2023
Available from: 2023-10-25 Created: 2023-10-25 Last updated: 2025-11-13Bibliographically approved
Drexler, D., Seipp, J. & Geffner, H. (2023). Learning Hierarchical Policies by Iteratively Reducing the Width of Sketch Rules. In: 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023: . Paper presented at 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023 (pp. 208-218). International Joint Conferences on Artificial Intelligence
Open this publication in new window or tab >>Learning Hierarchical Policies by Iteratively Reducing the Width of Sketch Rules
2023 (English)In: 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023, International Joint Conferences on Artificial Intelligence , 2023, p. 208-218Conference paper, Published paper (Refereed)
Abstract [en]

Hierarchical policies are a key ingredient of intelligent behavior, expressing the different levels of abstraction involved in the solution of a problem. Learning hierarchical policies, however, remains a challenge, as no general learning principles have been identified for this purpose, despite the broad interest and vast literature in both model-free reinforcement learning and model-based planning. In this work, we introduce a principled method for learning hierarchical policies over classical planning domains, with no supervision from small instances. The method is based on learning to decompose problems into subproblems so that the subproblems have a lower complexity as measured by their width. Problems and subproblems are captured by means of sketch rules, and the scheme for reducing the width of sketch rules is applied iteratively until the final sketch rules have zero width and encode a general policy. We evaluate the learning method on a number of classical planning domains, analyze the resulting hierarchical policies, and prove their properties. We also show that learning hierarchical policies by learning and refining sketches iteratively is often more efficient than learning flat general policies in one shot.

Place, publisher, year, edition, pages
International Joint Conferences on Artificial Intelligence, 2023
Series
Proceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning, ISSN 2334-1033
Keywords
classical planning, learning hierarchical policies, policy sketches language, planning width
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-196015 (URN)10.24963/kr.2023/21 (DOI)978-1-956792-02-7 (ISBN)
Conference
20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023
Funder
Swedish National Infrastructure for Computing (SNIC), 2018-05973, 2022-06725National Supercomputer Centre (NSC), SwedenWallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon 2020, 952215EU, European Research Council, 885107
Available from: 2023-06-30 Created: 2023-06-30 Last updated: 2025-11-13
Ståhlberg, S., Bonet, B. & Geffner, H. (2022). Learning General Optimal Policies with Graph Neural Networks: Expressive Power, Transparency, and Limits. In: Akshat Kumar, Sylvie Thiébaux, Pradeep Varakantham, William Yeoh (eds) (Ed.), Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling (ICAPS 2022): . Paper presented at 32nd International Conference on Automated Planning and Scheduling, June 13 - 24, 2022 (pp. 629-637). Palo Alto: AAAI Press, 32
Open this publication in new window or tab >>Learning General Optimal Policies with Graph Neural Networks: Expressive Power, Transparency, and Limits
2022 (English)In: Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling (ICAPS 2022) / [ed] Akshat Kumar, Sylvie Thiébaux, Pradeep Varakantham, William Yeoh (eds), Palo Alto: AAAI Press, 2022, Vol. 32, p. 629-637Conference paper, Published paper (Refereed)
Abstract [en]

It has been recently shown that general policies for many clas-sical planning domains can be expressed and learned in termsof a pool of features defined from the domain predicates usinga description logic grammar. At the same time, most descrip-tion logics correspond to a fragment of k-variable countinglogic (Ck ) for k = 2, that has been shown to provide a tightcharacterization of the expressive power of graph neural net-works. In this work, we make use of these results to under-stand the power and limits of using graph neural networks(GNNs) for learning optimal general policies over a numberof tractable planning domains where such policies are knownto exist. For this, we train a simple GNN in a supervised man-ner to approximate the optimal value function V ∗(s) of anumber of sample states s. As predicted by the theory, it is ob-served that general optimal policies are obtained in domainswhere general optimal value functions can be defined withC2 features but not in those requiring more expressive C3 fea-tures. In addition, it is observed that the features learned are inclose correspondence with the features needed to express V ∗in closed form. The theory and the analysis of the domainslet us understand the features that are actually learned as wellas those that cannot be learned in this way, and let us movein a principled manner from a combinatorial optimization ap-proach to learning general policies to a potentially, more ro-bust and scalable approach based on deep learning.

Place, publisher, year, edition, pages
Palo Alto: AAAI Press, 2022
Series
Proceedings of the International Conference on Automated Planning and Scheduling, ISSN 2334-0835, E-ISSN 2334-0843
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-188842 (URN)10.1609/icaps.v32i1.19851 (DOI)2-s2.0-85130563464 (Scopus ID)9781577358749 (ISBN)
Conference
32nd International Conference on Automated Planning and Scheduling, June 13 - 24, 2022
Available from: 2022-09-27 Created: 2022-09-27 Last updated: 2025-09-18Bibliographically approved
Ståhlberg, S., Bonet, B. & Geffner, H. (2022). Learning Generalized Policies without Supervision Using GNNs. In: Proceedings of the 19th International Conference on Principles of Knowledge Representation and Reasoning: Special Session on KR and Machine Learning. Paper presented at 19th International Conference on Principles of Knowledge Representation and Reasoning, Haifa, Israel, July 31 - August 5, 2022 (pp. 474-483). IJACAI Organization
Open this publication in new window or tab >>Learning Generalized Policies without Supervision Using GNNs
2022 (English)In: Proceedings of the 19th International Conference on Principles of Knowledge Representation and Reasoning: Special Session on KR and Machine Learning, IJACAI Organization , 2022, p. 474-483Conference paper, Published paper (Refereed)
Abstract [en]

We consider the problem of learning generalized policies forclassical planning domains using graph neural networks fromsmall instances represented in lifted STRIPS. The problemhas been considered before but the proposed neural architec-tures are complex and the results are often mixed. In thiswork, we use a simple and general GNN architecture andaim at obtaining crisp experimental results and a deeper un-derstanding: either the policy greedy in the learned valuefunction achieves close to 100% generalization over instanceslarger than those used in training, or the failure must be under-stood, and possibly fixed, logically. For this, we exploit therelation established between the expressive power of GNNsand the C2 fragment of first-order logic (namely, FOL with2 variables and counting quantifiers). We find for examplethat domains with general policies that require more expres-sive features can be solved with GNNs once the states are ex-tended with suitable ”derived atoms” encoding role composi-tions and transitive closures that do not fit into C2. The workfollows an existing approach based on GNNs for learning op-timal general policies in a supervised fashion, but the learnedpolicies are no longer required to be optimal (which expandsthe scope, as many planning domains do not have general op-timal policies) and are learned without supervision. Interest-ingly, value-based reinforcement learning methods that aimto produce optimal policies, do not always yield policies thatgeneralize, as the goals of optimality and generality are inconflict in domains where optimal planning is NP-hard.

Place, publisher, year, edition, pages
IJACAI Organization, 2022
Series
Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning (KR), ISSN 2334-1033
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-188843 (URN)10.24963/kr.2022/49 (DOI)2-s2.0-85141885575 (Scopus ID)9781956792010 (ISBN)
Conference
19th International Conference on Principles of Knowledge Representation and Reasoning, Haifa, Israel, July 31 - August 5, 2022
Available from: 2022-09-27 Created: 2022-09-27 Last updated: 2024-08-22Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-9851-8219

Search in DiVA

Show all publications