liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Model-Based Multi-Objective Reinforcement Learning with Dynamic Utility Functions
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-4144-4893
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-9595-2471
2023 (English)In: Proceedings of the Adaptive and Learning Agents Workshop (ALA) at AAMAS 2023, 2023, p. 1-9Conference paper, Published paper (Refereed)
Abstract [en]

Many real-world problems require a trade-off between multiple conflicting objectives. Decision-makers’ preferences over solutions to such problems are determined by their utility functions, which convert multi-objective values to scalars. In some settings, utility functions change over time, and the goal is to find methods that can efficiently adapt an agent’s policy to changes in utility. Previous work on learning with dynamic utility functions has focused on model-free methods, which often suffer from poor sample efficiency. In this work, we instead propose a model-based actor-critic, which explores with diverse utility functions through imagined rollouts within a learned world model between interactions with the real environment. An experimental evaluation shows that by learning a model of the environment the performance of the agent can be improved compared to model-free algorithms.

Place, publisher, year, edition, pages
2023. p. 1-9
Keywords [en]
Multiple Objectives, Reinforcement Learning, Model-Based Learning
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:liu:diva-194747OAI: oai:DiVA.org:liu-194747DiVA, id: diva2:1764995
Conference
Adaptive and Learning Agents Workshop (ALA) at AAMAS 2023
Funder
Vinnova, NFFP7/2017-04885Wallenberg AI, Autonomous Systems and Software Program (WASP)Available from: 2023-06-09 Created: 2023-06-09 Last updated: 2023-06-09

Open Access in DiVA

No full text in DiVA

Other links

Workshop WebsitePaper Link

Authority records

Källström, JohanHeintz, Fredrik

Search in DiVA

By author/editor
Källström, JohanHeintz, Fredrik
By organisation
Artificial Intelligence and Integrated Computer SystemsFaculty of Science & Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 170 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf