liu.seSearch for publications in DiVA
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
A Variational Perspective on Generative Flow Networks
Amsterdam Machine Learning Lab, University of Amsterdam.
Linköpings universitet, Institutionen för datavetenskap, Statistik och maskininlärning. Linköpings universitet, Filosofiska fakulteten.ORCID-id: 0000-0003-3749-5820
Amsterdam Machine Learning Lab, University of Amsterdam.
Amsterdam Machine Learning Lab, University of Amsterdam.
2024 (engelsk)Inngår i: Transactions on Machine Learning Research, E-ISSN 2835-8856Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Generative flow networks (GFNs) are a class of probabilistic models for sequential samplingof composite objects, proportional to a target distribution that is defined in terms of anenergy function or a reward. GFNs are typically trained using a flow matching or trajectorybalance objective, which matches forward and backward transition models over trajectories.In this work we introduce a variational objective for training GFNs, which is a convexcombination of the reverse- and forward KL divergences, and compare it to the trajectorybalance objective when sampling from the forward- and backward model, respectively. Weshow that, in certain settings, variational inference for GFNs is equivalent to minimizing thetrajectory balance objective, in the sense that both methods compute the same score-functiongradient. This insight suggests that in these settings, control variates, which are commonlyused to reduce the variance of score-function gradient estimates, can also be used with thetrajectory balance objective. We evaluate our findings and the performance of the proposedvariational objective numerically by comparing it to the trajectory balance objective on twosynthetic tasks.

sted, utgiver, år, opplag, sider
2024.
HSV kategori
Identifikatorer
URN: urn:nbn:se:liu:diva-204028OAI: oai:DiVA.org:liu-204028DiVA, id: diva2:1863822
Forskningsfinansiär
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsWallenberg AI, Autonomous Systems and Software Program (WASP)Swedish Research Council, 2020-04122Tilgjengelig fra: 2024-06-01 Laget: 2024-06-01 Sist oppdatert: 2025-09-24bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

https://openreview.net/forum?id=AZ4GobeSLq

Person

Lindsten, Fredrik

Søk i DiVA

Av forfatter/redaktør
Lindsten, Fredrik
Av organisasjonen
I samme tidsskrift
Transactions on Machine Learning Research

Søk utenfor DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric

urn-nbn
Totalt: 204 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf