liu.seSök publikationer i DiVA
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
A Variational Perspective on Generative Flow Networks
Amsterdam Machine Learning Lab, University of Amsterdam.
Linköpings universitet, Institutionen för datavetenskap, Statistik och maskininlärning. Linköpings universitet, Filosofiska fakulteten.ORCID-id: 0000-0003-3749-5820
Amsterdam Machine Learning Lab, University of Amsterdam.
Amsterdam Machine Learning Lab, University of Amsterdam.
2024 (Engelska)Ingår i: Transactions on Machine Learning Research, E-ISSN 2835-8856Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Generative flow networks (GFNs) are a class of probabilistic models for sequential samplingof composite objects, proportional to a target distribution that is defined in terms of anenergy function or a reward. GFNs are typically trained using a flow matching or trajectorybalance objective, which matches forward and backward transition models over trajectories.In this work we introduce a variational objective for training GFNs, which is a convexcombination of the reverse- and forward KL divergences, and compare it to the trajectorybalance objective when sampling from the forward- and backward model, respectively. Weshow that, in certain settings, variational inference for GFNs is equivalent to minimizing thetrajectory balance objective, in the sense that both methods compute the same score-functiongradient. This insight suggests that in these settings, control variates, which are commonlyused to reduce the variance of score-function gradient estimates, can also be used with thetrajectory balance objective. We evaluate our findings and the performance of the proposedvariational objective numerically by comparing it to the trajectory balance objective on twosynthetic tasks.

Ort, förlag, år, upplaga, sidor
2024.
Nationell ämneskategori
Sannolikhetsteori och statistik Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:liu:diva-204028OAI: oai:DiVA.org:liu-204028DiVA, id: diva2:1863822
Forskningsfinansiär
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsWallenberg AI, Autonomous Systems and Software Program (WASP)Vetenskapsrådet, 2020-04122Tillgänglig från: 2024-06-01 Skapad: 2024-06-01 Senast uppdaterad: 2025-09-24Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

https://openreview.net/forum?id=AZ4GobeSLq

Person

Lindsten, Fredrik

Sök vidare i DiVA

Av författaren/redaktören
Lindsten, Fredrik
Av organisationen
Statistik och maskininlärningFilosofiska fakulteten
I samma tidskrift
Transactions on Machine Learning Research
Sannolikhetsteori och statistikDatavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 208 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf