liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Scalable Deep Gaussian Markov Random Fields for General Graphs
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-8201-0282
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning. Linköping University, Faculty of Science & Engineering. Arriver Software AB, Sweden.
Linköping University, Faculty of Science & Engineering. Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning.
2022 (English)In: Proceedings of the 39th International Conference on Machine Learning / [ed] Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, Sivan Sabato, 2022, p. 17117-17137Conference paper, Published paper (Refereed)
Abstract [en]

Machine learning methods on graphs have proven useful in many applications due to their ability to handle generally structured data. The framework of Gaussian Markov Random Fields (GMRFs) provides a principled way to define Gaussian models on graphs by utilizing their sparsity structure. We propose a flexible GMRF model for general graphs built on the multi-layer structure of Deep GMRFs, originally proposed for lattice graphs only. By designing a new type of layer we enable the model to scale to large graphs. The layer is constructed to allow for efficient training using variational inference and existing software frameworks for Graph Neural Networks. For a Gaussian likelihood, close to exact Bayesian inference is available for the latent field. This allows for making predictions with accompanying uncertainty estimates. The usefulness of the proposed model is verified by experiments on a number of synthetic and real world datasets, where it compares favorably to other both Bayesian and deep learning methods.

Place, publisher, year, edition, pages
2022. p. 17117-17137
Series
Proceedings of Machine Learning Research, ISSN 2640-3498 ; 162
Keywords [en]
machine learning, graphs, gmrf, deep gmrf, variational inference, gaussian, markov random field
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:liu:diva-187178ISI: 000900064907012OAI: oai:DiVA.org:liu-187178DiVA, id: diva2:1686544
Conference
The 39th International Conference on Machine Learning, ICML, 17-23 July 2022, Baltimore, Maryland, USA
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsSwedish Research Council, 2020-04122Available from: 2022-08-10 Created: 2022-08-10 Last updated: 2025-06-26
In thesis
1. Modeling Spatio-Temporal Systems with Graph-based Machine Learning
Open this publication in new window or tab >>Modeling Spatio-Temporal Systems with Graph-based Machine Learning
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Most systems in the physical world are spatio-temporal in nature. The clouds move over our heads, vehicles travel on the roads and electricity is transmitted through vast spatial networks. Machine learning offers many opportunities to understand and forecast the evolution of these systems by making use of large amounts of collected data. However, building useful models of such systems requires taking both spatial and temporal correlations into account. We can not accurately forecast the weather in Linköping without knowing if there is hot air blowing in from the south. Similarly, we can not predict if a vehicle is about to make a left turn without knowing its position and velocity relative to other vehicles on the road. This thesis proposes a set of methods for accurately capturing such spatio-temporal dependencies in machine learning models. 

At the core of the thesis is the idea of using graphs as a way to represent the spatial relationships in spatio-temporal systems. Graphs offer a highly flexible framework for this purpose, in particular for situations where observation locations do not lie on a regular spatial grid. Throughout the thesis, spatial graphs are constructed by letting nodes correspond to spatial locations and edges the relationships between them. These graphs are then used to construct different machine learning models, including graph neural networks and probabilistic graphical models. Combining such graph-based components with machine learning methods for time series modeling then allows for capturing the full spatio-temporal structure of the data. 

The main contribution of the thesis lies in exploring a number of methods using graph-based modeling for spatio-temporal data. This includes extending temporal graph neural networks to handle data observed irregularly over time. Temporal graph neural networks are also used to develop a model for vehicle trajectory forecasting, where the edges of the graph correspond to interactions between traffic agents. The thesis additionally includes work on Bayesian modeling, where a connection between Gaussian Markov random fields and graph neural networks allows for building scalable probabilistic models for data defined using graphs. 

A motivation for the methods developed in this thesis is the increasing use of machine learning in earth science. Capturing relevant spatio-temporal relationships is central for building useful models of the earth system. The thesis includes numerous experiments making use of weather and climate data, as well as application-driven work specifically targeting weather forecasting. Recent years have seen rapid progress in using machine learning models for weather forecasting, and the thesis makes multiple contributions in this direction. A probabilistic weather forecasting model is developed by combining graph-based methods with a latent variable formulation. Lastly, machine learning limited area models are also explored, where graph neural networks are used for regional weather forecasting.   

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2025. p. 97
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2453
National Category
Computer Sciences Meteorology and Atmospheric Sciences
Identifiers
urn:nbn:se:liu:diva-215665 (URN)10.3384/9789181181173 (DOI)9789181181166 (ISBN)9789181181173 (ISBN)
Public defence
2025-08-22, Ada Lovelace, B-huset, Campus Valla, Linköping, 13:15
Opponent
Supervisors
Funder
Swedish Research Council, 2020-04122; 2024-05011Wallenberg AI, Autonomous Systems and Software Program (WASP)ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2025-06-26 Created: 2025-06-26 Last updated: 2025-06-26Bibliographically approved

Open Access in DiVA

fulltext(2515 kB)118 downloads
File information
File name FULLTEXT02.pdfFile size 2515 kBChecksum SHA-512
bce5f2721d1d8a0a9936e59237ed8ebf929e9af5f38a057ef535872b2c7c7ef26a6ce089287df529f1e0de745388f197d39bdcc511e0d331b1b5a17187722fc1
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Oskarsson, JoelSidén, PerLindsten, Fredrik

Search in DiVA

By author/editor
Oskarsson, JoelSidén, PerLindsten, Fredrik
By organisation
The Division of Statistics and Machine LearningFaculty of Science & Engineering
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 118 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 343 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf