liu.seSearch for publications in DiVA
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Crystal Symmetry and Machine Learning for Systematic Materials Discovery
Linköpings universitet, Institutionen för fysik, kemi och biologi, Teoretisk Fysik. Linköpings universitet, Tekniska fakulteten.ORCID-id: 0000-0003-0747-1289
2026 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

Discovering new crystalline materials lies at the frontier of modern materials science, driving innovation in energy storage, catalysis, semiconductors, and beyond. The vastness of the chemical and structural space poses a profound challenge: the number of possible atomic arrangements grows prohibitably large with system size and composition. Traditional first-principles methods such as density functional theory (DFT) have revolutionized materials discovery, but their high computational cost limits large-scale exploration. This work addresses the combinatorial bottleneck by bringing together two complementary dimensions of modern materials discovery: data-driven predictions using machine learning and high-performance computing.

The work presented in this thesis builds on a symmetry-aware representation of crystal structures called protostructures, based on Wyckoff positions: a coordinate free description of symmetry related atomic sites. This formulation transforms the continuous space of atomic coordinates into a discrete and combinatorially enumerable one. We developed a machine learning model, Wren, which is trained on this representation to provide fast estimates of stability and guide exploration toward promising regions of structural space. A GPU-accelerated workflow using machine-learning-based interatomic potentials and parallelized screening allows for the evaluation of billions of candidate structures within practical timeframes.

Building on this framework, the presented work enumerates 39 billion binary and ternary compounds spanning the chemical space from lithium to bromine, identifying over 88,000 new structural prototypes, and about half a million new crystal structures within a stability limit of 100 meV/atom. The approach is further applied to experimentally unresolved powder diffraction data, where it reconstructs crystal structures consistent with measured patterns, demonstrating the workflow’s ability to uncover physically realizable materials beyond known prototypes.

To explore even broader regions of structural complexity, this work introduces WyckoffDiff, a diffusion-based generative model that produces novel, symmetry-consistent protostructures beyond the training distribution, some predicted to be thermodynamically stable.

Since pretrained interatomic potentials form the foundation of this work, their quality was examined through two complementary studies. The first benchmarks their accuracy in reproducing mixing enthalpies across disordered alloys. The second investigates how these potentials capture the topology of potential energy surfaces by probing energy variations along symmetry-constrained pathways, showing how different machine-learning potentials represent local minima and saddle points, and other artifacts. These two benchmarks provides insight into their reliability for structure prediction, and the resulting findings informed the selection and parametrization of models used throughout our screening framework.

Altogether, the work presented in this thesis demonstrates that the combination of coarse grained screening, ML-based interatomic potentials, and high-performance computing can dramatically accelerate the discovery of previously unseen crystal structures. The framework presented in the thesis expands the boundaries of computational materials discovery and represents a step toward a large-scale, perhaps even comprehensive, mapping of all stable crystal structures permitted by chemistry and symmetry.

sted, utgiver, år, opplag, sider
Linköping: Linköping University Electronic Press, 2026. , s. 89
Serie
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2510
HSV kategori
Identifikatorer
URN: urn:nbn:se:liu:diva-221198DOI: 10.3384/9789181184761ISBN: 9789181184754 (tryckt)ISBN: 9789181184761 (digital)OAI: oai:DiVA.org:liu-221198DiVA, id: diva2:2038335
Disputas
2026-03-06, Planck, F Building, Campus Valla, Linköping, 09:15 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2026-02-13 Laget: 2026-02-13 Sist oppdatert: 2026-02-13bibliografisk kontrollert
Delarbeid
1. Rapid discovery of stable materials by coordinate-free coarse graining
Åpne denne publikasjonen i ny fane eller vindu >>Rapid discovery of stable materials by coordinate-free coarse graining
Vise andre…
2022 (engelsk)Inngår i: Science Advances, E-ISSN 2375-2548, Vol. 8, nr 30, artikkel-id eabn4117Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

A fundamental challenge in materials science pertains to elucidating the relationship between stoichiometry, stability, structure, and property. Recent advances have shown that machine learning can be used to learn such relationships, allowing the stability and functional properties of materials to be accurately predicted. However, most of these approaches use atomic coordinates as input and are thus bottlenecked by crystal structure identification when investigating previously unidentified materials. Our approach solves this bottleneck by coarse-graining the infinite search space of atomic coordinates into a combinatorially enumerable search space. The key idea is to use Wyckoff representations, coordinate-free sets of symmetry-related positions in a crystal, as the input to a machine learning model. Our model demonstrates exceptionally high precision in finding unknown theoretically stable materials, identifying 1569 materials that lie below the known convex hull of previously calculated materials from just 5675 ab initio calculations. Our approach opens up fundamental advances in computational materials discovery.

sted, utgiver, år, opplag, sider
AMER ASSOC ADVANCEMENT SCIENCE, 2022
HSV kategori
Identifikatorer
urn:nbn:se:liu:diva-187733 (URN)10.1126/sciadv.abn4117 (DOI)000836554300009 ()35895811 (PubMedID)
Merknad

Funding Agencies|Winton Programme for the Physics of Sustainability; Royal Society; Swiss National Science Foundation [P2BSP2_191736]; Swedish Research Council (VR) [2020-05402]; Swedish e-Science Centre (SeRC); Swedish Research Council [2018-05973]

Tilgjengelig fra: 2022-08-30 Laget: 2022-08-30 Sist oppdatert: 2026-02-13
2. Identifying crystal structures beyond known prototypes from x-ray powder diffraction spectra
Åpne denne publikasjonen i ny fane eller vindu >>Identifying crystal structures beyond known prototypes from x-ray powder diffraction spectra
2024 (engelsk)Inngår i: Physical Review Materials, E-ISSN 2475-9953, Vol. 8, nr 10, artikkel-id 103801Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The large amount of powder diffraction data for which the corresponding crystal structures have not yet been identified suggests the existence of numerous undiscovered, physically relevant crystal structure prototypes. In this paper, we present a scheme to resolve powder diffraction data into crystal structures with precise atomic coordinates by screening the space of all possible atomic arrangements, i.e., structural prototypes, including those not previously observed, using a pre-trained machine learning (ML) model. This involves (i) enumerating all possible symmetry-confined ways in which a given composition can be accommodated in a given space group, (ii) ranking the element-assigned prototype representations using energies predicted using and perturbing atoms along the degree of freedom allowed by the Wyckoff positions to match the experimental diffraction data, and (iv) validating the thermodynamic stability of the material using density-functional theory. An advantage of the presented method is that it does not rely on a database of previously observed prototypes and is, therefore capable of finding crystal structures with entirely new symmetric arrangements of atoms. We demonstrate the workflow on unidentified x-ray diffraction spectra from the ICDD database and identify a number of stable structures, where a majority turns out to be derivable from known prototypes. However, at least two are found not to be part of our prior structural data sets.

sted, utgiver, år, opplag, sider
AMER PHYSICAL SOC, 2024
HSV kategori
Identifikatorer
urn:nbn:se:liu:diva-208676 (URN)10.1103/PhysRevMaterials.8.103801 (DOI)001330003700001 ()
Merknad

Funding Agencies|Swedish Research Council (VR) [2020-05402]; Swedish e-Science Centre (SeRC); Swedish Research Council [2018-05973]

Tilgjengelig fra: 2024-10-22 Laget: 2024-10-22 Sist oppdatert: 2026-02-13
3. Evaluating and improving the predictive accuracy of mixing enthalpies and volumes in disordered alloys from universal pretrained machine learning potentials
Åpne denne publikasjonen i ny fane eller vindu >>Evaluating and improving the predictive accuracy of mixing enthalpies and volumes in disordered alloys from universal pretrained machine learning potentials
2024 (engelsk)Inngår i: Physical Review Materials, E-ISSN 2475-9953, Vol. 8, nr 11, artikkel-id 113803Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The advent of machine learning in materials science opens the way for exciting and ambitious simulations of large systems and long time scales with the accuracy of ab initio calculations. Recently, several pretrained universal machine learned interatomic potentials (UPMLIPs) have been published, i.e., potentials distributed with a single set of weights trained to target systems across a very wide range of chemistries and atomic arrangements. These potentials raise the hope of reducing the computational cost and methodological complexity of performing simulations compared to models that require for-purpose training. However, the application of these models needs critical evaluation to assess their usability across material types and properties. In this work, we investigate the application of the following UPMLIPs: MACE, CHGNET, and M3GNET to the context of alloy theory. We calculate the mixing enthalpies and volumes of 21 binary alloy systems and compare the results with DFT calculations to assess the performance of these potentials over different properties and types of materials. We find that the small relative energies necessary to correctly predict mixing energies are generally not reproduced by these methods with sufficient accuracy to describe correct mixing behaviors. However, the performance can be significantly improved by supplementing the training data with relevant training data. The potentials can also be used to partially accelerate these calculations by replacing the ab initio structural relaxation step.

sted, utgiver, år, opplag, sider
AMER PHYSICAL SOC, 2024
HSV kategori
Identifikatorer
urn:nbn:se:liu:diva-210045 (URN)10.1103/PhysRevMaterials.8.113803 (DOI)001356380700001 ()
Merknad

Funding Agencies|Swedish Research Council (VR) [2020-05402]; Swedish Government Strategic Re-search Area in Materials Science on Functional Materials at Linkping University [2009-00971]; Swedish e -Science Centre (SeRC) - Swedish Research Council [2022-06725]

Tilgjengelig fra: 2024-11-27 Laget: 2024-11-27 Sist oppdatert: 2026-02-13
4. WyckoffDiff– A Generative Diffusion Model for Crystal Symmetry
Åpne denne publikasjonen i ny fane eller vindu >>WyckoffDiff– A Generative Diffusion Model for Crystal Symmetry
Vise andre…
2025 (engelsk)Inngår i: Proceedings of the 42nd International Conference on Machine Learning, PMLR , 2025, Vol. 267, s. 15130-15147Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Crystalline materials often exhibit a high level of symmetry. However, most generative models do not account for symmetry, but rather model each atom without any constraints on its position or element. We propose a generative model, Wyckoff Diffusion (WyckoffDiff), which generates symmetry-based descriptions of crystals. This is enabled by considering a crystal structure representation that encodes all symmetry, and we design a novel neural network architecture which enables using this representation inside a discrete generative model framework. In addition to respecting symmetry by construction, the discrete nature of our model enables fast generation. We additionally present a new metric, Fréchet Wrenformer Distance, which captures the symmetry aspects of the materials generated, and we benchmark WyckoffDiff against recently proposed generative models for crystal generation. As a proof-of-concept study, we use WyckoffDiff to find new materials below the convex hull of thermodynamical stability.

sted, utgiver, år, opplag, sider
PMLR, 2025
Serie
Proceedings of Machine Learning Research, ISSN 2640-3498
HSV kategori
Identifikatorer
urn:nbn:se:liu:diva-218524 (URN)001693104000283 ()
Konferanse
ICML 2025, Forty-Second International Conference on Machine Learning, Vancouver Convention Center, Sun. July 13th through Sat. July 19th
Merknad

Funding: Knut and Alice Wallenberg Foundation (KAW) via the Wallenberg AI, Autonomous Systems and Software Program (WASP); Wallenberg Initiative Material Science for Sustainability (WISE); Swedish Research Council (VR) [2020-05402]; Excellence Center at Linkoping-Lund in Information Technology (ELLIIT); Swedish e-Science Centre (SeRC); Knut and Alice Wallenberg Foundation at the National Supercomputer Centre (NSC); Chalmers Centre for Computational Science and Engineering (C3SE) - Swedish Research Council [2022-06725]

Tilgjengelig fra: 2025-10-07 Laget: 2025-10-07 Sist oppdatert: 2026-04-14

Open Access i DiVA

fulltext(77474 kB)131 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 77474 kBChecksum SHA-512
78fc936548e3b4051ac20709cef32766e88b6be3b227f417fa5c28385ce995a3556e00fdc64f23c38b905fda99431122fc47d2c281b0545a17437ca14e032e17
Type fulltextMimetype application/pdf
Bestill online >>

Andre lenker

Forlagets fulltekst

Person

Parackal, Abhijith S.

Søk i DiVA

Av forfatter/redaktør
Parackal, Abhijith S.
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

doi
isbn
urn-nbn

Altmetric

doi
isbn
urn-nbn
Totalt: 4236 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf