liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rapid discovery of stable materials by coordinate-free coarse graining
Univ Cambridge, England.
Linköping University, Department of Physics, Chemistry and Biology, Theoretical Physics. Linköping University, Faculty of Science & Engineering.
Univ Cambridge, England.
Linköping University, Department of Physics, Chemistry and Biology, Theoretical Physics. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-5571-0814
Show others and affiliations
2022 (English)In: Science Advances, E-ISSN 2375-2548, Vol. 8, no 30, article id eabn4117Article in journal (Refereed) Published
Abstract [en]

A fundamental challenge in materials science pertains to elucidating the relationship between stoichiometry, stability, structure, and property. Recent advances have shown that machine learning can be used to learn such relationships, allowing the stability and functional properties of materials to be accurately predicted. However, most of these approaches use atomic coordinates as input and are thus bottlenecked by crystal structure identification when investigating previously unidentified materials. Our approach solves this bottleneck by coarse-graining the infinite search space of atomic coordinates into a combinatorially enumerable search space. The key idea is to use Wyckoff representations, coordinate-free sets of symmetry-related positions in a crystal, as the input to a machine learning model. Our model demonstrates exceptionally high precision in finding unknown theoretically stable materials, identifying 1569 materials that lie below the known convex hull of previously calculated materials from just 5675 ab initio calculations. Our approach opens up fundamental advances in computational materials discovery.

Place, publisher, year, edition, pages
AMER ASSOC ADVANCEMENT SCIENCE , 2022. Vol. 8, no 30, article id eabn4117
National Category
Textile, Rubber and Polymeric Materials
Identifiers
URN: urn:nbn:se:liu:diva-187733DOI: 10.1126/sciadv.abn4117ISI: 000836554300009PubMedID: 35895811OAI: oai:DiVA.org:liu-187733DiVA, id: diva2:1691369
Note

Funding Agencies|Winton Programme for the Physics of Sustainability; Royal Society; Swiss National Science Foundation [P2BSP2_191736]; Swedish Research Council (VR) [2020-05402]; Swedish e-Science Centre (SeRC); Swedish Research Council [2018-05973]

Available from: 2022-08-30 Created: 2022-08-30 Last updated: 2026-02-13
In thesis
1. Crystal Symmetry and Machine Learning for Systematic Materials Discovery
Open this publication in new window or tab >>Crystal Symmetry and Machine Learning for Systematic Materials Discovery
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Discovering new crystalline materials lies at the frontier of modern materials science, driving innovation in energy storage, catalysis, semiconductors, and beyond. The vastness of the chemical and structural space poses a profound challenge: the number of possible atomic arrangements grows prohibitably large with system size and composition. Traditional first-principles methods such as density functional theory (DFT) have revolutionized materials discovery, but their high computational cost limits large-scale exploration. This work addresses the combinatorial bottleneck by bringing together two complementary dimensions of modern materials discovery: data-driven predictions using machine learning and high-performance computing.

The work presented in this thesis builds on a symmetry-aware representation of crystal structures called protostructures, based on Wyckoff positions: a coordinate free description of symmetry related atomic sites. This formulation transforms the continuous space of atomic coordinates into a discrete and combinatorially enumerable one. We developed a machine learning model, Wren, which is trained on this representation to provide fast estimates of stability and guide exploration toward promising regions of structural space. A GPU-accelerated workflow using machine-learning-based interatomic potentials and parallelized screening allows for the evaluation of billions of candidate structures within practical timeframes.

Building on this framework, the presented work enumerates 39 billion binary and ternary compounds spanning the chemical space from lithium to bromine, identifying over 88,000 new structural prototypes, and about half a million new crystal structures within a stability limit of 100 meV/atom. The approach is further applied to experimentally unresolved powder diffraction data, where it reconstructs crystal structures consistent with measured patterns, demonstrating the workflow’s ability to uncover physically realizable materials beyond known prototypes.

To explore even broader regions of structural complexity, this work introduces WyckoffDiff, a diffusion-based generative model that produces novel, symmetry-consistent protostructures beyond the training distribution, some predicted to be thermodynamically stable.

Since pretrained interatomic potentials form the foundation of this work, their quality was examined through two complementary studies. The first benchmarks their accuracy in reproducing mixing enthalpies across disordered alloys. The second investigates how these potentials capture the topology of potential energy surfaces by probing energy variations along symmetry-constrained pathways, showing how different machine-learning potentials represent local minima and saddle points, and other artifacts. These two benchmarks provides insight into their reliability for structure prediction, and the resulting findings informed the selection and parametrization of models used throughout our screening framework.

Altogether, the work presented in this thesis demonstrates that the combination of coarse grained screening, ML-based interatomic potentials, and high-performance computing can dramatically accelerate the discovery of previously unseen crystal structures. The framework presented in the thesis expands the boundaries of computational materials discovery and represents a step toward a large-scale, perhaps even comprehensive, mapping of all stable crystal structures permitted by chemistry and symmetry.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2026. p. 89
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2510
National Category
Condensed Matter Physics
Identifiers
urn:nbn:se:liu:diva-221198 (URN)10.3384/9789181184761 (DOI)9789181184754 (ISBN)9789181184761 (ISBN)
Public defence
2026-03-06, Planck, F Building, Campus Valla, Linköping, 09:15 (English)
Opponent
Supervisors
Available from: 2026-02-13 Created: 2026-02-13 Last updated: 2026-02-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Parackal, Abhijith SArmiento, Rickard
By organisation
Theoretical PhysicsFaculty of Science & Engineering
In the same journal
Science Advances
Textile, Rubber and Polymeric Materials

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 271 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf