liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Precise and interpretable neural networks reveal epigenetic signatures of aging across youth in health and disease
Linköping University, Department of Physics, Chemistry and Biology, Bioinformatics. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-6363-6298
Linköping University, Department of Physics, Chemistry and Biology, Bioinformatics. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-6362-0659
Linköping University, Department of Physics, Chemistry and Biology, Bioinformatics. Linköping University, Faculty of Science & Engineering.
Swedish Natl Board Forens Med, Dept Forens Genet & Toxicol, Linköping, Sweden.
Show others and affiliations
2025 (English)In: Frontiers in Aging, E-ISSN 2673-6217, Vol. 5, article id 1526146Article in journal (Refereed) Published
Abstract [en]

Introduction DNA methylation (DNAm) age clocks are powerful tools for measuring biological age, providing insights into aging risks and outcomes beyond chronological age. While traditional models are effective, their interpretability is limited by their dependence on small and potentially stochastic sets of CpG sites. Here, we propose that the reliability of DNAm age clocks should stem from their capacity to detect comprehensive and targeted aging signatures.Methods We compiled publicly available DNAm whole-blood samples (n = 17,726) comprising the entire human lifespan (0-112 years). We used a pre-trained network-coherent autoencoder (NCAE) to compress DNAm data into embeddings, with which we trained interpretable neural network epigenetic clocks. We then retrieved their age-specific epigenetic signatures of aging and examined their functional enrichments in age-associated biological processes.Results We introduce NCAE-CombClock, a novel highly precise (R2 = 0.978, mean absolute error = 1.96 years) deep neural network age clock integrating data-driven DNAm embeddings and established CpG age markers. Additionally, we developed a suite of interpretable NCAE-Age neural network classifiers tailored for adolescence and young adulthood. These clocks can accurately classify individuals at critical developmental ages in youth (AUROC = 0.953, 0.972, and 0.927, for 15, 18, and 21 years) and capture fine-grained, single-year DNAm signatures of aging that are enriched in biological processes associated with anatomic and neuronal development, immunoregulation, and metabolism. We showcased the practical applicability of this approach by identifying candidate mechanisms underlying the altered pace of aging observed in pediatric Crohn's disease.Discussion In this study, we present a deep neural network epigenetic clock, named NCAE-CombClock, that improves age prediction accuracy in large datasets, and a suite of explainable neural network clocks for robust age classification across youth. Our models offer broad applications in personalized medicine and aging research, providing a valuable resource for interpreting aging trajectories in health and disease.

Place, publisher, year, edition, pages
FRONTIERS MEDIA SA , 2025. Vol. 5, article id 1526146
Keywords [en]
DNA methylation; neural networks; age clock; epigenetic age; youth
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:liu:diva-211720DOI: 10.3389/fragi.2024.1526146ISI: 001414074300001PubMedID: 39916723Scopus ID: 2-s2.0-85216955208OAI: oai:DiVA.org:liu-211720DiVA, id: diva2:1938423
Note

Funding Agencies|Vetenskapsrdet10.13039/501100004359 [Berzelius-2022-156, Berzelius-2024-5, LiU-compute-2023-38, NAISS 2023/5-303]

Available from: 2025-02-18 Created: 2025-02-18 Last updated: 2025-11-24
In thesis
1. Explainable deep learning for DNA methylation analysis in health and disease
Open this publication in new window or tab >>Explainable deep learning for DNA methylation analysis in health and disease
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Modern clinical decision support requires models that are both accurate and mechanistically interpretable. DNA methylation tracks the cumulative influence of development, lifestyle, and environment on gene regulation, but its dimensionality and tissue specificity complicate analysis and clinical application. This thesis develops explainable deep learning methods that learn coherent biological signals from genome-wide methylation data, aiming to derive reliable biomarkers of aging, disease risk and severity, and system-level health. Central to our approach are deep autoencoders, unsupervised multi-layered neural networks that efficiently compress DNA methylation data into low-dimensional embeddings that preserve relevant biology, paired with interpretability techniques that expose feature contributions and model reasoning, such as perturbation-based latent activation.

By training on large multi-tissue compendia of human DNA methylation samples, we observed that the autoencoders self-organized their latent spaces, recapitulating protein-protein interaction (PPI) modules. Interpreting these structured embeddings yielded pathway-enriched epigenomic signatures that supported accurate epigenetic age estimation and robust classification of disease status and smoking. Building on these findings, we introduced a PPI-guided autoencoder that incorporates a graph-regularized protein interaction prior, encouraging each latent unit to be functionally specific and colocalized within the human interactome. We showed that this soft guidance improved the mechanistic interpretability of downstream models, in this case supervised translators that map between omics modalities (transcriptomics, DNA methylation, genomics).

In parallel, we combined autoencoder embeddings with established aging markers to train explainable neural-network age clocks that achieved state-of-the-art cross-tissue precision, while also capturing fine-grained developmental, immune, and metabolic signatures. Finally, we operationalized these representations in a clinical decision-support pipeline that predicts respiratory, cardiovascular, and metabolic system-level health scores from blood methylation, with supervised deep learning models that highlight biological processes associated with each physiological system. Collectively, this work provides a scalable and auditable framework that converts methylomes into interpretable feature sets and actionable indicators for clinical use, enabling early risk assessment, monitoring of treatment responses and lifestyle changes, and informed therapeutic target prioritization.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2025. p. 105
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2490
Keywords
Deep learning, Autoencoders, DNA methylation, Aging, Health
National Category
Medical Genetics and Genomics
Identifiers
urn:nbn:se:liu:diva-219551 (URN)10.3384/9789181183320 (DOI)9789181183313 (ISBN)9789181183320 (ISBN)
Public defence
2025-12-18, C1, C-building, Campus Valla, Linköping, 09:00 (English)
Opponent
Supervisors
Note

Funding Agencies: Swedish Heart-Lung Foundation

Available from: 2025-11-17 Created: 2025-11-17 Last updated: 2025-11-17Bibliographically approved

Open Access in DiVA

fulltext(5679 kB)30 downloads
File information
File name FULLTEXT01.pdfFile size 5679 kBChecksum SHA-512
8494e08aef4c1bf523bc77cd23fbe43c3531e3f172e1e39e63583628c0a4e3e1a8926a2ef6c111eb262946d78709763a086846147932cfab5618ed594e17df1c
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Search in DiVA

By author/editor
Martinez, DavidHillerton, ThomasÅkesson, JuliaLerm, MariaGustafsson, Mika
By organisation
BioinformaticsFaculty of Science & EngineeringDivision of Inflammation and InfectionFaculty of Medicine and Health Sciences
In the same journal
Frontiers in Aging
Bioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 30 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 213 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf