Machine-learning analysis of cross-study samples according to the gut microbiome in 12 infant cohorts
Univ Oulu, Finland.
Univ Oulu, Finland.
Univ Oulu, Finland; Oulu Univ Hosp, Finland.
Baylor Coll Med, TX USA; Texas Childrens Hosp, TX USA.
2023 (English) In: mSystems, E-ISSN 2379-5077, Vol. 8, no 6, article id e0036423
Article in journal (Refereed) Published
Abstract [en]

Combining and comparing microbiome data from distinct infant cohorts has been challenging because such data are inherently multidimensional and complex. Here, we used an ensemble of machine-learning (ML) models and studied 16S rRNA amplicon sequencing data from 4,099 gut microbiome samples representing 12 prospectively collected infant cohorts. We chose the childbirth delivery mode as a starting point for such analysis because it has previously been associated with alterations in the gut microbiome in infants. In cross-study ensemble models, Bacteroides was the most important feature in all machine-learning models. The predictive capacity by taxonomy varied with age. At the age of 1-2 months, gut microbiome data were able to predict delivery mode with an area under the curve of 0.72 to 0.83. In contrast, ML models trained on taxa were not able to differentiate between the modes of delivery, in any of the cohorts, when the infants were between 3 and 12 months of age. Moreover, no ML model, alternately trained on the functional pathways of the infant gut microbiome, could consistently predict mode of delivery at any infant age. This study shows that infant gut microbiome data sets can be effectively combined with the application of ML analysis across different study populations.

IMPORTANCE: There are challenges in merging microbiome data from diverse research groups due to the intricate and multifaceted nature of such data. To address this, we utilized a combination of machine-learning (ML) models to analyze 16S sequencing data from a substantial set of gut microbiome samples, sourced from 12 distinct infant cohorts that were gathered prospectively. Our initial focus was on the mode of delivery due to its prior association with changes in infant gut microbiomes. Through ML analysis, we demonstrated the effective merging and comparison of various gut microbiome data sets, facilitating the identification of robust microbiome biomarkers applicable across varied study populations.

Place, publisher, year, edition, pages
AMER SOC MICROBIOLOGY, 2023. Vol. 8, no 6, article id e0036423
Keywords [en]
machine learning; bioinformatics; human microbiome; gut microbiome; random forest; infant; children; cross-study; ensemble
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:liu:diva-199118
DOI: 10.1128/msystems.00364-23
ISI: 001143818300003
PubMedID: 37874156
OAI: oai:DiVA.org:liu-199118
DiVA, id: diva2:1811561
Note

Funding Agencies|All authors declare no conflict of interest relevant to this article.

Available from: 2023-11-13 Created: 2023-11-13 Last updated: 2024-07-04

Open Access in DiVA

fulltext (2437 kB), 9 downloads
File information
File name FULLTEXT01.pdf
File size 2437 kB
Checksum SHA-512
843ff0443fa56e686d60f0c888a65c911def35006db670d473cbb58c5550b5a530f7309f87dbec122273dcb0e8f553e8e6357d8bf7964b26e78d86078f256cea
Type fulltext
Mimetype application/pdf

Search in DiVA

By author/editor
Ludvigsson, Johnny
By organisation
Division of Children's and Women's Health
Faculty of Medicine and Health Sciences
H.K.H. Kronprinsessan Victorias barn- och ungdomssjukhus