Skip to main navigation Skip to search Skip to main content

DynaBiome: interpretable unsupervised learning of gut microbiome dysbiosis via temporal deep models

  • Awais Qureshi*
  • , Abdul Wahid*
  • , Shams Qazi
  • , Muhammad K. Shahzad
  • , Hashir Moheed Kiani
  • , Muhammad Daud Abdullah Asif
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Downloads (Pure)

Abstract

Purpose: Gut microbiome dysbiosis is a critical determinant for autologous fecal microbiota transplantation (Auto-FMT) eligibility, yet current classification approaches rely predominantly on supervised learning with manually annotated sequencing labels, which are often scarce. This study proposes DynaBiome, a framework designed to predict gut dysbiosis by leveraging unsupervised learning and clinical phenotypic proxies as a scalable alternative to ground-truth genomic labeling.

Methods
: Our framework employs an LSTM autoencoder architecture to capture temporal microbiome dynamics within 14-day windows. The model reconstructs normal microbiome patterns, where high reconstruction errors signal potential dysbiosis. To ensure rigorous evaluation and prevent data leakage, the dataset was partitioned via a strict patient-level split. Unsupervised anomaly signals were refined via phenotypic proxy labels (e.g., fever, neutropenia) via weak supervision, and ensemble learning methods were applied to optimize classification performance.

Results
: The initial LSTM autoencoder successfully flagged dysbiotic sequences but required refinement to reduce false positives. Ensemble learning significantly enhanced predictive accuracy. The stacked ensemble (with Logistic Regression meta-learner) demonstrated optimal performance with an ROC AUC of 0.8908 and a Weighted F1-score of 0.7909. This approach significantly outperformed the standard One-Class SVM baseline (ROC AUC 0.6033), confirming the superiority of deep temporal modeling over static anomaly detection. Critically, the model achieved performance levels comparable to fully supervised baselines, confirming the efficacy of the proxy-label framework.

Conclusion
: Integrating unsupervised temporal feature extraction with stacked ensemble methods provides a viable framework for dysbiosis prediction. These results demonstrate that leveraging phenotypic via weak supervision can effectively approximate supervised baselines, thereby reducing the reliance on comprehensive metagenomic annotations for longitudinal patient monitoring.
Original languageEnglish
Article number71
Number of pages27
JournalBMC Bioinformatics
Volume27
Issue number1
Early online date27 Feb 2026
DOIs
Publication statusPublished - 26 Mar 2026

Keywords

  • Autoencoder
  • Longitudinal microbiome data
  • Clinical proxies
  • Stacked ensemble
  • Weak supervision
  • Deep learning

Fingerprint

Dive into the research topics of 'DynaBiome: interpretable unsupervised learning of gut microbiome dysbiosis via temporal deep models'. Together they form a unique fingerprint.

Cite this