Skip to main navigation Skip to search Skip to main content

Federated Variational Inference for Bayesian Mixture Models

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a one-shot, unsupervised federated learning approach for Bayesian model-based clustering of large-scale binary and categorical datasets, motivated by the need to identify patient clusters in privacy-sensitive electronic health record (EHR) data. We introduce a principled 'divide-and-conquer’ inference procedure using variational inference with local merge and delete moves within batches of the data in parallel, followed by 'global' merge moves across batches to find global clustering structures. We show that these merge moves require only summaries of the data in each batch, enabling federated learning across local nodes without requiring the full dataset to be shared. Empirical results on simulated and benchmark datasets demonstrate that our method performs well relative to comparator clustering algorithms. We validate the practical utility of the method by applying it to a large-scale British primary care EHR dataset to identify clusters of individuals with common patterns of co-occurring conditions (multimorbidity).
Original languageEnglish
Title of host publicationProceedings of the 5th Machine Learning for Health Symposium
PublisherPMLR
Publication statusAccepted/In press - 27 Nov 2025
EventMachine Learning for Health 2025 - San Diego, United States
Duration: 1 Dec 20252 Dec 2025
https://ahli.cc/ml4h/

Publication series

NameProceedings of Machine Learning Research
ISSN (Electronic)2640-3498

Conference

ConferenceMachine Learning for Health 2025
Abbreviated titleML4H 2025
Country/TerritoryUnited States
CitySan Diego
Period1/12/252/12/25
Internet address

Bibliographical note

Not yet published as of 11/05/2026.

Fingerprint

Dive into the research topics of 'Federated Variational Inference for Bayesian Mixture Models'. Together they form a unique fingerprint.

Cite this