Interpretable and robust hospital readmission predictions from Electronic Health Records

Hugo Calero-Diaz*, Rebeen Ali Hamad, Christian Atallah, John Casement, Dexter Canoy, Nick J. Reynolds, Michael R. Barnes, Paolo Missier

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Rates of Hospital Readmission (HR), defined as unplanned readmission within 30 days of discharge, have been increasing over the years, and impose an economic burden on healthcare services worldwide. Despite recent research into predicting HR, few models provide sufficient discriminative ability. Three main drawbacks can be identified in the published literature: (i) imbalance in the target classes (readmitted or not), (ii) not including demographic and lifestyle predictors, and (iii) lack of interpretability of the models. In this work, we address these three points by evaluating class balancing techniques, performing a feature selection process including demographic and lifestyle features, and adding interpretability through a combination of SHapley Additive exPlanations (SHAP) and Accumulated Local Effects (ALE) post hoc methods. Our best classifier for this binary outcome achieves a UAC of 0.849 using a selection of 1296 features, extracted from patients' Electronic Health Records (EHRs) and from their sociodemographics profiles. Using SHAP and ALE, we have established the importance of age, the number of long-term conditions, and the duration of the first admission as top predictors. In addition, we show through an ablation study that demographic and lifestyle features provide even better predictive capabilities than other features, suggesting their relevance toward HR.

Original languageEnglish
Title of host publication2023 IEEE International Conference on Big Data (BigData)
EditorsJingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages3679-3687
Number of pages9
ISBN (Electronic)9798350324457
ISBN (Print)9798350324464
DOIs
Publication statusPublished - 22 Jan 2024
Event2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023

Publication series

NameIEEE International Conference on Big Data
PublisherIEEE

Conference

Conference2023 IEEE International Conference on Big Data, BigData 2023
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23

Bibliographical note

Funding Information:
ACKNOWLEDGMENT This research has been conducted using data from UK Biobank, a major biomedical database.3 This study received funding from the National Institute for Health and Care Research (NIHR), Artificial Intelligence for Multiple Long-Term Conditions (AIM) Development and Collaboration grants.

Funding Information:
N.J.R. is also supported by the Newcastle Biomedical Research Centre, the Newcastle NIHR Medtech and In Vitro Diagnostics Cooperative, the NIHR Newcastle Patient Safety Research Collaborative and is a NIHR Senior Investigator.

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Classification
  • Feature Selection
  • Hospital readmission
  • Imbalance correction
  • Interpretability

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Interpretable and robust hospital readmission predictions from Electronic Health Records'. Together they form a unique fingerprint.

Cite this