Curating a longitudinal research resource using linked primary care EHR data - a UK Biobank case study

Philip Darke*, Sophie Cassidy, Michael Catt, Roy Taylor, Paolo Missier, Jaume Bacardit

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

50 Downloads (Pure)

Abstract

Primary care EHR data are often of clinical importance to cohort studies however they require careful handling. Challenges include determining the periods during which EHR data were collected. Participants are typically censored when they deregister from a medical practice, however, cohort studies wish to follow participants longitudinally including those that change practice. Using UK Biobank as an exemplar, we developed methodology to infer continuous periods of data collection and maximize follow-up in longitudinal studies. This resulted in longer follow-up for around 40% of participants with multiple registration records (mean increase of 3.8 years from the first study visit). The approach did not sacrifice phenotyping accuracy when comparing agreement between self-reported and EHR data. A diabetes mellitus case study illustrates how the algorithm supports longitudinal study design and provides further validation. We use UK Biobank data, however, the tools provided can be used for other conditions and studies with minimal alteration.

Original languageEnglish
Pages (from-to)546-552
Number of pages7
JournalJournal of the American Medical Informatics Association
Volume29
Issue number3
Early online date13 Dec 2021
DOIs
Publication statusPublished - Mar 2022

Bibliographical note

Funding Information:
The data used were provided under UK Biobank application 12184. The authors are grateful to all UK Biobank participants who generously contributed their time to the study. PD would like to thank Dr Peter Philipson at Newcastle University, Dr Sam Hodgson at University of Southampton, Dr Sarah Finer at Queen Mary University of London, and Professor Naomi Allen and Dr Rishi Caleyachetty at Oxford University/UK Biobank for helpful discussions.

Publisher Copyright:
© 2021 The Author(s).

Keywords

  • diabetes mellitus
  • electronic health records
  • longitudinal studies
  • medical record linkage
  • phenotype

ASJC Scopus subject areas

  • Health Informatics

Fingerprint

Dive into the research topics of 'Curating a longitudinal research resource using linked primary care EHR data - a UK Biobank case study'. Together they form a unique fingerprint.

Cite this