EMCODIST: A Context-based Search Tool for Email Archives

Santhilata Kuppili Venkata, Stephanie Decker, David Kirsch, Adam Nix

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Preserving emails as an alternative historical source poses particular challenges for future discovery. Emails record communications between individuals and contain a wealth of information when viewed as an organisation-wide collection. Existing search tools can extract named entities and run keyword searches, but they are less effective at extracting patterns and contextual information across multiple custodians. To address this, we present EMCODIST, a discovery tool that searches contextual information across emails using attention-based Natural Language Processing (NLP) models. EMCODIST aims to guide end-users in personalising their searches towards a concept. In this paper, we define the ‘context’ of an email in a way that is also suitable for object-oriented computational modelling. The tool is evaluated on the relevance of the emails it extracts.
Original language: English
Title of host publication: 2021 IEEE International Conference on Big Data (Big Data)
Editors: Yixin Chen, Heiko Ludwig, Yicheng Tu, Usama Fayyad, Xingquan Zhu, Xiaohua Hu, Suren Byna, Xiong Liu, Jianping Zhang, Shirui Pan, Vagelis Papalexakis, Jianwu Wang, Alfredo Cuzzocrea, Carlos Ordonez
Number of pages: 10
ISBN (Electronic): 9781665439022
ISBN (Print): 9781665445993 (PoD)
Publication status: Published - 13 Jan 2022

Publication series

Name: IEEE International Conference on Big Data
ISSN (Print): 2639-1589
ISSN (Electronic): 2573-2978


  • Contextualisation
  • Email archives processing
  • Content analysis
  • Natural Language Processing

