Evaluating the diagnostic and triage performance of digital and online symptom checkers for the presentation of myocardial infarction: A retrospective cross-sectional study

William Wallace, Calvin Chan*, Swathikan Chidambaram, Lydia Hanna, Amish Acharya, Elisabeth Daniels, Pasha Normahani, Rubeta N. Matin, Sheraz R. Markar, Viknesh Sounderajah, Xiaoxuan Liu, Ara Darzi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Downloads (Pure)

Abstract

Online symptom checkers are increasingly popular health technologies that enable patients to input their symptoms to produce diagnoses and triage advice. However, there is concern regarding the performance and safety of symptom checkers in diagnosing and triaging patients with life-threatening conditions. This retrospective cross-sectional study aimed to evaluate and compare commercially available symptom checkers for performance in diagnosing and triaging myocardial infarctions (MI). Symptoms and biodata of MI patients were inputted into 8 symptom checkers identified through a systematic search. Anonymised clinical data of 100 consecutive MI patients were collected from a tertiary coronary intervention centre between 1st January 2020 to 31st December 2020. Outcomes included (1) diagnostic sensitivity as defined by symptom checkers outputting MI as the primary diagnosis (D1), or one of the top three (D3), or top five diagnoses (D5); and (2) triage sensitivity as defined by symptom checkers outputting urgent treatment recommendations. Overall D1 sensitivity was 48±31% and varied between symptom checkers (range: 6–85%). Overall D3 and D5 sensitivity were 73±20% (34–92%) and 79±14% (63–94%), respectively. Overall triage sensitivity was 83±13% (55–91%). 24±16% of atypical cases had a correct D1 though for female atypical cases D1 sensitivity was only 10%. Atypical MI D3 and D5 sensitivity were 44±21% and 48±24% respectively and were significantly lower than typical MI cases (p<0.01). Atypical MI triage sensitivity was significantly lower than typical cases (53±20% versus 84±15%, p<0.01). Female atypical cases had significantly lower diagnostic and triage sensitivity than typical female MI cases (p<0.01).Given the severity of the pathology, the diagnostic performance of symptom checkers for correctly diagnosing an MI is concerningly low. Moreover, there is considerable inter-symptom checker performance variation. Patients presenting with atypical symptoms were under-diagnosed and under-triaged, especially if female. This study highlights the need for improved clinical performance, equity and transparency associated with these technologies.
Original languageEnglish
Article numbere0000558
Number of pages18
JournalPLOS digital health
Volume3
Issue number8
DOIs
Publication statusPublished - 5 Aug 2024

Cite this