Analysis of the human diseasome using phenotype similarity between common, genetic, and infectious diseases

R. Hoehndorf, P. N. Schofield, G. V. Gkoutos

Research output: Contribution to journalArticlepeer-review

50 Citations (Scopus)
128 Downloads (Pure)


Phenotypes are the observable characteristics of an organism arising from its response to the environment. Phenotypes associated with engineered and natural genetic variation are widely recorded using phenotype ontologies in model organisms, as are signs and symptoms of human Mendelian diseases in databases such as OMIM and Orphanet. Exploiting these resources, several computational methods have been developed for integration and analysis of phenotype data to identify the genetic etiology of diseases or suggest plausible interventions. A similar resource would be highly useful not only for rare and Mendelian diseases, but also for common, complex and infectious diseases. We apply a semantic text-mining approach to identify the phenotypes (signs and symptoms) associated with over 6,000 diseases. We evaluate our text-mined phenotypes by demonstrating that they can correctly identify known disease-associated genes in mice and humans with high accuracy. Using a phenotypic similarity measure, we generate a human disease network in which diseases that have similar signs and symptoms cluster together and we use this network to identify closely related diseases based on common etiological, anatomical as well as physiological underpinnings.
Original languageEnglish
Article number10888
Number of pages14
JournalScientific Reports
Publication statusPublished - 8 Jun 2015


Dive into the research topics of 'Analysis of the human diseasome using phenotype similarity between common, genetic, and infectious diseases'. Together they form a unique fingerprint.

Cite this