DeepPVP: phenotype-based prioritization of causative variants using deep learning

Imane Boudellioua; Maxat Kulmanov; Paul N. Schofield; Georgios V. Gkoutos; Robert Hoehndorf

doi:10.1186/s12859-019-2633-8

DeepPVP: phenotype-based prioritization of causative variants using deep learning

Imane Boudellioua, Maxat Kulmanov, Paul N. Schofield, Georgios V. Gkoutos, Robert Hoehndorf^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

10 Citations (Scopus)

300 Downloads (Pure)

Abstract

Background: Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype.

Results: We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp.

Conclusions: DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.

Original language	English
Article number	65
Number of pages	8
Journal	BMC Bioinformatics
Volume	20
Issue number	1
DOIs	https://doi.org/10.1186/s12859-019-2633-8
Publication status	Published - 6 Feb 2019

Keywords

Machine learning
Ontology
Phenotype
Variant prioritization

ASJC Scopus subject areas

Structural Biology
Biochemistry
Molecular Biology
Computer Science Applications
Applied Mathematics

Access to Document

10.1186/s12859-019-2633-8Licence: Creative Commons: Attribution (CC BY)

Imane_Boudellioua_et_al_DeepPVP_BMC_Bioinformatics_2019
Checked for eligibility: 11/03/2019
Final published version, 729 KBLicence: Creative Commons: Attribution (CC BY)

Cite this

@article{19b1ee0e697e4beeb6ee4a118d4f1299,

title = "DeepPVP: phenotype-based prioritization of causative variants using deep learning",

abstract = "Background: Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype. Results: We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp. Conclusions: DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.",

keywords = "Machine learning, Ontology, Phenotype, Variant prioritization",

author = "Imane Boudellioua and Maxat Kulmanov and Schofield, {Paul N.} and Gkoutos, {Georgios V.} and Robert Hoehndorf",

year = "2019",

month = feb,

day = "6",

doi = "10.1186/s12859-019-2633-8",

language = "English",

volume = "20",

journal = "BMC Bioinformatics",

issn = "1471-2105",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - DeepPVP

T2 - phenotype-based prioritization of causative variants using deep learning

AU - Boudellioua, Imane

AU - Kulmanov, Maxat

AU - Schofield, Paul N.

AU - Gkoutos, Georgios V.

AU - Hoehndorf, Robert

PY - 2019/2/6

Y1 - 2019/2/6

N2 - Background: Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype. Results: We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp. Conclusions: DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.

AB - Background: Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype. Results: We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp. Conclusions: DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.

KW - Machine learning

KW - Ontology

KW - Phenotype

KW - Variant prioritization

UR - http://www.scopus.com/inward/record.url?scp=85061131814&partnerID=8YFLogxK

U2 - 10.1186/s12859-019-2633-8

DO - 10.1186/s12859-019-2633-8

M3 - Article

C2 - 30727941

AN - SCOPUS:85061131814

SN - 1471-2105

VL - 20

JO - BMC Bioinformatics

JF - BMC Bioinformatics

IS - 1

M1 - 65

ER -

DeepPVP: phenotype-based prioritization of causative variants using deep learning

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Cite this