Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment

Romain Derelle, Hervé Philippe, John K. Colbourne

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Orthology assignment is a key step of comparative genomic studies, for which many bioinformatic tools have been developed. However, all gene clustering pipelines are based on the analysis of protein distances, which are subject to many artifacts. In this article, we introduce Broccoli, a user-friendly pipeline designed to infer, with high precision, orthologous groups, and pairs of proteins using a phylogeny-based approach. Briefly, Broccoli performs ultrafast phylogenetic analyses on most proteins and builds a network of orthologous relationships. Orthologous groups are then identified from the network using a parameter-free machine learning algorithm. Broccoli is also able to detect chimeric proteins resulting from gene-fusion events and to assign these proteins to the corresponding orthologous groups. Tested on two benchmark data sets, Broccoli outperforms current orthology pipelines. In addition, Broccoli is scalable, with runtimes similar to those of recent distance-based pipelines. Given its high level of performance and efficiency, this new pipeline represents a suitable choice for comparative genomic studies. Broccoli is freely available at https://github.com/rderelle/Broccoli.

Original languageEnglish
Pages (from-to)3389-3396
Number of pages8
JournalMolecular biology and evolution
Volume37
Issue number11
DOIs
Publication statusPublished - 1 Nov 2020

Bibliographical note

Publisher Copyright:
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Copyright:
This record is sourced from MEDLINE/PubMed, a database of the U.S. National Library of Medicine

Keywords

  • gene fusions
  • label propagation algorithm
  • LPA
  • orthologous groups
  • orthology

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics

Fingerprint

Dive into the research topics of 'Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment'. Together they form a unique fingerprint.

Cite this