ShadowCaster: Compositional Methods under the Shadow of Phylogenetic Models to Detect Horizontal Gene Transfers in Prokaryotes

Genes (Basel). 2020 Jul 7;11(7):756. doi: 10.3390/genes11070756.

Abstract

Horizontal gene transfer (HGT) plays an important role for evolutionary innovations within prokaryotic communities and is a crucial event for their survival. Several computational approaches have arisen to identify HGT events in recipient genomes. However, this has been proven to be a complex task due to the generation of a great number of false positives and the prediction disagreement among the existing methods. Phylogenetic reconstruction methods turned out to be the most reliable ones, but they are not extensible to all genes/species and are computationally demanding when dealing with large datasets. In contrast, the so-called surrogate methods that use heuristic solutions either based on nucleotide composition patterns or phyletic distribution of BLAST hits can be applied easily to the genomic scale, but they fail in identifying common HGT events. Here, we present ShadowCaster, a hybrid approach that sequentially combines nucleotide composition-based predictions by support vector machines (SVMs) under the shadow of phylogenetic models independent of tree reconstruction, to improve the detection of HGT events in prokaryotes. ShadowCaster successfully predicted close and distant HGT events in both artificial and bacterial genomes. ShadowCaster detected HGT related to heavy metal resistance in the genome of Rhodanobacter denitrificans with higher accuracy than the most popular state-of-the-art computational approaches, encompassing most of the predicted cases made by other methods. ShadowCaster is released at the GitHub platform as an open-source software under the GPLv3 license.

Keywords: horizontal gene transfer; hybrid approach; implicit phylogenetic model; parametric method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gammaproteobacteria / classification
  • Gammaproteobacteria / genetics
  • Gene Transfer, Horizontal*
  • Genome, Archaeal*
  • Genome, Bacterial*
  • Genomics / methods*
  • Phylogeny
  • Software*
  • Support Vector Machine

Supplementary concepts

  • Rhodanobacter denitrificans