Prophage Tracer: precisely tracing prophages in prokaryotic genomes using overlapping split-read alignment

Nucleic Acids Res. 2021 Dec 16;49(22):e128. doi: 10.1093/nar/gkab824.

Abstract

The life cycle of temperate phages includes a lysogenic cycle stage when the phage integrates into the host genome and becomes a prophage. However, the identification of prophages that are highly divergent from known phages remains challenging. In this study, by taking advantage of the lysis-lysogeny switch of temperate phages, we designed Prophage Tracer, a tool for recognizing active prophages in prokaryotic genomes using short-read sequencing data, independent of phage gene similarity searching. Prophage Tracer uses the criterion of overlapping split-read alignment to recognize discriminative reads that contain bacterial (attB) and phage (attP) att sites representing prophage excision signals. Performance testing showed that Prophage Tracer could predict known prophages with precise boundaries, as well as novel prophages. Two novel prophages, dsDNA and ssDNA, encoding highly divergent major capsid proteins, were identified in coral-associated bacteria. Prophage Tracer is a reliable data mining tool for the identification of novel temperate phages and mobile genetic elements. The code for the Prophage Tracer is publicly available at https://github.com/WangLab-SCSIO/Prophage_Tracer.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Animals
  • Anthozoa / microbiology
  • Bacteria / isolation & purification
  • Genome, Archaeal*
  • Genome, Bacterial*
  • Interspersed Repetitive Sequences
  • Prophages / genetics*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Software*