Inference of the Life Cycle of Environmental Phages from Genomic Signature Distances to Their Hosts

Viruses. 2023 May 19;15(5):1196. doi: 10.3390/v15051196.

Abstract

The environmental impact of uncultured phages is shaped by their preferred life cycle (lytic or lysogenic). However, our ability to predict it is very limited. We aimed to discriminate between lytic and lysogenic phages by comparing the similarity of their genomic signatures to those of their hosts, reflecting their co-evolution. We tested two approaches: (1) similarities of tetramer relative frequencies, (2) alignment-free comparisons based on exact k = 14 oligonucleotide matches. First, we explored 5126 reference bacterial host strains and 284 associated phages and found an approximate threshold for distinguishing lysogenic and lytic phages using both oligonucleotide-based methods. The analysis of 6482 plasmids revealed the potential for horizontal gene transfer between different host genera and, in some cases, distant bacterial taxa. Subsequently, we experimentally analyzed combinations of 138 Klebsiella pneumoniae strains and their 41 phages and found that the phages with the largest number of interactions with these strains in the laboratory had the shortest genomic distances to K. pneumoniae. We then applied our methods to 24 single-cells from a hot spring biofilm containing 41 uncultured phage-host pairs, and the results were compatible with the lysogenic life cycle of phages detected in this environment. In conclusion, oligonucleotide-based genome analysis methods can be used for predictions of (1) life cycles of environmental phages, (2) phages with the broadest host range in culture collections, and (3) potential horizontal gene transfer by plasmids.

Keywords: bacteriophages; genomic signatures; lysogenic phages; lytic phages; single-cell genomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / genetics
  • Bacteriophages* / genetics
  • Genome, Viral
  • Genomics
  • Lysogeny
  • Oligonucleotides

Substances

  • Oligonucleotides

Grants and funding

M.D. is supported by the Generalitat Valenciana program Gen-T (grant number CDEIGENT/2021/008). B.B. was funded by a PhD fellowship from Spanish MCIU FPU16/02139. R.S. received funding from the MICINN project PID2020-118602RB-I00 (ZooVir). P.D.-C. is financially supported by a Ramón y Cajal contract RYC2019-028015-I funded by MCIN/AEI/10.13039/501100011033, ESF Invest in Your Future, and received funding from ESCMID Research Grant 20200063, project PID2020-112835RA-I00, funded by MCIN/AEI/10.13039/501100011033, and project SEJIGENT/2021/014, funded by Conselleria d’Innovació, Universitats, Ciència i Societat Digital (Generalitat Valenciana). F.G.-C. was funded by MICIN project PID2021-127010OB-I00 and by project CIPROM-2021-053 from Generalitat Valenciana.