Comparative genomic analysis of five freshwater cyanophages and reference-guided metagenomic data mining

Microbiome. 2022 Aug 17;10(1):128. doi: 10.1186/s40168-022-01324-w.

Abstract

Background: As important producers using photosynthesis on Earth, cyanobacteria contribute to the oxygenation of atmosphere and the primary production of biosphere. However, due to the eutrophication of urban waterbodies and global warming, uncontrollable growth of cyanobacteria usually leads to the seasonal outbreak of cyanobacterial blooms. Cyanophages, a group of viruses that specifically infect and lyse cyanobacteria, are considered as potential environment-friendly agents to control the harmful blooms. Compared to the marine counterparts, only a few freshwater cyanophages have been isolated and genome sequenced to date, largely limiting their characterizations and applications.

Results: Here, we isolated five freshwater cyanophages varying in tail morphology, termed Pam1~Pam5, all of which infect the cyanobacterium Pseudanabaena mucicola Chao 1806 that was isolated from the bloom-suffering Lake Chaohu in Anhui, China. The whole-genome sequencing showed that cyanophages Pam1~Pam5 all contain a dsDNA genome, varying in size from 36 to 142 Kb. Phylogenetic analyses suggested that Pam1~Pam5 possess different DNA packaging mechanisms and are evolutionarily distinct from each other. Notably, Pam1 and Pam5 have lysogeny-associated gene clusters, whereas Pam2 possesses 9 punctuated DNA segments identical to the CRISPR spacers in the host genome. Metagenomic data-based calculation of the relative abundance of Pam1~Pam5 at the Nanfei estuary towards the Lake Chaohu revealed that the short-tailed Pam1 and Pam5 account for the majority of the five cyanophages. Moreover, comparative analyses of the reference genomes of Pam1~Pam5 and previously reported cyanophages enabled us to identify three circular and seven linear contigs of virtual freshwater cyanophages from the metagenomic data of the Lake Chaohu.

Conclusions: We propose a high-throughput strategy to systematically identify cyanophages based on the currently available metagenomic data and the very limited reference genomes of experimentally isolated cyanophages. This strategy could be applied to mine the complete or partial genomes of unculturable bacteriophages and viruses. Transformation of the synthesized whole genomes of these virtual phages/viruses to proper hosts will enable the rescue of bona fide viral particles and eventually enrich the library of microorganisms that exist on Earth. Video abstract.

Keywords: Cyanobacterium; Freshwater cyanophage; Metagenomics; Reference genome; Whole-genome sequencing.

Publication types

  • Video-Audio Media
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteriophages*
  • Data Mining
  • Fresh Water / microbiology
  • Genome, Viral* / genetics
  • Metagenomics
  • Oligopeptides
  • Phylogeny
  • Toll-Like Receptor 2 / agonists
  • Toll-Like Receptor 9 / agonists

Substances

  • Oligopeptides
  • Toll-Like Receptor 2
  • Toll-Like Receptor 9
  • PAM2-CSK4