Functional Profiling and Evolutionary Analysis of a Marine Microalgal Virus Pangenome

Viruses. 2023 May 5;15(5):1116. doi: 10.3390/v15051116.

Abstract

Phycodnaviridae are large double-stranded DNA viruses, which facilitate studies of host-virus interactions and co-evolution due to their prominence in algal infection and their role in the life cycle of algal blooms. However, the genomic interpretation of these viruses is hampered by a lack of functional information, stemming from the surprising number of hypothetical genes of unknown function. It is also unclear how many of these genes are widely shared within the clade. Using one of the most extensively characterized genera, Coccolithovirus, as a case study, we combined pangenome analysis, multiple functional annotation tools, AlphaFold structural modeling, and literature analysis to compare the core and accessory pangenome and assess support for novel functional predictions. We determined that the Coccolithovirus pangenome shares 30% of its genes with all 14 strains, making up the core. Notably, 34% of its genes were found in at most three strains. Core genes were enriched in early expression based on a transcriptomic dataset of Coccolithovirus EhV-201 algal infection, were more likely to be similar to host proteins than the non-core set, and were more likely to be involved in vital functions such as replication, recombination, and repair. In addition, we generated and collated annotations for the EhV representative EhV-86 from 12 different annotation sources, building up information for 142 previously hypothetical and putative membrane proteins. AlphaFold was further able to predict structures for 204 EhV-86 proteins with a modelling accuracy of good-high. These functional clues, combined with generated AlphaFold structures, provide a foundational framework for the future characterization of this model genus (and other giant viruses) and a further look into the evolution of the Coccolithovirus proteome.

Keywords: AlphaFold; Chlorovirus; Coccolithovirus; Phycodnaviridae; Prasinovirus; algal viruses; functional annotation; genomes; pangenomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genomics
  • Phycodnaviridae* / genetics
  • Phylogeny

Grants and funding

This work was supported by start-up funding from the University of Waterloo (UW), Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants (RGPIN-2022-03350 and DGECR-2022-00329), and a Phycological Society of America Norma J. Lang Early Career Research Fellowship, awarded to J.I.N. We also acknowledge funding from an NSERC Discovery Grant (RGPIN-2019-04266) and Discovery Accelerator Supplement (RGPAS-2019-00004) awarded to A.C.D. A.C.D. also holds a University Research Chair from the University of Waterloo. Additional support was provided to A.S. by the Government of Canada’s Student Work Placement Program and facilitated through the UW co-op student program.