VIGA: a one-stop tool for eukaryotic virus identification and genome assembly from next-generation-sequencing data

Brief Bioinform. 2023 Nov 22;25(1):bbad444. doi: 10.1093/bib/bbad444.

Abstract

Identifying viruses and assembling their genomes from next-generation sequencing (NGS) data are essential steps in virome studies. This study presents a one-stop tool named VIGA (available at https://github.com/viralInformatics/VIGA) for eukaryotic virus identification and genome assembly from NGS data. It comprises four modules (identification, taxonomic annotation, assembly and novel virus discovery) that integrate several third-party tools such as BLAST, Trinity, MetaCompass and RagTag. Evaluation on multiple simulated and real virome datasets showed that VIGA assembled more complete virus genomes than its competitors on both metatranscriptomic and metagenomic data and performed well in assembling virus genomes at the strain level. Finally, VIGA was used to investigate the virome in metatranscriptomic data from the Human Microbiome Project and revealed differences in virome composition and positive rate among prediabetes, Crohn's disease and ulcerative colitis. Overall, VIGA should substantially aid the identification and characterization of viromes, especially of known viruses, in future studies.

Keywords: NGS; Virus identification; genome assembly; metagenomic; metatranscriptomic.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Colitis, Ulcerative*
  • Crohn Disease*
  • Genome, Viral
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Metagenome