Comparison of Illumina de novo assembled and Sanger sequenced viral genomes: A case study for RNA viruses recovered from the plant pathogenic fungus Sclerotinia sclerotiorum

Virus Res. 2016 Jul 2:219:51-57. doi: 10.1016/j.virusres.2015.11.001. Epub 2015 Nov 12.

Abstract

The advent of 'next generation sequencing' (NGS) technologies has led to the discovery of many novel mycoviruses, the majority of which are sufficiently different from previously sequenced viruses that there is no appropriate reference sequence on which to base the sequence assembly. Although many new genome sequences are generated by NGS, confirmation of the sequence by Sanger sequencing is still essential for formal classification by the International Committee for the Taxonomy of Viruses (ICTV), although this is currently under review. To empirically test the validity of de novo assembled mycovirus genomes from dsRNA extracts, we compared the results from Illumina sequencing with those from random cloning plus targeted PCR coupled with Sanger sequencing for viruses from five Sclerotinia sclerotiorum isolates. Through Sanger sequencing we detected nine viral genomes while through Illumina sequencing we detected the same nine viruses plus one additional virus from the same samples. Critically, the Illumina derived sequences share >99.3 % identity to those obtained by cloning and Sanger sequencing. Although, there is scope for errors in de novo assembled viral genomes, our results demonstrate that by maximising the proportion of viral sequence in the data and using sufficiently rigorous quality controls, it is possible to generate de novo genome sequences of comparable accuracy from Illumina sequencing to those obtained by Sanger sequencing.

Keywords: De novo viral genome assembly; Illumina sequencing; Mycovirus; Next generation sequencing; Virus detection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Ascomycota / virology
  • Computational Biology
  • Fungal Viruses / genetics
  • Genome, Viral*
  • High-Throughput Nucleotide Sequencing* / methods
  • Plants / microbiology
  • RNA Viruses / genetics
  • Sequence Analysis, DNA* / methods