Sequence, assembly and count datasets of viruses associated to the pine processionary moth Thaumetopoea pityocampa (Denis & Schiffermüller) (Lepidoptera, Notodontidae) identified from transcriptomic high-throughput sequencing

Data Brief. 2023 Apr 25:48:109180. doi: 10.1016/j.dib.2023.109180. eCollection 2023 Jun.

Abstract

The pine processionary moth Thaumetopoea pityocampa is a Lepidopteran pest species occurring in the Western Mediterranean. It causes heavy pine defoliations and it is a public and animal health concern because of its urticating caterpillars. Very little is known about the viruses associated to this species, as only two viruses were described so far. We here present a dataset corresponding to 34 viral transcripts, among which 27 could be confidently assigned to 9 RNA and DNA viral families (Iflaviridae, Reoviridae, Partitiviridae, Permutotetraviridae, Flaviviridae, Rhabdoviridae, Parvoviridae, Baculoviridae and PolyDNAviridae). These transcripts were identified from an original transcriptome assembled for the insect host, using both blast search and phylogenetic approaches. The data were acquired from 2 populations in Portugal and 2 populations in Italy. The transcripts were de novo assembled and used to identify viral sequences by homology searches. We also provide information about the populations and life stages in which each virus was identified. The data produced will allow to enrich the virus taxonomy in Lepidopteran hosts, and to develop PCR-based diagnostic tools to screen colonies across the range and determine the distribution and prevalence of the identified viral species.

Keywords: De novo assembly; High-throughput sequencing; In silico virus identification; Phylogeny; Pine processionary moth; Transcriptome.