Reconstruction and functional annotation of Ascosphaera apis full-length transcriptome utilizing PacBio long reads combined with Illumina short reads

J Invertebr Pathol. 2020 Oct:176:107475. doi: 10.1016/j.jip.2020.107475. Epub 2020 Sep 23.

Abstract

Ascosphaera apis is a widespread fungal pathogen of honeybee larvae that results in chalkbrood disease, leading to heavy losses for the beekeeping industry in China and many other countries. This work was aimed at generating a full-length transcriptome of A. apis using PacBio single-molecule real-time (SMRT) sequencing. Here, more than 23.97 Gb of clean reads was generated from long-read sequencing of A. apis mycelia, including 464,043 circular consensus sequences (CCS) and 394,142 full-length non-chimeric (FLNC) reads. In total, we identified 174,095 high-confidence transcripts covering 5141 known genes with an average length of 2728 bp. We also discovered 2405 genic loci and 11,623 isoforms that have not been annotated yet within the current reference genome. Additionally, 16,049, 10,682, 4520 and 7253 of the discovered transcripts have annotations in the Non-redundant protein (Nr), Clusters of Eukaryotic Orthologous Groups (KOG), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Moreover, 1205 long non-coding RNAs (lncRNAs) were identified, which have less exons, shorter exon and intron lengths, shorter transcript lengths, lower GC percent, lower expression levels, and fewer alternative splicing (AS) evens, compared with protein-coding transcripts. A total of 253 members from 17 transcription factor (TF) families were identified from our transcript datasets. Finally, the expression of A. apis isoforms was validated using a molecular approach. Overall, this is the first report of a full-length transcriptome of entomogenous fungi including A. apis. Our data offer a comprehensive set of reference transcripts and hence contributes to improving the genome annotation and transcriptomic study of A. apis.

Keywords: Ascosphaera apis; Chalkbrood; Full-length transcriptome; Honeybee; PacBio.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bees / microbiology
  • Fungal Proteins / analysis
  • High-Throughput Nucleotide Sequencing
  • Onygenales / genetics*
  • RNA, Fungal / analysis
  • RNA, Long Noncoding / analysis
  • Transcription Factors / analysis
  • Transcriptome*

Substances

  • Fungal Proteins
  • RNA, Fungal
  • RNA, Long Noncoding
  • Transcription Factors

Supplementary concepts

  • Ascosphaera apis