A transcriptome atlas of silkworm silk glands revealed by PacBio single-molecule long-read sequencing

Mol Genet Genomics. 2020 Sep;295(5):1227-1237. doi: 10.1007/s00438-020-01691-9. Epub 2020 Jun 10.

Abstract

The silk gland of the silkworm Bombyx mori is a specialized organ where silk proteins are efficiently synthesized under precise regulation that largely determines the properties of silk fibers. To understand the genes involved in the regulation of silk protein synthesis, considerable research has focused on the transcripts expressed in silk glands; however, the complete transcriptome profile of this organ has yet to be elucidated. Here, we report a full-length silk gland transcriptome obtained by PacBio single-molecule long-read sequencing technology. In total, 11,697 non-redundant transcripts were identified in mixed samples of silk glands dissected from larvae at five developmental stages. When compared with the published reference, the full-length transcripts optimized the structures of 3002 known genes, and a total of 9061 novel transcripts with an average length of 2171 bp were detected. Among these, 1403 (15.5%) novel transcripts were computationally revealed to be lncRNAs, 8135 (89.8%) novel transcripts were annotated to different protein and nucleotide databases, and 5655 (62.4%) novel transcripts were predicted to have complete ORFs. Furthermore, we found 1867 alternative splicing events, 2529 alternative polyadenylation events, 784 fusion events and 6596 SSRs. This study provides a comprehensive set of reference transcripts and greatly revises and expands the available silkworm transcript data. In addition, these data will be very useful for studying the regulatory mechanisms of silk protein synthesis.

Keywords: Full-length transcriptome; Novel transcripts; PacBio RS II; Silk gland; Silkworm.

MeSH terms

  • Alternative Splicing
  • Animals
  • Bombyx / genetics
  • Bombyx / growth & development*
  • Exome Sequencing
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Developmental
  • High-Throughput Nucleotide Sequencing
  • Insect Proteins / genetics
  • Open Reading Frames
  • Polyadenylation
  • RNA, Long Noncoding / genetics
  • Silk / genetics*
  • Single Molecule Imaging / methods*

Substances

  • Insect Proteins
  • RNA, Long Noncoding
  • Silk