The dataset of predicted trypsin serine peptidases and their inactive homologs in Tenebrio molitor transcriptomes

Data Brief. 2021 Aug 16:38:107301. doi: 10.1016/j.dib.2021.107301. eCollection 2021 Oct.

Abstract

Tenebrio molitor is an important coleopteran model insect and agricultural pest from the Tenebrionidae family. We used RNA-Seq transcriptome data from T. molitor to annotate trypsin-like sequences from the chymotrypsin S1 family of serine peptidases, including sequences of active serine peptidases (SerP) and their inactive homologs (SerPH) in T. molitor transcriptomes. A total of 63 S1 family tryspin-like serine peptidase sequences were de novo assembled. Among the sequences, 58 were predicted to be active trypsins and five inactive SerPH. The length of preproenzyme and mature form of the predicted enzyme, position of signal peptide and proenzyme cleavage sites, molecular mass, active site and S1 substrate binding subsite residues, and transmembrane and regulatory domains were analyzed using bioinformatic tools. The data can be used for further physiological, biochemical, and phylogenetic study of tenebrionid pests and other animal systems.

Keywords: Inactive peptidase homologs; Pseudopeptidases; Serine peptidases; Tenebrio molitor; Tenebrionidae; Trypsin.