Modular structure, sequence diversification and appropriate nomenclature of seroins produced in the silk glands of Lepidoptera

Sci Rep. 2019 Mar 7;9(1):3797. doi: 10.1038/s41598-019-40401-3.

Abstract

Seroins are small lepidopteran silk proteins known to possess antimicrobial activities. Several seroin paralogs and isoforms have been identified in the lepidopteran species studied to date, but their classification required a detailed phylogenetic analysis based on complete and verified cDNA sequences. We sequenced silk gland-specific cDNA libraries from ten species and identified 52 novel seroin cDNAs. The results of this targeted research, combined with data retrieved from available databases, form a dataset representing the major clades of Lepidoptera. Analysis of the deduced seroin proteins distinguished three seroin classes (sn1-sn3), each composed of modules: A (includes the signal peptide), B (rich in charged amino acids) and C (a highly variable, proline-containing linker). Sequence similarity was 31-50% within classes and 22.5-25% between classes. All species express one, and in exceptional cases two, genes per class, and alternative splicing further enhances seroin diversity. Seroins occur in long versions with the full set of modules (AB1C1B2C2B3) and/or in short versions that lack part or all of the B and C modules. The classes and the modular structure of seroins probably evolved prior to the split between Trichoptera and Lepidoptera. The diversity of seroins is reflected in the proposed nomenclature.
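
The long and short versions described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the names `FULL_ARCHITECTURE` and `classify_version` are hypothetical, and the sketch assumes that module A is always present and that the remaining modules keep the AB1C1B2C2B3 order when some are missing.

```python
# Full module order of a long seroin, as given in the abstract: AB1C1B2C2B3.
FULL_ARCHITECTURE = ("A", "B1", "C1", "B2", "C2", "B3")


def classify_version(modules):
    """Return 'long' if the full module set is present, otherwise 'short'.

    `modules` is a sequence of module labels in the order they occur.
    Raises ValueError for labels or orderings outside the full architecture.
    """
    modules = tuple(modules)
    # Assumption (not stated in the abstract): module A, which carries the
    # signal peptide, is present in every seroin and comes first.
    if not modules or modules[0] != "A":
        raise ValueError("expected module A first")
    # Check that the labels form an in-order subsequence of the full set.
    # Membership tests on an iterator consume it up to the matching item.
    remaining = iter(FULL_ARCHITECTURE)
    for label in modules:
        if label not in remaining:
            raise ValueError(f"unexpected module or order: {label}")
    return "long" if modules == FULL_ARCHITECTURE else "short"
```

For example, `classify_version(["A", "B1", "C1", "B2", "C2", "B3"])` yields `"long"`, while a hypothetical seroin reduced to `["A", "B1"]` yields `"short"`.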

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Animals
  • Databases, Protein
  • Insect Proteins / genetics
  • Insect Proteins / metabolism*
  • Lepidoptera / genetics
  • Lepidoptera / metabolism*
  • Protein Conformation
  • Silk / metabolism*

Substances

  • Insect Proteins
  • Silk