Sequence conservation, phylogenetic relationships, and expression profiles of nondigestive serine proteases and serine protease homologs in Manduca sexta

Insect Biochem Mol Biol. 2015 Jul:62:51-63. doi: 10.1016/j.ibmb.2014.10.006. Epub 2014 Dec 19.

Abstract

Serine protease (SP) and serine protease homolog (SPH) genes in insects encode a large family of proteins involved in digestion, development, immunity, and other processes. While 68 digestive SPs and their close homologs are reported in a companion paper (Kuwar et al., in preparation), we have identified 125 other SPs/SPHs in Manduca sexta and studied their structure, evolution, and expression. Fifty-two of them contain cystine-stabilized structures for molecular recognition, including clip, LDLa, Sushi, Wonton, TSP, CUB, Frizzle, and SR domains. There are nineteen groups of genes evolved from relatively recent gene duplication and sequence divergence. Thirty-five SPs and seven SPHs contain 1, 2 or 5 clip domains. Multiple sequence alignment and molecular modeling of the 54 clip domains have revealed structural diversity of these regulatory modules. Sequence comparison with their homologs in Drosophila melanogaster, Anopheles gambiae and Tribolium castaneum allows us to classify them into five subfamilies: A are SPHs with 1 or 5 group-3 clip domains, B are SPs with 1 or 2 group-2 clip domains, C, D1 and D2 are SPs with a single clip domain in group-1a, 1b and 1c, respectively. We have classified into six categories the 125 expression profiles of SP-related proteins in fat body, brain, midgut, Malpighian tubule, testis, and ovary at different stages, suggesting that they participate in various physiological processes. Through RNA-Seq-based gene annotation and expression profiling, as well as intragenomic sequence comparisons, we have established a framework of information for future biochemical research of nondigestive SPs and SPHs in this model species.

Keywords: Clip domain; Hemolymph protein; Insect immunity; Phylogenetic analysis; RNA-Seq.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Conserved Sequence
  • Insect Proteins / chemistry*
  • Insect Proteins / genetics
  • Manduca / enzymology*
  • Manduca / genetics
  • Models, Molecular
  • Phylogeny*
  • Protein Conformation
  • RNA, Messenger / genetics
  • Sequence Alignment
  • Sequence Analysis, RNA
  • Serine Proteases / chemistry*
  • Serine Proteases / genetics
  • Species Specificity
  • Transcriptome*

Substances

  • Insect Proteins
  • RNA, Messenger
  • Serine Proteases