Proteogenomics of the house dust mite, Dermatophagoides farinae: Allergen repertoire, accurate allergen identification, isoforms, and sex-biased proteome differences

J Proteomics. 2020 Jan 6:210:103535. doi: 10.1016/j.jprot.2019.103535. Epub 2019 Oct 17.

Abstract

The allergen repertoire of the house dust mite, Dermatophagoides farinae, is incomplete despite most mite allergens having been described in this species. Using proteogenomics, we aimed to compare proteins and allergens between sexes and provide a foundation for the identification of novel allergens. Overall, 6297 protein hits were identified, and 2899 and 886 were male- and female-specific, respectively. Removal of trace results narrowed the dataset to 3478 hits, including 275 and 157 male- and female-specific hits, respectively. All 34 WHO/IUIS-approved D. farinae allergens (omitting Der f 17) were identified, and we also identified homologs of the yet undescribed Der f 9 and 38. Der f 27/serpin exhibited the largest sex-dependent difference and was dominant in females. Using official protein sequences, Der f 11, 14, 23, 28 and 30 were identified with low success. However, identification success of Der f 11 and 14 was greatly increased by using longer/complete sequences. Der f 30 is characterized by the same tryptic digests as the more abundant Der f 30 (isoform) identified here. Der f 23 appears to be of low abundance in mite bodies. Der f 28.0101 and Der f 28.0201 were detected at low abundance and in trace amounts, respectively. SIGNIFICANCE: In this work, we performed a proteogenomic annotation of the house dust mite, Dermatophagoides farinae, which is the most important source of house dust allergens. The proteogenomic analysis performed here provides a foundation for not only understanding the biology of the mite but also the identification of novel allergens. This study generated a robust proteomic dataset for D. farinae and reviewed existing and candidate allergens in this species. We stress some pitfalls of high-throughput analyses, especially that improper headers of allergen protein records provided in databases can lead to confusion. Using partial sequences in proteomic identification and quantification can lead to low identification success (low signal intensity or MS/MS counts). Thus, we individually curated the protein sequences for proper identification and quantification. The discovered sex differences can be one factor affecting allergen/immunogen variations in mite extracts. Overall, this work provides a benchmark for accurate identification of mite immunogenic proteins using proteomics.

Keywords: Der f 28; Der f 38; Der f 9; Ferritin Der f 30; Label-free proteomics; Serpin Der f 27.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Allergens / genetics
  • Allergens / immunology
  • Allergens / metabolism*
  • Amino Acid Sequence
  • Animals
  • Arthropod Proteins / genetics
  • Arthropod Proteins / immunology
  • Arthropod Proteins / metabolism*
  • Dermatophagoides farinae / genetics
  • Dermatophagoides farinae / immunology
  • Dermatophagoides farinae / metabolism*
  • Female
  • Male
  • Proteogenomics / methods*
  • Proteome / metabolism*
  • Pyroglyphidae / genetics
  • Pyroglyphidae / immunology
  • Pyroglyphidae / metabolism*
  • Sequence Homology
  • Sex Factors

Substances

  • Allergens
  • Arthropod Proteins
  • Proteome