Annotation and Analysis of 3902 Odorant Receptor Protein Sequences from 21 Insect Species Provide Insights into the Evolution of Odorant Receptor Gene Families in Solitary and Social Insects

Genes (Basel). 2022 May 20;13(5):919. doi: 10.3390/genes13050919.

Abstract

The gene family of insect olfactory receptors (ORs) has expanded greatly over the course of evolution. ORs enable insects to detect volatile chemicals and therefore play an important role in social interactions, enemy and prey recognition, and foraging. The sequences of several thousand ORs are known, but their specific function or their ligands have only been identified for very few of them. To advance the functional characterization of ORs, we have assembled, curated, and aligned the sequences of 3902 ORs from 21 insect species, which we provide as an annotated online resource. Using functionally characterized proteins from the fly Drosophila melanogaster, the mosquito Anopheles gambiae and the ant Harpegnathos saltator, we identified amino acid positions that best predict response to ligands. We examined the conservation of these predicted relevant residues in all OR subfamilies; the results showed that the subfamilies that expanded strongly in social insects had a high degree of conservation in their binding sites. This suggests that the ORs of social insect families are typically finely tuned and exhibit sensitivity to very similar odorants. Our novel approach provides a powerful tool to exploit functional information from a limited number of genes to study the functional evolution of large gene families.

Keywords: chemical binding; insects; machine learning; odorant receptor.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Drosophila melanogaster / metabolism
  • Insect Proteins / metabolism
  • Insecta / genetics
  • Insecta / metabolism
  • Ligands
  • Receptors, Odorant* / genetics
  • Receptors, Odorant* / metabolism

Substances

  • Insect Proteins
  • Ligands
  • Receptors, Odorant

Grants and funding

This work was supported with funds from the Johannes Gutenberg University Research Center for Algorithmic Emergent Intelligence (Carl Zeiss Foundation) for S.F. and M.A.A-N.