Analysis and Prediction of Pathogen Nucleic Acid Specificity for Toll-like Receptors in Vertebrates

J Mol Biol. 2023 Sep 1;435(17):168208. doi: 10.1016/j.jmb.2023.168208. Epub 2023 Jul 20.

Abstract

Identification of key sequence, expression and function related features of nucleic acid-sensing host proteins is of fundamental importance to understand the dynamics of pathogen-specific host responses. To meet this objective, we considered toll-like receptors (TLRs), a representative class of membrane-bound sensor proteins, from 17 vertebrate species covering mammals, birds, reptiles, amphibians, and fishes in this comparative study. We identified the molecular signatures of host TLRs that are responsible for sensing pathogen nucleic acids or other pathogen-associated molecular patterns (PAMPs), and potentially play important roles in host defence mechanism. Interestingly, our findings reveal that such host-specific features are directly related to the strand (single or double) specificity of nucleic acid from pathogens. However, during host-pathogen interactions, such features were unable to explain the pathogenic PAMP (i.e., DNA, RNA or other) selectivity, suggesting a more complex mechanism. Using these features, we developed a number of machine learning models, of which Random Forest achieved a high performance (94.57% accuracy) to predict strand specificity of TLRs from protein-derived features. We applied the trained model to propose strand specificity of some previously uncharacterized distinct fish-specific novel TLRs (TLR18, TLR23, TLR24, TLR25, TLR27).

Keywords: gene age; gene expression; leucine-Rich-Repeats; machine learning approach; toll-like receptors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Evolution, Molecular
  • Fishes
  • Host-Pathogen Interactions* / immunology
  • Immunity, Innate*
  • Mammals / genetics
  • Nucleic Acids* / chemistry
  • Phylogeny
  • Substrate Specificity
  • Toll-Like Receptors* / chemistry
  • Toll-Like Receptors* / genetics
  • Vertebrates* / genetics
  • Vertebrates* / immunology

Substances

  • Nucleic Acids
  • Toll-Like Receptors