Revealing the grammar of small RNA secretion using interpretable machine learning

Cell Genom. 2024 Apr 10;4(4):100522. doi: 10.1016/j.xgen.2024.100522. Epub 2024 Mar 8.

Abstract

Small non-coding RNAs can be secreted through a variety of mechanisms, including exosomal sorting, in small extracellular vesicles, and within lipoprotein complexes. However, the mechanisms that govern their sorting and secretion are not well understood. Here, we present ExoGRU, a machine learning model that predicts small RNA secretion probabilities from primary RNA sequences. We experimentally validated the performance of this model through ExoGRU-guided mutagenesis and synthetic RNA sequence analysis. Additionally, we used ExoGRU to reveal cis and trans factors that underlie small RNA secretion, including known and novel RNA-binding proteins (RBPs), e.g., YBX1, HNRNPA2B1, and RBM24. We also developed a novel technique called exoCLIP, which reveals the RNA interactome of RBPs within the cell-free space. Together, our results demonstrate the power of machine learning in revealing novel biological mechanisms. In addition to providing deeper insight into small RNA secretion, this knowledge can be leveraged in therapeutic and synthetic biology applications.

Keywords: ExoCLIP; ExoGRU; extracellular RNA; machine learning; small RNA; small RNA secretion.

MeSH terms

  • Extracellular Vesicles* / metabolism
  • Machine Learning
  • Mutagenesis
  • RNA* / genetics
  • RNA-Binding Proteins / genetics

Substances

  • RNA
  • RNA-Binding Proteins