Deep generative design of RNA family sequences

Nat Methods. 2024 Mar;21(3):435-443. doi: 10.1038/s41592-023-02148-8. Epub 2024 Jan 18.

Abstract

RNA engineering has immense potential to drive innovation in biotechnology and medicine. Despite its importance, a versatile platform for the automated design of functional RNA is still lacking. Here, we propose RNA family sequence generator (RfamGen), a deep generative model that designs RNA family sequences in a data-efficient manner by explicitly incorporating alignment and consensus secondary structure information. RfamGen can generate novel and functional RNA family sequences by sampling points from a semantically rich and continuous representation. We have experimentally demonstrated the versatility of RfamGen using diverse RNA families. Furthermore, we confirmed the high success rate of RfamGen in designing functional ribozymes through a quantitative massively parallel assay. Notably, RfamGen successfully generates artificial sequences with higher activity than natural sequences. Overall, RfamGen significantly improves our ability to design functional RNA and opens up new potential for generative RNA engineering in synthetic biology.

MeSH terms

  • Biotechnology
  • Humans
  • RNA / genetics
  • RNA, Catalytic* / chemistry
  • RNA, Catalytic* / genetics
  • Synthetic Biology

Substances

  • RNA, Catalytic
  • RNA