Secondary Structure Libraries for Artificial Evolution Experiments

Molecules. 2021 Mar 17;26(6):1671. doi: 10.3390/molecules26061671.

Abstract

Methods of artificial evolution such as SELEX and in vitro selection have made it possible to isolate RNA and DNA motifs with a wide range of functions from large random sequence libraries. Once the primary sequence of a functional motif is known, the sequence space around it can be comprehensively explored using a combination of random mutagenesis and selection. However, methods to explore the sequence space of a secondary structure are not as well characterized. Here we address this question by describing a method to construct libraries in a single synthesis which are enriched for sequences with the potential to form a specific secondary structure, such as that of an aptamer, ribozyme, or deoxyribozyme. Although interactions such as base pairs cannot be encoded in a library using conventional DNA synthesizers, it is possible to modulate the probability that two positions will have the potential to pair by biasing the nucleotide composition at these positions. Here we show how to maximize this probability for each of the possible ways to encode a pair (in this study defined as A-U or U-A or C-G or G-C or G.U or U.G). We then use these optimized coding schemes to calculate the number of different variants of model stems and secondary structures expected to occur in a library for a series of structures in which the number of pairs and the extent of conservation of unpaired positions is systematically varied. Our calculations reveal a tradeoff between maximizing the probability of forming a pair and maximizing the number of possible variants of a desired secondary structure that can occur in the library. They also indicate that the optimal coding strategy for a library depends on the complexity of the motif being characterized. Because this approach provides a simple way to generate libraries enriched for sequences with the potential to form a specific secondary structure, we anticipate that it should be useful for the optimization and structural characterization of functional nucleic acid motifs.

Keywords: DNA; RNA; SELEX; aptamer; artificial evolution; deoxyribozyme; in vitro selection; nucleic acids; ribozyme; secondary structure; synthetic biology.

MeSH terms

  • Aptamers, Nucleotide / genetics
  • Base Pairing
  • DNA, Catalytic / genetics
  • Directed Molecular Evolution / methods*
  • Gene Library*
  • In Vitro Techniques
  • Inverted Repeat Sequences / genetics
  • Mutagenesis
  • Nucleic Acid Conformation
  • Nucleotide Motifs / genetics*
  • Probability
  • RNA, Catalytic / genetics
  • Synthetic Biology / methods*

Substances

  • Aptamers, Nucleotide
  • DNA, Catalytic
  • RNA, Catalytic