ProtNAff: protein-bound Nucleic Acid filters and fragment libraries

Bioinformatics. 2022 Aug 10;38(16):3911-3917. doi: 10.1093/bioinformatics/btac430.

Abstract

Motivation: Atomistic models of nucleic acids (NA) fragments can be used to model the 3D structures of specific protein-NA interactions and address the problem of great NA flexibility, especially in their single-stranded regions. One way to obtain relevant NA fragments is to extract them from existing 3D structures corresponding to the targeted context (e.g. specific 2D structures, protein families, sequences) and to learn from them. Several databases exist for specific NA 3D motifs, especially in RNA, but none can handle the variety of possible contexts.

Results: This article presents protNAff (protein-bound Nucleic Acids filters and fragments), a new pipeline for the conception of searchable databases on the 2D and 3D structures of protein-bound NA, the selection of context-specific (regions of) NA structures by combinations of filters, and the creation of context-specific NA fragment libraries. The strength of this pipeline is its modularity, allowing users to adapt it to many specific modeling problems. As examples, the pipeline is applied to the quantitative analysis of (i) the sequence-specificity of trinucleotide conformations, (ii) the conformational diversity of RNA at several levels of resolution, (iii) the effect of protein binding on RNA local conformations and (iv) the protein-binding propensity of RNA hairpin loops of various lengths.

Availability and implementation: The source code is freely available for download at URL https://github.com/isaureCdB/protNAff. The database and the trinucleotide fragment library are downloadable at URL https://zenodo.org/record/6483823#.YmbVhFxByV4.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Nucleic Acid Conformation
  • Nucleic Acids*
  • Proteins / chemistry
  • RNA
  • Software

Substances

  • Nucleic Acids
  • Proteins
  • RNA