Analysis of Protein Intermolecular Interactions with MAFFT-DASH

Methods Mol Biol. 2021:2231:163-177. doi: 10.1007/978-1-0716-1036-7_11.

Abstract

The Database of Aligned Structural Homologs (DASH) is a tool for efficiently navigating the Protein Data Bank (PDB) by means of pre-computed pairwise structural alignments. We recently showed that, by integrating DASH structural alignments with the multiple sequence alignment (MSA) software MAFFT, we were able to significantly improve MSA accuracy without dramatically increasing manual or computational complexity. In the latest DASH update, such queries are not limited to PDB entries but can also be launched from user-provided protein coordinates. Here, we describe a further extension of DASH that retrieves intermolecular interactions of all structurally similar domains in the PDB to a query domain of interest. We illustrate these new features using a model of the NYN domain of the ribonuclease N4BP1 as an example. We show that the protein-nucleotide interactions returned are distributed on the surface of the NYN domain in an asymmetric manner, roughly centered on the known nuclease active site.

Keywords: Binding site prediction; Database query; Protein structural alignment; Protein-nucleotide interaction; Protein-protein interaction; RNA structure; Ribonuclease; Structural domain.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Computational Biology
  • Databases, Protein
  • Nuclear Proteins / chemistry
  • Protein Binding
  • Protein Domains
  • RNA-Binding Proteins / chemistry*
  • Ribonucleases / chemistry
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Software*

Substances

  • N4BP1 protein, human
  • Nuclear Proteins
  • RNA-Binding Proteins
  • Ribonucleases