The Bio3D packages for structural bioinformatics

Protein Sci. 2021 Jan;30(1):20-30. doi: 10.1002/pro.3923. Epub 2020 Aug 17.

Abstract

Bio3D is a family of R packages for the analysis of biomolecular sequence, structure, and dynamics. Major functionality includes biomolecular database searching and retrieval, sequence and structure conservation analysis, ensemble normal mode analysis, protein structure and correlation network analysis, principal component, and related multivariate analysis methods. Here, we review recent package developments, including a new underlying segregation into separate packages for distinct analysis, and introduce a new method for structure analysis named ensemble difference distance matrix analysis (eDDM). The eDDM approach calculates and compares atomic distance matrices across large sets of homologous atomic structures to help identify the residue wise determinants underlying specific functional processes. An eDDM workflow is detailed along with an example application to a large protein family. As a new member of the Bio3D family, the Bio3D-eddm package supports both experimental and theoretical simulation-generated structures, is integrated with other methods for dissecting sequence-structure-function relationships, and can be used in a highly automated and reproducible manner. Bio3D is distributed as an integrated set of platform independent open source R packages available from: http://thegrantlab.org/bio3d/.

Keywords: allosteric regulation; distance matrix analysis; functional dynamics; molecular dynamics; normal mode analysis; principal component analysis; protein sequence; protein structure; protein structure network; structural bioinformatics.

MeSH terms

  • Computational Biology*
  • Databases, Protein*
  • Molecular Dynamics Simulation*
  • Protein Conformation
  • Proteins / chemistry*
  • Software*

Substances

  • Proteins