PyInteraph2 and PyInKnife2 to Analyze Networks in Protein Structural Ensembles

J Chem Inf Model. 2023 Jul 24;63(14):4237-4245. doi: 10.1021/acs.jcim.3c00574. Epub 2023 Jul 12.

Abstract

Due to the complex nature of noncovalent interactions and their long-range effects, analyzing protein conformations using network theory can be enlightening. Protein Structure Networks (PSNs) provide a convenient formalism to study protein structures in relation to essential properties such as key residues for structural stability, allosteric communication, and the effects of modifications of the protein. PSNs can be defined according to very different principles, and the available tools have limitations in input formats, supported models, and version control. Other outstanding problems are related to the definition of network cutoffs and the assessment of the stability of the network properties. The protein science community could benefit from a common framework to carry out these analyses and make them easier to reproduce, reuse, and evaluate. We here provide two open-source software packages, PyInteraph2 and PyInKnife2, to implement and analyze PSNs in a reproducible and documented manner. PyInteraph2 interfaces with multiple formats for protein ensembles and incorporates different network models with the possibility of integrating them into a macronetwork and performing various downstream analyses, including hubs, connected components, and several other centrality measures, and visualizes the networks or further analyzes them thanks to compatibility with Cytoscape.PyInKnife2 that supports the network models implemented in PyInteraph2. It employs a jackknife resampling approach to estimate the convergence of network properties and streamline the selection of distance cutoffs. We foresee that the modular structure of the code and the supported version control system will promote the transition to a community-driven effort, boost reproducibility, and establish common protocols in the PSN field. As developers, we will guarantee the introduction of new functionalities and maintenance, assistance, and training of new contributors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Protein Conformation
  • Proteins* / chemistry
  • Reproducibility of Results
  • Software*

Substances

  • Proteins