Protein Repair and Analysis Server: A Web Server to Repair PDB Structures, Add Missing Heavy Atoms and Hydrogen Atoms, and Assign Secondary Structures by Amide Interactions

J Chem Inf Model. 2022 Sep 12;62(17):4232-4246. doi: 10.1021/acs.jcim.2c00571. Epub 2022 Aug 24.

Abstract

The Protein Data Bank (PDB) file format developed at Brookhaven National Laboratory is the most popular file format to store biological data and is widely supported by many software programs for the editing and visualization of macromolecular structures. Unfortunately, many of these structures have variety of issues ranging from missing side chains and/or atoms to alternate locations (rotamers). To fix these flaws, one has to learn a new program, compile modules, and install libraries. To overcome these challenges, we present Protein Repair and Analysis Server (PRAS), an easy-to-use web server to repair protein structures and add missing atoms. The PRAS program flow has two main sections. In the first section, several subroutines are called to deal with sequence microheterogeneity, rotamers, missing side chains, heavy atoms, or hydrogen atoms. A log file is generated detailing irregularities and their fixes as carried out by the program. In the later section, the program uses the error-corrected protein structure to assign secondary structure elements based on amide-amide interactions of the backbone. Plots of four Ramachandran types (general, glycine, proline, and pre-proline) and percentage of secondary structure elements; a log file and the error-corrected PDB file can be downloaded by clicking on the download link generated by the server. The server is freely available and is accessed through its web address at www.protein-science.com. Alternatively, users can download the source code from the server or install the program on their local machines following the instructions at https://pypi.org/project/Pras-Server/ or https://github.com/osita-sunday-nnyigide/Pras_Server. While the present work focuses on the repair of structural data in the PDB format, the server is capable of fixing similar errors in the extensible and newly adopted mmCIF format.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amides*
  • Databases, Protein
  • Hydrogen*
  • Internet
  • Proline
  • Protein Conformation
  • Protein Structure, Secondary
  • Proteins / chemistry
  • Software

Substances

  • Amides
  • Proteins
  • Hydrogen
  • Proline