A convenient protein library for spectroscopic calibrations

Comput Struct Biotechnol J. 2020 Jul 10;18:1864-1876. doi: 10.1016/j.csbj.2020.07.001. eCollection 2020.

Abstract

While several Raman, CD or FTIR spectral libraries are available for well-characterized proteins of known structure, the proteins themselves are usually very difficult to acquire, which prevents convenient calibration of new instruments and new recording methods. The problem is particularly critical in the field of FTIR spectroscopy, where numerous new methods are becoming available on the market. The present paper reports the construction of a protein library (cSP92) composed of commercially available products that are well characterized experimentally for their purity and solubility under conditions compatible with the recording of FTIR spectra, and whose high-resolution structures are available. Overall, 92 proteins were selected. These proteins cover the CATH space well at the level of classes and architectures. In terms of secondary structure content, an analysis of their high-resolution structures by DSSP shows that the mean content of the different secondary structures present in cSP92 is very similar to the mean content found in the PDB. The 92-protein set is analyzed in detail for the distribution of helix lengths, the number of strands in β-sheets, the length of β-strands and amino acid content, all features that may be important for the interpretation of FTIR spectra.
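
The kind of secondary structure statistics mentioned above (mean content per structure type, distribution of helix and strand lengths) can be illustrated with a minimal sketch. It assumes that each protein has already been reduced to a per-residue string of standard DSSP one-letter codes (H for α-helix, E for β-strand, with "-" used here for unassigned residues). The function names, the toy input strings, and the choice to average per-protein fractions with equal weight are illustrative assumptions, not the procedure used by the authors.

```python
from collections import Counter
from itertools import groupby


def ss_fractions(dssp):
    """Fraction of residues assigned to each DSSP code in one protein."""
    counts = Counter(dssp)
    total = len(dssp)
    return {code: n / total for code, n in counts.items()}


def segment_lengths(dssp, code):
    """Lengths of consecutive runs of `code`, e.g. helix lengths for 'H'."""
    return [len(list(run)) for c, run in groupby(dssp) if c == code]


def mean_content(dssp_strings):
    """Mean per-protein fraction of each code (assumption: equal weight per protein)."""
    per_protein = [ss_fractions(s) for s in dssp_strings]
    codes = sorted({c for fractions in per_protein for c in fractions})
    n = len(per_protein)
    return {c: sum(f.get(c, 0.0) for f in per_protein) / n for c in codes}


# Hypothetical toy strings, not data from cSP92.
toy_library = ["HHHH--EEE-HH-EE", "HHHHEE--"]
library_mean = mean_content(toy_library)
helix_lengths = [n for s in toy_library for n in segment_lengths(s, "H")]
strand_lengths = [n for s in toy_library for n in segment_lengths(s, "E")]
```

Comparing a library with a reference set such as the PDB would then amount to calling `mean_content` on both sets of strings and inspecting the differences code by code.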

Keywords: FTIR spectroscopy; Protein selection; Protein spectroscopy; Secondary structure.