CSI-LSTM: a web server to predict protein secondary structure using bidirectional long short term memory and NMR chemical shifts

J Biomol NMR. 2021 Dec;75(10-12):393-400. doi: 10.1007/s10858-021-00383-9. Epub 2021 Sep 12.

Abstract

Protein secondary structure provides rich structural information, hence the description and understanding of protein structure relies heavily on it. Identification or prediction of secondary structures therefore plays an important role in protein research. In protein NMR studies, it is more convenient to predict secondary structures from chemical shifts as compared to the traditional determination methods based on inter-nuclear distances provided by NOESY experiment. In recent years, there was a significant improvement observed in deep neural networks, which had been applied in many research fields. Here we proposed a deep neural network based on bidirectional long short term memory (biLSTM) to predict protein 3-state secondary structure using NMR chemical shifts of backbone nuclei. While comparing with the existing methods the proposed method showed better prediction accuracy. Based on the proposed method, a web server has been built to provide protein secondary structure prediction service.

Keywords: Chemical shifts; Deep learning; Long short term memory; Nuclear magnetic resonance; Protein secondary structure prediction.

MeSH terms

  • Memory, Short-Term*
  • Neural Networks, Computer
  • Nuclear Magnetic Resonance, Biomolecular
  • Protein Structure, Secondary
  • Proteins*

Substances

  • Proteins