StackSSSPred: A Stacking-Based Prediction of Supersecondary Structure from Sequence

Methods Mol Biol. 2019:1958:101-122. doi: 10.1007/978-1-4939-9161-7_5.

Abstract

Supersecondary structure (SSS) refers to specific geometric arrangements of several secondary structure (SS) elements that are connected by loops. The SSS can provide useful information about the spatial structure and function of a protein. As such, the SSS is a bridge between the secondary structure and tertiary structure. In this chapter, we propose a stacking-based machine learning method for the prediction of two types of SSSs, namely, β-hairpins and β-α-β, from the protein sequence based on comprehensive feature encoding. To encode protein residues, we utilize key features such as solvent accessibility, conservation profile, half surface exposure, torsion angle fluctuation, disorder probabilities, and more. The usefulness of the proposed approach is assessed using a widely used threefold cross-validation technique. The obtained empirical result shows that the proposed approach is useful and prediction can be improved further.

Keywords: Beta-alpha-beta; Beta-hairpins; Machine learning; Sequence-based prediction; Stacking; Supersecondary structure prediction.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Motifs*
  • Amino Acid Sequence / genetics
  • Computational Biology / methods*
  • Databases, Protein
  • Models, Molecular
  • Proteins / chemistry*
  • Proteins / genetics
  • Software

Substances

  • Proteins