Data on the application of the molecular vector machine model: A database of protein pentafragments and computer software for predicting and designing secondary protein structures

Data Brief. 2019 Nov 19:28:104815. doi: 10.1016/j.dib.2019.104815. eCollection 2020 Feb.

Abstract

Based on ideas about the molecular vector machine of proteins [1], a database of protein pentafragments has been created and algorithms have been proposed for predicting the secondary structure of proteins according to their primary structure and for designing the primary protein structure for a given secondary structure that it takes on. A comprehensive software suite (Predicto @ Designer) has been developed using the pentafragments database and the said algorithms. For the proteins used to create the pentafragments database, a high accuracy (close to 100%) in predicting the secondary protein structure as well as good prospects for its use for designing secondary structures of proteins have been demonstrated.

Keywords: Database of protein pentafragments; Molecular vector machine; Software for predicting and design the secondary protein structure.