Gene Ontology Capsule GAN: an improved architecture for protein function prediction

PeerJ Comput Sci. 2022 Aug 15:8:e1014. doi: 10.7717/peerj-cs.1014. eCollection 2022.

Abstract

Proteins are the core of all functions pertaining to living things. They consist of an extended amino acid chain folding into a three-dimensional shape that dictates their behavior. Currently, convolutional neural networks (CNNs) have been pivotal in predicting protein functions based on protein sequences. While it is a technology crucial to the niche, the computation cost and translational invariance associated with CNN make it impossible to detect spatial hierarchies between complex and simpler objects. Therefore, this research utilizes capsule networks to capture spatial information as opposed to CNNs. Since capsule networks focus on hierarchical links, they have a lot of potential for solving structural biology challenges. In comparison to the standard CNNs, our results exhibit an improvement in accuracy. Gene Ontology Capsule GAN (GOCAPGAN) achieved an F1 score of 82.6%, a precision score of 90.4% and recall score of 76.1%.

Keywords: Capsule networks; Deep learning; Gene ontology; Generative adversarial networks; Protein function prediction; Transfer learning.

Grants and funding

The authors received no funding for this work.