Gene Teams are on the Field: Evaluation of Variants in Gene-Networks Using High Dimensional Modelling

IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2959-2969. doi: 10.1109/TCBB.2023.3292245. Epub 2023 Oct 9.

Abstract

In medical genetics, each genetic variant is evaluated as an independent entity regarding its clinical importance. However, in most complex diseases, variant combinations in specific gene networks, rather than the presence of a particular single variant, predominates. In the case of complex diseases, disease status can be evaluated by considering the success level of a team of specific variants. We propose a high dimensional modelling based method to analyse all the variants in a gene network together, which we name "Computational Gene Network Analysis" (CoGNA). To evaluate our method, we selected two gene networks, mTOR and TGF- β. For each pathway, we generated 400 control and 400 patient group samples. mTOR and TGF- β pathways contain 31 and 93 genes of varying sizes, respectively. We produced Chaos Game Representation images for each gene sequence to obtain 2-D binary patterns. These patterns were arranged in succession, and a 3-D tensor structure was achieved for each gene network. Features for each data sample were acquired by exploiting Enhanced Multivariance Products Representation to 3-D data. Features were split as training and testing vectors. Training vectors were employed to train a Support Vector Machines classification model. We achieved more than 96% and 99% classification accuracies for mTOR and TGF- β networks, respectively, using a limited amount of training samples.