Molecular Subtyping and Outlier Detection in Human Disease Using the Paraclique Algorithm

Algorithms. 2021 Feb;14(2):63. doi: 10.3390/a14020063. Epub 2021 Feb 19.

Abstract

Recent discoveries of distinct molecular subtypes have led to remarkable advances in treatment for a variety of diseases. While subtyping via unsupervised clustering has received a great deal of interest, most methods rely on basic statistical or machine learning methods. At the same time, techniques based on graph clustering, particularly clique-based strategies, have been successfully used to identify disease biomarkers and gene networks. A graph theoretical approach based on the paraclique algorithm is described that can easily be employed to identify putative disease subtypes and serve as an aid in outlier detection as well. The feasibility and potential effectiveness of this method is demonstrated on publicly available gene co-expression data derived from patient samples covering twelve different disease families.

Keywords: molecular subtyping; outlier detection; paraclique algorithm; transcriptomic data.