Multi-Domain Networks Association for Biological Data Using Block Signed Graph Clustering

IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):435-448. doi: 10.1109/TCBB.2018.2848904. Epub 2018 Jun 25.

Abstract

Multi-domain biological network association and clustering have attracted a lot of attention in biological data integration and understanding, which can provide a more global and accurate understanding of biological phenomenon. In many problems, different domains may have different cluster structures. Due to rapid growth of data collection from different sources, some domains may be strongly or weakly associated with the other domains. A key challenge is how to determine the degree of association among different domains, and to achieve accurate clustering results by data integration. In this paper, we propose an unsupervised learning approach for multi-domain network association by using block signed graph clustering. In particular, with consistency weights calculation, the proposed algorithm automatically identify domains relevant to each other strongly (or weakly) by assigning them larger (or smaller) weights. This approach not only significantly improve clustering accuracy but also understand multi-domain networks association. In each iteration of the proposed algorithm, we update consistency weights based on cluster structure of each domain, and then make use of different sets of eigenvectors to obtain different cluster structures in each domain. Experimental results on both synthetic data sets and real data sets (including neuron activity data and gene expression data) empirically demonstrate the effectiveness of the proposed algorithm in clustering performance and in domain association capability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Caenorhabditis elegans
  • Cluster Analysis*
  • Computational Biology / methods*
  • Databases, Factual
  • Humans
  • Neoplasms / genetics
  • Neoplasms / metabolism
  • Neurophysiology
  • Unsupervised Machine Learning*