Constructing gene similarity networks using co-occurrence probabilities

Golrokh Mirzaei

doi:10.1186/s12864-023-09780-w

Constructing gene similarity networks using co-occurrence probabilities

BMC Genomics. 2023 Nov 21;24(1):697. doi: 10.1186/s12864-023-09780-w.

Author

Golrokh Mirzaei¹

Affiliation

¹ Department of Computer Science and Engineering, The Ohio State University, Marion, USA. mirzaei.4@osu.edu.

Abstract

Gene similarity networks play important role in unraveling the intricate associations within diverse cancer types. Conventionally, gauging the similarity between genes has been approached through experimental methodologies involving chemical and molecular analyses, or through the lens of mathematical techniques. However, in our work, we have pioneered a distinctive mathematical framework, one rooted in the co-occurrence of attribute values and single point mutations, thereby establishing a novel approach for quantifying the dissimilarity or similarity among genes. Central to our approach is the recognition of mutations as key players in the evolutionary trajectory of cancer. Anchored in this understanding, our methodology hinges on the consideration of two categorical attributes: mutation type and nucleotide change. These attributes are pivotal, as they encapsulate the critical variations that can precipitate substantial changes in gene behavior and ultimately influence disease progression. Our study takes on the challenge of formulating similarity measures that are intrinsic to genes' categorical data. Taking into account the co-occurrence probability of attribute values within single point mutations, our innovative mathematical approach surpasses the boundaries of conventional methods. We thereby provide a robust and comprehensive means to assess gene similarity and take a significant step forward in refining the tools available for uncovering the subtle yet impactful associations within the complex realm of gene interactions in cancer.

Keywords: Cancer; Co-ccurrence; Gene network; Probability; Similarity network.

MeSH terms

Algorithms*
Gene Regulatory Networks
Humans
Neoplasms* / genetics
Probability