Covariance thresholding to detect differentially co-expressed genes from microarray gene expression data

J Bioinform Comput Biol. 2020 Feb;18(1):2050002. doi: 10.1142/S021972002050002X.

Abstract

Gene set analysis aims to identify differentially expressed or co-expressed genes within a biological pathway between two experimental conditions, so that it can eventually reveal biological processes and pathways involved in disease development. In the last few decades, various statistical and computational methods have been proposed to improve statistical power of gene set analysis. In recent years, much attention has been paid to differentially co-expressed genes since they can be potentially disease-related genes without significant difference in average expression levels between two conditions. In this paper, we propose a new statistical method to identify differentially co-expressed genes from microarray gene expression data. The proposed method first estimates co-expression levels of paired genes using covariance regularization by thresholding, and then significance of difference in covariance estimation between two conditions is evaluated. We demonstrated that the proposed method is more powerful than the existing main-stream methods to detect co-expressed genes through extensive simulation studies. Also, we applied it to various microarray gene expression datasets related with mutant p53 transcriptional activity, and epithelium and stroma breast cancer.

Keywords: Co-expressed genes; covariance estimation; hard thresholding; microarray gene expression data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms / genetics*
  • Breast Neoplasms / pathology
  • Computational Biology / methods*
  • Computer Simulation
  • Female
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / statistics & numerical data
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Mutation
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Tumor Suppressor Protein p53 / genetics

Substances

  • TP53 protein, human
  • Tumor Suppressor Protein p53