Gene set-based module discovery in the breast cancer transcriptome

BMC Bioinformatics. 2009 Feb 26:10:71. doi: 10.1186/1471-2105-10-71.

Abstract

Background: Although microarray-based studies have revealed global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which is defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods are not applied cancer transcriptome data.

Results: In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtypes of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2) is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells.

Conclusion: These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / metabolism
  • Computational Biology / methods*
  • E2F Transcription Factors / genetics
  • E2F Transcription Factors / metabolism
  • Female
  • Gene Expression Profiling*
  • Humans
  • Mice
  • Oligonucleotide Array Sequence Analysis / methods
  • Polycomb-Group Proteins
  • Repressor Proteins / genetics
  • Repressor Proteins / metabolism

Substances

  • E2F Transcription Factors
  • Polycomb-Group Proteins
  • Repressor Proteins