Gene association analysis: a survey of frequent pattern mining from gene expression data

Brief Bioinform. 2010 Mar;11(2):210-24. doi: 10.1093/bib/bbp042. Epub 2009 Oct 8.

Abstract

Establishing an association between variables is always of interest in genomic studies. Generation of DNA microarray gene expression data introduces a variety of data analysis issues not encountered in traditional molecular biology or medicine. Frequent pattern mining (FPM) has been applied successfully in business and scientific data for discovering interesting association patterns, and is becoming a promising strategy in microarray gene expression analysis. We review the most relevant FPM strategies, as well as surrounding main issues when devising efficient and practical methods for gene association analysis (GAA). We observed that, so far, scalability achieved by efficient methods does not imply biological soundness of the discovered association patterns, and vice versa. Ideally, GAA should employ a balanced mining model taking into account best practices employed by methods reviewed in this survey. Integrative approaches, in which biological knowledge plays an important role within the mining process, are becoming more reliable.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Gene Expression
  • Gene Expression Profiling / methods*
  • Gene Regulatory Networks
  • Oligonucleotide Array Sequence Analysis*
  • Pattern Recognition, Automated / methods*
  • Sequence Analysis, DNA*