Exploring patterns of epigenetic information with data mining techniques

Curr Pharm Des. 2013;19(4):779-89.

Abstract

Data mining, a part of the Knowledge Discovery in Databases process (KDD), is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Analyses of epigenetic data have evolved towards genome-wide and high-throughput approaches, thus generating great amounts of data for which data mining is essential. Part of these data may contain patterns of epigenetic information which are mitotically and/or meiotically heritable determining gene expression and cellular differentiation, as well as cellular fate. Epigenetic lesions and genetic mutations are acquired by individuals during their life and accumulate with ageing. Both defects, either together or individually, can result in losing control over cell growth and, thus, causing cancer development. Data mining techniques could be then used to extract the previous patterns. This work reviews some of the most important applications of data mining to epigenetics.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Aging
  • Animals
  • Artificial Intelligence*
  • Computational Biology / methods
  • Data Mining / methods*
  • Databases, Factual
  • Epigenesis, Genetic*
  • Gene Expression
  • Genome-Wide Association Study / methods
  • High-Throughput Screening Assays / methods
  • Humans
  • Mutation