Exploring patterns of epigenetic information with data mining techniques

Vanessa Aguiar-Pulido; José A Seoane; Marcos Gestal; Julián Dorado

Exploring patterns of epigenetic information with data mining techniques

Curr Pharm Des. 2013;19(4):779-89.

Authors

Vanessa Aguiar-Pulido¹, José A Seoane, Marcos Gestal, Julián Dorado

Affiliation

¹ Department of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, S/N, 15071 A Coruña, Spain. vaguiar@udc.es

PMID: 23016855

Abstract

Data mining, a part of the Knowledge Discovery in Databases process (KDD), is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Analyses of epigenetic data have evolved towards genome-wide and high-throughput approaches, thus generating great amounts of data for which data mining is essential. Part of these data may contain patterns of epigenetic information which are mitotically and/or meiotically heritable determining gene expression and cellular differentiation, as well as cellular fate. Epigenetic lesions and genetic mutations are acquired by individuals during their life and accumulate with ageing. Both defects, either together or individually, can result in losing control over cell growth and, thus, causing cancer development. Data mining techniques could be then used to extract the previous patterns. This work reviews some of the most important applications of data mining to epigenetics.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Aging
Animals
Artificial Intelligence*
Computational Biology / methods
Data Mining / methods*
Databases, Factual
Epigenesis, Genetic*
Gene Expression
Genome-Wide Association Study / methods
High-Throughput Screening Assays / methods
Humans
Mutation