Integrative approaches based on genomic techniques in the functional studies on enhancers

Brief Bioinform. 2023 Nov 22;25(1):bbad442. doi: 10.1093/bib/bbad442.

Abstract

With the development of sequencing technology and the dramatic drop in sequencing cost, the functions of noncoding genes are being characterized in a wide variety of fields (e.g. biomedicine). Enhancers are noncoding DNA elements with vital transcription regulation functions. Tens of thousands of enhancers have been identified in the human genome; however, the location, function, target genes and regulatory mechanisms of most enhancers have not been elucidated thus far. As high-throughput sequencing techniques have leapt forwards, omics approaches have been extensively employed in enhancer research. Multidimensional genomic data integration enables the full exploration of the data and provides novel perspectives for screening, identification and characterization of the function and regulatory mechanisms of unknown enhancers. However, multidimensional genomic data are still difficult to integrate genome wide due to complex varieties, massive amounts, high rarity, etc. To facilitate the appropriate methods for studying enhancers with high efficacy, we delineate the principles, data processing modes and progress of various omics approaches to study enhancers and summarize the applications of traditional machine learning and deep learning in multi-omics integration in the enhancer field. In addition, the challenges encountered during the integration of multiple omics data are addressed. Overall, this review provides a comprehensive foundation for enhancer analysis.

Keywords: data integration; enhancer; high-throughput data analysis; machine learning; multi-omics.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Human
  • Genomics*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Machine Learning
  • Regulatory Sequences, Nucleic Acid*