Hi-C analysis: from data generation to integration

Biophys Rev. 2019 Feb;11(1):67-78. doi: 10.1007/s12551-018-0489-1. Epub 2018 Dec 20.

Abstract

In the epigenetics field, experimental techniques based on high-throughput sequencing have produced large-scale functional genomics datasets of ever-increasing size and complexity. In particular, the study of the 3D organization of chromatin has attracted growing interest, thanks to the development of advanced experimental techniques. In this context, Hi-C has been widely adopted as a high-throughput method to measure pairwise contacts between virtually any pair of genomic loci, which poses unprecedented challenges for analyzing and handling the resulting complex datasets. In this review, we focus on the increasing complexity of available Hi-C datasets, which parallels the adoption of novel protocol variants. We also review the complexity of the multiple data analysis steps required to preprocess Hi-C sequencing reads and extract biologically meaningful information. Finally, we discuss solutions for handling and visualizing such large genomics datasets.

Keywords: Chromatin 3D architecture; Chromosome conformation capture; Computational biology; Epigenomics; High-throughput sequencing.

Publication types

  • Review