Comparison of normalization methods for Hi-C data

Hongqiang Lyu; Erhu Liu; Zhifang Wu

doi:10.2144/btn-2019-0105

Comparison of normalization methods for Hi-C data

Biotechniques. 2020 Feb;68(2):56-64. doi: 10.2144/btn-2019-0105. Epub 2019 Oct 7.

Authors

Hongqiang Lyu¹, Erhu Liu¹, Zhifang Wu¹

Affiliation

¹ School of Electronic & Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China.

PMID: 31588782
DOI: 10.2144/btn-2019-0105

Abstract

Hi-C has been predominately used to study the genome-wide interactions of genomes. In Hi-C experiments, it is believed that biases originating from different systematic deviations lead to extraneous variability among raw samples, and affect the reliability of downstream interpretations. As an important pipeline in Hi-C analysis, normalization seeks to remove the unwanted systematic biases; thus, a comparison between Hi-C normalization methods benefits their choice and the downstream analysis. In this article, a comprehensive comparison is proposed to investigate six Hi-C normalization methods in terms of multiple considerations. In light of comparison results, it has been shown that a cross-sample approach significantly outperforms individual sample methods in most considerations. The differences between these methods are analyzed, some practical recommendations are given, and the results are summarized in a table to facilitate the choice of the six normalization methods. The source code for the implementation of these methods is available at https://github.com/lhqxinghun/bioinformatics/tree/master/Hi-C/NormCompare.

Keywords: Hi-C data; comprehensive comparison; normalization methods.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Animals
Chromatin*
Computational Biology / methods*
Genome*
Humans

Substances

Chromatin