HiCBin: binning metagenomic contigs and recovering metagenome-assembled genomes using Hi-C contact maps

Genome Biol. 2022 Feb 28;23(1):63. doi: 10.1186/s13059-022-02626-w.

Abstract

Recovering high-quality metagenome-assembled genomes (MAGs) from complex microbial ecosystems remains challenging. Recently, high-throughput chromosome conformation capture (Hi-C) has been applied to simultaneously study multiple genomes in natural microbial communities. We develop HiCBin, a novel open-source pipeline, to resolve high-quality MAGs utilizing Hi-C contact maps. HiCBin employs the HiCzin normalization method and the Leiden clustering algorithm and includes the spurious contact detection into binning pipelines for the first time. HiCBin is validated on one synthetic and two real metagenomic samples and is shown to outperform the existing Hi-C-based binning methods. HiCBin is available at https://github.com/dyxstat/HiCBin .

Keywords: Contig binning; Leiden clustering; Metagenomic Hi-C; Potts model; Spurious contact detection; Zero-inflated negative binomial normalization.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Metagenome*
  • Metagenomics / methods
  • Microbiota* / genetics