Pgltools: a genomic arithmetic tool suite for manipulation of Hi-C peak and other chromatin interaction data

BMC Bioinformatics. 2017 Apr 7;18(1):207. doi: 10.1186/s12859-017-1621-0.

Abstract

Background: Genomic interaction studies use next-generation sequencing (NGS) to examine the interactions between two loci on the genome, with subsequent bioinformatics analyses typically including annotation, intersection, and merging of data from multiple experiments. While many file types and analysis tools exist for storing and manipulating single locus NGS data, there is currently no file standard or analysis tool suite for manipulating and storing paired-genomic-loci: the data type resulting from "genomic interaction" studies. As genomic interaction sequencing data are becoming prevalent, a standard file format and tools for working with these data conveniently and efficiently are needed.

Results: This article details a file standard and novel software tool suite for working with paired-genomic-loci data. We present the paired-genomic-loci (PGL) file standard for genomic-interactions data, and the accompanying analysis tool suite "pgltools": a cross platform, pypy compatible python package available both as an easy-to-use UNIX package, and as a python module, for integration into pipelines of paired-genomic-loci analyses.

Conclusions: Pgltools is a freely available, open source tool suite for manipulating paired-genomic-loci data. Source code, an in-depth manual, and a tutorial are available publicly at www.github.com/billgreenwald/pgltools , and a python module of the operations can be installed from PyPI via the PyGLtools module.

Keywords: Bedtools; Chromatin conformation capture; Genomic arithmetic; Hi-CChIA-PET; Paired-genomic-loci; Peak; Tool suite.

MeSH terms

  • Chromatin / genetics
  • Chromatin / metabolism*
  • Chromatin Immunoprecipitation
  • Genetic Loci
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Software*

Substances

  • Chromatin