GLANET: genomic loci annotation and enrichment tool

Bioinformatics. 2017 Sep 15;33(18):2818-2828. doi: 10.1093/bioinformatics/btx326.

Abstract

Motivation: Genomic studies identify genomic loci representing genetic variations, transcription factor (TF) occupancy, or histone modification through next generation sequencing (NGS) technologies. Interpreting these loci requires evaluating them with known genomic and epigenomic annotations.

Results: We present GLANET as a comprehensive annotation and enrichment analysis tool which implements a sampling-based enrichment test that accounts for GC content and/or mappability biases, jointly or separately. GLANET annotates and performs enrichment analysis on these loci with a rich library. We introduce and perform novel data-driven computational experiments for assessing the power and Type-I error of its enrichment procedure which show that GLANET has attained high statistical power and well-controlled Type-I error rate. As a key feature, users can easily extend its library with new gene sets and genomic intervals. Other key features include assessment of impact of single nucleotide variants (SNPs) on TF binding sites and regulation based pathway enrichment analysis.

Availability and implementation: GLANET can be run using its GUI or on command line. GLANET's source code is available at https://github.com/burcakotlu/GLANET . Tutorials are provided at https://glanet.readthedocs.org .

Contact: burcak@ceng.metu.edu.tr or oznur.tastan@cs.bilkent.edu.tr.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • DNA / metabolism
  • Epigenomics / methods
  • Genetic Loci*
  • Genome, Human
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Molecular Sequence Annotation / methods*
  • Polymorphism, Single Nucleotide
  • Protein Binding
  • Sequence Analysis, DNA / methods
  • Software*
  • Transcription Factors / metabolism

Substances

  • Transcription Factors
  • DNA