Manhattan Harvester and Cropper: a system for GWAS peak detection

BMC Bioinformatics. 2019 Jan 11;20(1):22. doi: 10.1186/s12859-019-2600-4.

Abstract

Background: Selection of interesting regions from genome wide association studies (GWAS) is typically performed by eyeballing of Manhattan Plots. This is no longer possible with thousands of different phenotypes. There is a need for tools that can automatically detect genomic regions that correspond to what the experienced researcher perceives as peaks worthwhile of further study.

Results: We developed Manhattan Harvester, a tool designed for "peak extraction" from GWAS summary files and computation of parameters characterizing various aspects of individual peaks. We present the algorithms used and a model for creating a general quality score that evaluates peaks similarly to that of a human researcher. Our tool Cropper utilizes a graphical interface for inspecting, cropping and subsetting Manhattan Plot regions. Cropper is used to validate and visualize the regions detected by Manhattan Harvester.

Conclusions: We conclude that our tools fill the current void in automatically screening large number of GWAS output files in batch mode. The interesting regions are detected and quantified by various parameters by Manhattan Harvester. Cropper offers graphical tools for in-depth inspection of the regions. The tools are open source and freely available.

Keywords: GWAS; Manhattan plots; Peak detection; Peak quality score; Software.

MeSH terms

  • Computer Graphics*
  • Data Interpretation, Statistical*
  • Data Mining / methods*
  • Genome-Wide Association Study / statistics & numerical data*
  • Genomics / methods*
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Software*