JAWAMix5: an out-of-core HDF5-based java implementation of whole-genome association studies using mixed models

Bioinformatics. 2013 May 1;29(9):1220-2. doi: 10.1093/bioinformatics/btt122. Epub 2013 Mar 11.

Abstract

Summary: We present JAWAMix5, an out-of-core open-source toolkit for association mapping using high-throughput sequence data. Taking advantage of its HDF5-based implementation, JAWAMix5 stores genotype data on disk and accesses them as though stored in main memory. Therefore, it offers a scalable and fast analysis without concerns about memory usage, whatever the size of the dataset. We have implemented eight functions for association studies, including standard methods (linear models, linear mixed models, rare variants test, analysis in nested association mapping design and local variance component analysis), as well as a novel Bayesian local variance component analysis. Application to real data demonstrates that JAWAMix5 is reasonably fast compared with traditional solutions that load the complete dataset into memory, and that the memory usage is efficient regardless of the dataset size.

Availability: The source code, a 'batteries-included' executable and user manual can be freely downloaded from http://code.google.com/p/jawamix5/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Genome-Wide Association Study / methods*
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Linear Models
  • Software*