Metaviz: interactive statistical and visual analysis of metagenomic data

Nucleic Acids Res. 2018 Apr 6;46(6):2777-2787. doi: 10.1093/nar/gky136.

Abstract

Large studies profiling microbial communities and their association with healthy or disease phenotypes are now commonplace. Processed data from many of these studies are publicly available but significant effort is required for users to effectively organize, explore and integrate it, limiting the utility of these rich data resources. Effective integrative and interactive visual and statistical tools to analyze many metagenomic samples can greatly increase the value of these data for researchers. We present Metaviz, a tool for interactive exploratory data analysis of annotated microbiome taxonomic community profiles derived from marker gene or whole metagenome shotgun sequencing. Metaviz is uniquely designed to address the challenge of browsing the hierarchical structure of metagenomic data features while rendering visualizations of data values that are dynamically updated in response to user navigation. We use Metaviz to provide the UMD Metagenome Browser web service, allowing users to browse and explore data for more than 7000 microbiomes from published studies. Users can also deploy Metaviz as a web service, or use it to analyze data through the metavizr package to interoperate with state-of-the-art analysis tools available through Bioconductor. Metaviz is free and open source with the code, documentation and tutorials publicly accessible.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bacteria / classification
  • Bacteria / genetics
  • Child
  • Computational Biology / methods*
  • Computational Biology / statistics & numerical data
  • Diarrhea / diagnosis
  • Diarrhea / genetics
  • Humans
  • Internet
  • Metagenome / genetics*
  • Metagenomics / methods*
  • Metagenomics / statistics & numerical data
  • Reproducibility of Results
  • Web Browser
  • Whole Genome Sequencing / methods*
  • Whole Genome Sequencing / statistics & numerical data