Snowflake: visualizing microbiome abundance tables as multivariate bipartite graphs

Front Bioinform. 2024 Feb 5:4:1331043. doi: 10.3389/fbinf.2024.1331043. eCollection 2024.

Abstract

Current visualizations in microbiome research rely on aggregations in taxonomic classifications or do not show less abundant taxa. We introduce Snowflake: a new visualization method that creates a clear overview of the microbiome composition in collected samples without losing any information due to classification or neglecting less abundant reads. Snowflake displays every observed OTU/ASV in the microbiome abundance table and provides a solution to include the data's hierarchical structure and additional information obtained from downstream analysis (e.g., alpha- and beta-diversity) and metadata. Based on the value-driven ICE-T evaluation methodology, Snowflake was positively received. Experts in microbiome research found the visualizations to be user-friendly and detailed and liked the possibility of including and relating additional information to the microbiome's composition. Exploring the topological structure of the microbiome abundance table allows them to quickly identify which taxa are unique to specific samples and which are shared among multiple samples (i.e., separating sample-specific taxa from the core microbiome), and see the compositional differences between samples. An R package for constructing and visualizing Snowflake microbiome composition graphs is available at https://gitlab.com/vda-lab/snowflake.

Keywords: metagenomics; microbiome composition; taxonomy; visualization application; visualization method.

Grants and funding

The authors declare financial support was received for the research, authorship, and/or publication of this article. This work is funded through Hasselt University BOF grant ADMIRE (BOF21GP17) and BOF grants (BOF20OWB33 and BOF21DOC19), and by the Flemish Government under the “Onderzoeksprogramma 664 Artificiële Intelligentie (AI) Vlaanderen” programme, R-13509.