The MAGMA pipeline for comprehensive genomic analyses of clinical Mycobacterium tuberculosis samples

PLoS Comput Biol. 2023 Nov 29;19(11):e1011648. doi: 10.1371/journal.pcbi.1011648. eCollection 2023 Nov.

Abstract

Background: Whole genome sequencing (WGS) holds great potential for the management and control of tuberculosis. Accurate analysis of samples with low mycobacterial burden, which are characterized by low (<20x) coverage and high (>40%) levels of contamination, is challenging. We created the MAGMA (Maximum Accessible Genome for Mtb Analysis) bioinformatics pipeline for analysis of clinical Mtb samples.

Methods and results: High accuracy variant calling is achieved by using a long seedlength during read mapping to filter out contaminants, variant quality score recalibration with machine learning to identify genuine genomic variants, and joint variant calling for low Mtb coverage genomes. MAGMA automatically generates a standardized and comprehensive output of drug resistance information and resistance classification based on the WHO catalogue of Mtb mutations. MAGMA automatically generates phylogenetic trees with drug resistance annotations and trees that visualize the presence of clusters. Drug resistance and phylogeny outputs from sequencing data of 79 primary liquid cultures were compared between the MAGMA and MTBseq pipelines. The MTBseq pipeline reported only a proportion of the variants in candidate drug resistance genes that were reported by MAGMA. Notable differences were in structural variants, variants in highly conserved rrs and rrl genes, and variants in candidate resistance genes for bedaquiline, clofazmine, and delamanid. Phylogeny results were similar between pipelines but only MAGMA visualized clusters.

Conclusion: The MAGMA pipeline could facilitate the integration of WGS into clinical care as it generates clinically relevant data on drug resistance and phylogeny in an automated, standardized, and reproducible manner.

MeSH terms

  • Genome
  • Genomics
  • Humans
  • Mycobacterium tuberculosis* / genetics
  • Phylogeny
  • Tuberculosis* / drug therapy
  • Tuberculosis* / genetics

Grants and funding

This research was funded by the Research Foundation Flanders (FWO) strategic basic research grant 1SB4519N, (PhD funding for L.V.), the FWO Odysseus grant G0F8316N, and the FWO applied biomedical research with a primary social finality T001018N to A.V.R. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.