P-DOR, an easy-to-use pipeline to reconstruct bacterial outbreaks using genomics

Bioinformatics. 2023 Sep 2;39(9):btad571. doi: 10.1093/bioinformatics/btad571.

Abstract

Summary: Bacterial Healthcare-Associated Infections (HAIs) are a major threat worldwide, which can be counteracted by establishing effective infection control measures, guided by constant surveillance and timely epidemiological investigations. Genomics is crucial in modern epidemiology but lacks standard methods and user-friendly software, accessible to users without a strong bioinformatics proficiency. To overcome these issues we developed P-DOR, a novel tool for rapid bacterial outbreak characterization. P-DOR accepts genome assemblies as input, it automatically selects a background of publicly available genomes using k-mer distances and adds it to the analysis dataset before inferring a Single-Nucleotide Polymorphism (SNP)-based phylogeny. Epidemiological clusters are identified considering the phylogenetic tree topology and SNP distances. By analyzing the SNP-distance distribution, the user can gauge the correct threshold. Patient metadata can be inputted as well, to provide a spatio-temporal representation of the outbreak. The entire pipeline is fast and scalable and can be also run on low-end computers.

Availability and implementation: P-DOR is implemented in Python3 and R and can be installed using conda environments. It is available from GitHub https://github.com/SteMIDIfactory/P-DOR under the GPL-3.0 license.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria
  • Computational Biology* / methods
  • Disease Outbreaks
  • Genome
  • Genome, Bacterial
  • Genomics* / methods
  • Humans
  • Phylogeny
  • Software