Benefits of applying molecular barcoding systems are not uniform across different genomic applications

J Transl Med. 2023 May 5;21(1):305. doi: 10.1186/s12967-023-04160-0.

Abstract

Background: Despite the wide variety of Next Generation Sequencing (NGS)-based methods, it remains challenging to detect mutations present at very low frequencies. This problem is particularly relevant in oncology, where the limiting amount of input material, and its low quality, often limit the performance of the assays. Unique Molecular Identifiers (UMIs) are a molecular barcoding system often coupled with computational methods of noise suppression to improve the reliability of detection of rare variants. Although widely adopted, UMI inclusion imposes additional technical complexity and sequencing cost. Currently, there are no guidelines on UMI usage nor a comprehensive evaluation of their advantage across different applications.

Methods: We used DNA sequencing data generated by molecular barcoding and hybridization-based enrichment, from various types and quantities of input material (fresh frozen, formaldehyde-treated and cell-free DNA), to evaluate the performance of variant calling in different clinically relevant contexts.

Results: Noise suppression achieved by read grouping based on fragment mapping positions ensures reliable variant calling for many experimental designs even without exogenous UMIs. Exogenous barcodes significantly improve performance only when mapping position collisions occur, which is common in cell-free DNA.

Conclusions: We demonstrate that UMI usage is not universally beneficial across experimental designs and that it is worthwhile to critically consider the comparative advantage of UMI usage for a given NGS application prior to experimental design.

Keywords: Genomics; Molecular barcode; UMI; Unique molecular identifier; Variant calling.

MeSH terms

  • DNA*
  • Genomics* / methods
  • High-Throughput Nucleotide Sequencing / methods
  • Mutation / genetics
  • Reproducibility of Results
  • Sequence Analysis, DNA / methods

Substances

  • DNA