Copy-number analysis by base-level normalization: An intuitive visualization tool for evaluating copy number variations

Clin Genet. 2023 Jan;103(1):35-44. doi: 10.1111/cge.14236. Epub 2022 Oct 3.

Abstract

Next-generation sequencing (NGS) facilitates comprehensive molecular analyses that help diagnose unsolved disorders. In addition to detecting single-nucleotide variations and small insertions/deletions, bioinformatics tools can identify copy number variations (CNVs) in NGS data, which improves the diagnostic yield. However, because such calls may include false positives, subsequent confirmation tests are generally performed. Here, we introduce Copy-number Analysis by BAse-level NormAlization (CABANA), a visualization tool that allows users to intuitively identify candidate CNVs using the normalized single-base-level read depth calculated from NGS data. To demonstrate how CABANA works, we obtained NGS data from 474 patients with neuromuscular disorders. CNVs were first screened using a conventional bioinformatics tool, ExomeDepth. We then normalized and visualized those data at the single-base level using CABANA, and geneticists manually inspected the results to filter out false positives and determine candidate CNVs. In doing so, we identified 31 candidate CNVs (7%) in 474 patients and subsequently confirmed all of them to be true using multiplex ligation-dependent probe amplification. Comparison of the diagnostic yield with previously reported data on neuromuscular disorders indicated that the performance of CABANA was acceptable. Despite some limitations, we expect CABANA to help researchers accurately identify CNVs and reduce the need for subsequent confirmation testing.
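The abstract does not state how the normalized single-base-level read depth is computed, so the Python sketch below is only one plausible illustration of the general idea and is not CABANA's actual implementation. It assumes that each sample's per-base depth is scaled by that sample's own median depth, and that the result is then divided by the cohort median at each base. The function name `normalize_base_depth` and the toy depth values are hypothetical.

```python
from statistics import median


def normalize_base_depth(depths):
    """Return per-base depth ratios for every sample in a region.

    depths: list of equal-length lists, one per sample, holding the raw
    read depth at each base. (Hypothetical input layout, chosen for
    illustration only.)
    """
    # Step 1: scale each sample by its own median depth so that samples
    # sequenced to different overall depths become comparable.
    scaled = []
    for sample in depths:
        sample_median = median(sample)
        if sample_median == 0:
            raise ValueError("sample has zero median depth in this region")
        scaled.append([d / sample_median for d in sample])

    # Step 2: the cohort baseline at each base is the median of the
    # scaled depths across all samples at that position.
    baseline = [median(column) for column in zip(*scaled)]

    # Step 3: the ratio to the baseline is the value one would plot.
    # Bases with a zero baseline are reported as NaN.
    return [
        [s / b if b > 0 else float("nan") for s, b in zip(sample, baseline)]
        for sample in scaled
    ]


# Toy cohort: four samples with flat coverage at different depths and
# one sample whose depth is halved at bases 2 and 3.
cohort = [
    [100, 100, 100, 100, 100, 100],
    [200, 200, 200, 200, 200, 200],
    [50, 50, 50, 50, 50, 50],
    [80, 80, 80, 80, 80, 80],
    [100, 100, 50, 50, 100, 100],
]
ratios = normalize_base_depth(cohort)
print(ratios[4])
```

Under these assumptions, a ratio near 1 corresponds to the expected two copies, while a sustained drop toward 0.5 or rise toward 1.5 across consecutive bases would be consistent with a heterozygous deletion or duplication. Plotting such ratios base by base is the kind of view a reviewer could inspect manually.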

Keywords: CABANA; CNV visualization; NGS; candidate CNV; confirmatory test; single-base-level normalization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Copy Number Variations* / genetics
  • Humans