Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context

Nucleic Acids Res. 2019 Dec 2;47(21):e135. doi: 10.1093/nar/gkz775.

Abstract

As the use of next-generation sequencing (NGS) for the diagnosis of Mendelian diseases expands, the performance of this method has to be improved in order to achieve higher quality. Typically, performance measures are designed in the context of each application and, therefore, account for a spectrum of clinically relevant variants. We present EphaGen, a new computational methodology for bioinformatics quality control (QC). Given a single NGS dataset in BAM format and a pre-compiled VCF file of targeted clinically relevant variants, EphaGen associates the dataset with a single arbiter parameter. Intrinsically, EphaGen estimates the probability of missing any variant from the defined spectrum within a particular NGS dataset. Such a performance measure closely resembles the diagnostic sensitivity of a given NGS dataset. Here we present case studies of the use of EphaGen in the context of BRCA1/2 and CFTR sequencing in a series of 14 runs across 43 blood samples and 504 publicly available NGS datasets. EphaGen is superior to conventional bioinformatics metrics such as coverage depth and coverage uniformity. We recommend using this software as a QC step in NGS studies in the clinical context. Availability: https://github.com/m4merg/EphaGen or https://hub.docker.com/r/m4merg/ephagen.
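To make the idea of a single sensitivity-like parameter concrete, the following is a minimal sketch and not the EphaGen implementation. It assumes a simple binomial model of read sampling at a heterozygous site, a hypothetical calling threshold (`min_alt_reads`), and a hypothetical input of `(weight, depth)` pairs, where the weight stands in for how often a target variant is expected in the tested population. None of these names or modelling choices come from the abstract.

```python
from math import comb


def detection_probability(depth, min_alt_reads=3, allele_fraction=0.5):
    """Probability of seeing at least `min_alt_reads` variant-supporting reads.

    Assumption: reads at the site sample the variant allele independently
    with probability `allele_fraction` (binomial model).
    """
    if depth < min_alt_reads:
        return 0.0
    # Probability of observing fewer than `min_alt_reads` supporting reads.
    p_below = sum(
        comb(depth, k) * allele_fraction**k * (1 - allele_fraction) ** (depth - k)
        for k in range(min_alt_reads)
    )
    return 1.0 - p_below


def estimated_sensitivity(variants):
    """Weighted mean detection probability over the target variant spectrum.

    `variants` is an iterable of (weight, depth) pairs (hypothetical format).
    The probability of missing a variant is 1 minus the returned value.
    """
    variants = list(variants)
    total_weight = sum(weight for weight, _ in variants)
    if total_weight <= 0:
        raise ValueError("total variant weight must be positive")
    detected = sum(weight * detection_probability(depth) for weight, depth in variants)
    return detected / total_weight
```

In this sketch, a poorly covered site lowers the estimate only in proportion to the weight of the variants located there, which is one way a spectrum-aware metric can differ from raw coverage depth or coverage uniformity.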

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • BRCA1 Protein / genetics
  • BRCA2 Protein / genetics
  • Breast Neoplasms / genetics
  • Computational Biology / methods*
  • Cystic Fibrosis Transmembrane Conductance Regulator / genetics
  • Female
  • Genome, Human
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Mendelian Randomization Analysis / methods
  • Polymorphism, Single Nucleotide / genetics*
  • Quality Control*
  • Software*

Substances

  • BRCA1 Protein
  • BRCA1 protein, human
  • BRCA2 Protein
  • BRCA2 protein, human
  • CFTR protein, human
  • Cystic Fibrosis Transmembrane Conductance Regulator