High throughput exome coverage of clinically relevant cardiac genes

BMC Med Genomics. 2014 Dec 11:7:67. doi: 10.1186/s12920-014-0067-8.

Abstract

Background: Given the growing use of whole-exome sequencing (WES) for clinical diagnostics of complex human disorders, we evaluated coverage of clinically relevant cardiac genes on WES and factors influencing uniformity and depth of coverage of exonic regions.

Methods: Two hundred and thirteen human DNA samples were exome sequenced via Illumina HiSeq using different versions of the Agilent SureSelect capture kit. 50 cardiac genes were further analyzed including 31 genes from the American College of Medical Genetics (ACMG) list for reporting of incidental findings and 19 genes associated with congenital heart disease for which clinical testing is available. Gene coordinates were obtained from two databases, CCDS and Known Gene and compared. Read depth for each region was extracted from the exomes and used to assess capture variability between kits for individual genes, and for overall coverage. GC content, gene size, and inter-sample variability were also tested as potential contributors to variability in gene coverage.

Results: All versions of capture kits (designed based on Consensus coding sequence) included only 55% of known genomic regions for the cardiac genes. Although newer versions of each Agilent kit showed improvement in capture of CCDS regions to 99%, only 64% of Known Gene regions were captured even with newer capture kits. There was considerable variability in coverage of the cardiac genes. 10 of the 50 genes including 6 on the ACMG list had less than the optimal coverage of 30X. Within each gene, only 32 of the 50 genes had the majority of their bases covered at an interquartile range ≥30X. Heterogeneity in gene coverage was modestly associated with gene size and significantly associated with GC content.

Conclusions: Despite improvement in overall coverage across the exome with newer capture kit versions and higher sequencing depths, only 50% of known genomic regions of clinical cardiac genes are targeted and individual gene coverage is non-uniform. This may contribute to a bias with greater attribution of disease causation to mutations in well-represented and well-covered genes. Improvements in WES technology are needed before widespread clinical application.

Publication types

  • Comparative Study
  • Multicenter Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Base Composition
  • Biomarkers / metabolism*
  • Case-Control Studies
  • Datasets as Topic
  • Exome / genetics*
  • Female
  • Genome, Human*
  • Heart Defects, Congenital / diagnosis
  • Heart Defects, Congenital / genetics*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Male
  • Prospective Studies

Substances

  • Biomarkers