A suite of automated sequence analyses reduces the number of candidate deleterious variants and reveals a difference between probands and unaffected siblings

Genet Med. 2019 Aug;21(8):1772-1780. doi: 10.1038/s41436-019-0434-0. Epub 2019 Jan 31.

Abstract

Purpose: Develop an automated exome analysis workflow that can produce a very small number of candidate variants yet still detect different numbers of deleterious variants between probands and unaffected siblings.

Methods: Ninety-seven outbred nuclear families from the Undiagnosed Diseases Program/Network included single probands and the corresponding unaffected sibling(s). Single-nucleotide polymorphism (SNP) chip and exome analyses were performed on all, with proband and unaffected sibling considered independently as the target. The total burden of candidate genetic variants was summed for probands and siblings over all considered disease models.

Results: Exome analysis workflow include automated programs for ethnicity-matched genotype calling, salvage pathway for Mendelian inconsistency, compound heterozygous recessive detection, BAM file regional curation, population frequency filtering, pedigree-aware BAM file noise evaluation, and exon deletion filtration. This workflow relied heavily on BAM file analysis. A greater average pathogenic variant number was found compared with unaffected siblings. This was significant (p < 0.05) when using published recommended thresholds, and implies that causal variants are retained in many probands' lists.

Conclusion: Using Mendelian and non-Mendelian models, this agnostic exome analysis shows a difference between a small group of probands and their unaffected siblings. This workflow produces candidate lists small enough to pursue with laboratory validation.

Keywords: Undiagnosed Diseases Network; agnostic exome analysis; diagnosis; exome; rare diseases.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural

MeSH terms

  • DNA Copy Number Variations / genetics*
  • Electronic Data Processing*
  • Exome / genetics
  • Exons / genetics
  • Female
  • Genetic Diseases, Inborn / diagnosis*
  • Genetic Diseases, Inborn / genetics
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • Pedigree
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Sequence Analysis, DNA*
  • Sequence Deletion / genetics
  • Siblings