EthSEQ: ethnicity annotation from whole exome sequencing data

Bioinformatics. 2017 Aug 1;33(15):2402-2404. doi: 10.1093/bioinformatics/btx165.

Abstract

Summary: Whole exome sequencing (WES) is widely used both in translational cancer genomics studies and in precision medicine. Stratifying individuals by ethnicity is fundamental to correctly interpreting the impact of personal genomic variation. We implemented EthSEQ to provide reliable and rapid ethnicity annotation from an individual's whole exome sequencing data, validated it on 1000 Genomes Project and TCGA data (2700 samples), demonstrating high precision, and assessed its computational performance against other tools. EthSEQ can be integrated into any WES-based processing pipeline and exploits multi-core capabilities.

Availability and implementation: The R package is available at github.com/aromanel/EthSEQ and from the CRAN repository.

Contact: alessandro.romanel@unitn.it or f.demichelis@unitn.it.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Exome Sequencing / methods*
  • Genetics, Population / methods
  • Genomics / methods
  • Humans
  • Molecular Sequence Annotation / methods*
  • Population Groups*
  • Software*