A Vietnamese human genetic variation database

Hum Mutat. 2019 Oct;40(10):1664-1675. doi: 10.1002/humu.23835. Epub 2019 Jul 3.

Abstract

Large-scale human genome projects have created extensive human genome databases for some well-studied populations. Vietnam has about 95 million people (the 14th largest country by population in the world), of whom more than 86% are Kinh people. To date, genetic studies of Vietnamese people have mostly relied on genetic information from other populations. Building a Vietnamese human genetic variation database is essential for properly interpreting Vietnamese genetic variants. To this end, we sequenced 105 whole genomes and 200 whole exomes of 305 unrelated Kinh Vietnamese (KHV) people. We also included 101 other previously published KHV genomes to build a Vietnamese human genetic variation database of 406 KHV people. The KHV database contains 24.81 million variants (22.47 million single nucleotide polymorphisms (SNPs) and 2.34 million indels), of which 0.71 million variants are novel. It includes more than 99.3% of variants with a frequency of >1% in the KHV population. Notably, the KHV database revealed 107 variants reported in the human genome mutation database as pathological mutations with a frequency above 1% in the KHV population. The KHV database (available at https://genomes.vn) would be beneficial for genetic studies and medical applications not only for the Vietnamese population but also for other closely related populations.
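
The sample and variant counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the published analysis: the variable names are hypothetical, and the last step assumes that every one of the 406 individuals contributes two alleles at a site, which holds only for autosomal sites covered by both the genome and exome data.

```python
import math

# Figures quoted in the abstract (variant counts in millions).
snps_m, indels_m, total_m, novel_m = 22.47, 2.34, 24.81, 0.71
new_genomes, new_exomes, published_genomes = 105, 200, 101

# Sample counts: newly sequenced individuals, then the full cohort.
newly_sequenced = new_genomes + new_exomes        # 305
cohort = newly_sequenced + published_genomes      # 406

# SNPs and indels should add up to the reported total.
totals_agree = math.isclose(snps_m + indels_m, total_m, abs_tol=1e-9)

# Share of variants described as novel (roughly 2.9%).
novel_share = novel_m / total_m

# Assumption (not from the abstract): two alleles per individual at a site,
# so at most 2 * 406 alleles are observed there.
max_alleles = 2 * cohort
# Smallest allele count whose frequency is strictly above 1%.
min_count_above_1pct = max_alleles // 100 + 1

print(newly_sequenced, cohort, totals_agree)
print(f"novel share: {novel_share:.1%}")
print(f"alleles needed for >1% of {max_alleles}: {min_count_above_1pct}")
```

Under that assumption, a variant would need to be seen on at least 9 of 812 chromosomes to exceed the 1% threshold. At sites outside the exome targets only the 206 whole genomes contribute, so the threshold would be lower there.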

Keywords: Asian human genome database; Vietnamese genetic population structure; Vietnamese human genome database; whole genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asian People / genetics*
  • Computational Biology / methods
  • Databases, Genetic*
  • Exome Sequencing
  • Genetic Variation*
  • Genetics, Population
  • Genome, Human*
  • Humans
  • Molecular Sequence Annotation
  • Phylogeny
  • Phylogeography
  • Polymorphism, Single Nucleotide
  • Reproducibility of Results
  • Sequence Analysis, DNA
  • Vietnam
  • Whole Genome Sequencing