Implementing the VMC Specification to Reduce Ambiguity in Genomic Variant Representation

AMIA Annu Symp Proc. 2020 Mar 4;2019:1226-1235. eCollection 2019.

Abstract

Current methods for representing biological sequence variants allow flexibility, which has created redundancy within variant archives and discordance among variant representation tools. While research methodologies have been able to adapt to this ambiguity, strict clinical standards make it difficult to use these data in what would otherwise be useful clinical interventions. We implemented a specification developed by the GA4GH Variant Modeling Collaboration (VMC), which details a new approach to the unambiguous representation of variants as alleles, haplotypes, or genotypes. Our implementation, called the VMC Test Suite (http://vcfclin.org), offers web tools to generate VMC identifiers and insert them into a VCF file, and to generate a VMC bundle JSON representation of a VCF file or HGVS expression. A command line tool with similar functionality is also introduced. These tools facilitate use of this standard, an important step toward reliable querying of variants and their associated annotations.
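As a rough illustration of what generating an identifier and inserting it into a VCF record can involve, the sketch below derives a digest-based allele identifier and appends it to a record's INFO column. It is not the VMC Test Suite's code. The serialization string, the "VMC:GA_" prefix, the 24-byte truncated SHA-512 digest, the INFO key VMC_GA, and the sequence identifier are all assumptions made for illustration; the VMC specification defines the normative serialization and digest rules.

```python
import base64
import hashlib


def truncated_digest(data: bytes, length: int = 24) -> str:
    """SHA-512 digest, truncated and base64url-encoded (assumed scheme)."""
    digest = hashlib.sha512(data).digest()[:length]
    return base64.urlsafe_b64encode(digest).decode("ascii")


def allele_identifier(sequence_id: str, start: int, end: int, alt: str) -> str:
    """Build a deterministic identifier from an allele's defining fields.

    The serialization below is a simplified, hypothetical stand-in for the
    specification's format; start and end are interbase coordinates.
    """
    serialized = f"<Allele|{sequence_id}|{start}|{end}|{alt}>"
    return "VMC:GA_" + truncated_digest(serialized.encode("utf-8"))


def annotate_vcf_line(line: str, identifier: str, key: str = "VMC_GA") -> str:
    """Append key=identifier to the INFO column (8th field) of a VCF record."""
    fields = line.rstrip("\n").split("\t")
    entry = f"{key}={identifier}"
    info = fields[7]
    fields[7] = entry if info in ("", ".") else f"{info};{entry}"
    return "\t".join(fields)


# Example record: a single-base substitution A>G at 1-based position 100.
record = "chr1\t100\t.\tA\tG\t50\tPASS\tDP=10"
pos, ref, alt = 100, "A", "G"
start, end = pos - 1, pos - 1 + len(ref)  # convert VCF POS to interbase
identifier = allele_identifier("hypothetical-seq-id", start, end, alt)
annotated = annotate_vcf_line(record, identifier)
```

Because the identifier is computed only from the allele's defining fields, two files that describe the same allele would receive the same identifier under this scheme, which is the property that makes identifier-based querying reliable.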

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alleles
  • Databases, Genetic
  • Genetic Variation*
  • Genome, Human
  • Humans
  • Internet
  • Models, Genetic*
  • Software
  • Terminology as Topic*