A new digital approach to SNP encoding for DNA identification

Forensic Sci Int. 2020 Dec:317:110520. doi: 10.1016/j.forsciint.2020.110520. Epub 2020 Sep 18.

Abstract

Identification of individuals has become an urgent problem for mankind. In the last three decades, STR-based DNA identification has actively evolved along with traditional biometric methods. Nonetheless, single-nucleotide polymorphisms (SNPs) are now of great interest and a number of relevant SNP panels have been proposed for DNA identification. Here, a simple approach to SNP data digitization that can provide assigning a unique genetic identification number (GIN) to each person is proposed. The key points of this approach are as follows: 1) SNP data are digitized as whole 4-bit boxes in the most convenient binary format, where character "1" (YES) is assigned to revealed nucleotides, and character "0" (NO) to missing nucleotides after SNP-typing; 2) all SNPs should be considered tetra-allelic. Calculations showed that a 72-plex SNP panel is enough to provide the population with unique GINs, which can be represented in digital (binary or hexadecimal) or graphic (linear or two-dimensional) formats. Simple software for SNP data processing and GINs creation in any format was written. It is likely that the national and global GIN databases will facilitate the solution of problems related to identification of individuals or human biological materials. The proposed approach may be extended to other living organisms as well.

Keywords: 72-Plex SNP panel; Binary format; Digitization; Genetic Identification number (GIN); Graphics; Single nucleotide polymorphism (SNP).

MeSH terms

  • Computational Biology
  • DNA Fingerprinting*
  • Forensic Genetics / methods
  • Gene Frequency
  • Humans
  • Polymorphism, Single Nucleotide*
  • Programming Languages*
  • Software*