Population genetic analyses of Eastern Chinese Han nationality using ForenSeq™ DNA Signature Prep Kit

Mol Genet Genomics. 2024 Feb 20;299(1):9. doi: 10.1007/s00438-024-02121-w.

Abstract

Currently, the most commonly used method for human identification and kinship analysis in forensic genetics is the detection of length polymorphism in short tandem repeats (STRs) using polymerase chain reaction (PCR) and capillary electrophoresis (CE). However, numerous studies have shown that considerable sequence variations exist in the repeat and flanking regions of the STR loci, which cannot be identified by CE detection. Comparatively, massively parallel sequencing (MPS) technology can capture these sequence differences, thereby enhancing the identification capability of certain STRs. In this study, we used the ForenSeq™ DNA Signature Prep Kit to sequence 58 STRs and 94 individual identification SNPs (iiSNPs) in a sample of 220 unrelated individuals from the Eastern Chinese Han population. Our aim is to obtain MPS-based STR and SNP data, providing further evidence for the study of population genetics and forensic applications. The results showed that the MPS method, utilizing sequence information, identified a total of 486 alleles on autosomal STRs (A-STRs), 97 alleles on X-chromosome STRs (X-STRs), and 218 alleles on Y-chromosome STRs (Y-STRs). Compared with length polymorphism, we observed an increase of 260 alleles (157, 31, and 72 alleles on A-STRs, X-STRs, and Y-STRs, respectively) across 36 STRs. The most substantial increments were observed in DYF387S1 and DYS389II, with increases of 287.5% and 250%, respectively. The most increment in the number of alleles was found at DYF387S1 and DYS389II (287.5% and 250%, respectively). The length-based (LB) and sequence-based (SB) combined random match probability (RMP) of 27 A-STRs were 6.05E-31 and 1.53E-34, respectively. Furthermore, other forensic parameters such as total discrimination power (TDP), cumulative probability of exclusion of trios (CPEtrio), and duos (CPEduo) were significantly improved when using the SB data, and informative data were obtained for the 94 iiSNPs. Collectively, these findings highlight the advantages of MPS technology in forensic genetics, and the Eastern Chinese Han genetic data generated in this study could be used as a valuable reference for future research in this field.

Keywords: Individual identification SNPs; Massively parallel sequencing; STRs.

MeSH terms

  • China
  • DNA
  • DNA Fingerprinting* / methods
  • Ethnicity* / genetics
  • Genetics, Population
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Microsatellite Repeats / genetics
  • Polymorphism, Single Nucleotide / genetics
  • Sequence Analysis, DNA / methods

Substances

  • DNA