Age prediction using DNA methylation of Y-chromosomal CpGs in semen samples

Forensic Sci Int Genet. 2024 Mar:69:103007. doi: 10.1016/j.fsigen.2024.103007. Epub 2024 Jan 6.

Abstract

In sexual assault cases, the evidence is often a mixture of female and male body fluids and in many cases contains a higher proportion of female than male body fluids. In such cases, Y-STRs, rather than autosomal STRs, can provide useful information. However, identifying the true suspect becomes very difficult if no known suspect matches or if two or more suspects match, e.g. two suspects from the same paternal lineage. Age prediction using the DNA methylation of Y-chromosomal CpGs can then help narrow the search for unknown suspects and discriminate between older and younger suspects. To this end, DNA methylation profiles of semen samples from 56 healthy Korean males were generated using Illumina's Infinium MethylationEPIC BeadChip Array. Of the ten age-associated CpG markers identified on the Y chromosome, nine were used to construct age prediction models. The identified markers were further investigated by massively parallel sequencing (MPS) of 147 semen samples, and the multiplex assay was validated through reliability, reproducibility, and sensitivity tests. Several age prediction models were constructed from the MPS data using multiple linear regression, stepwise linear regression, ridge regression, lasso regression, elastic net regression, and support vector machine analyses, and all showed mean absolute errors (MAEs) of 5 to 7 years in the test set. Six single-source female samples were also subjected to MPS analysis but showed very low coverage, too low to affect the analysis of mixed samples. The age prediction models of the present study are therefore expected to provide useful investigative leads, especially for mixed male and female samples from sexual assault cases.
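
As a rough illustration of how a regression-based age model of this kind is fitted and scored by MAE on a held-out test set, the following minimal Python sketch may help. It is not the authors' pipeline: the methylation values are synthetic, the three marker slopes in `TRUE_SLOPES`, the noise level, the 100/47 train/test split, and the penalty value are all arbitrary assumptions, and the helper names (`fit_ridge`, `predict`, `mae`, `simulate`) are hypothetical. Setting `lam=0` corresponds to ordinary multiple linear regression, and a positive `lam` to ridge regression.

```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_ridge(X, y, lam=0.0):
    """Return [intercept, coef_1, ...] minimizing squared error + lam * sum(coef^2)."""
    rows = [[1.0] + list(r) for r in X]  # prepend a constant column for the intercept
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(p)]
    for i in range(1, p):  # the intercept (index 0) is not penalized
        xtx[i][i] += lam
    return solve(xtx, xty)


def predict(weights, X):
    """Predict ages from methylation values with fitted weights."""
    return [weights[0] + sum(w * v for w, v in zip(weights[1:], r)) for r in X]


def mae(y_true, y_pred):
    """Mean absolute error between true and predicted ages."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


# Synthetic data only: assumed linear age effects on three hypothetical CpG markers.
TRUE_SLOPES = [0.30, -0.25, 0.20]
rng = random.Random(0)


def simulate(n):
    """Generate n synthetic samples: methylation fractions per marker, and ages."""
    ages = [rng.uniform(20, 70) for _ in range(n)]
    X = [[min(1.0, max(0.0, 0.5 + s * (a - 45) / 50 + rng.gauss(0, 0.03)))
          for s in TRUE_SLOPES] for a in ages]
    return X, ages


X_train, y_train = simulate(100)  # arbitrary split sizes, not taken from the study
X_test, y_test = simulate(47)

results = {}
for name, lam in (("multiple linear regression", 0.0), ("ridge regression", 0.01)):
    weights = fit_ridge(X_train, y_train, lam)
    results[name] = mae(y_test, predict(weights, X_test))
    print(f"{name}: test-set MAE = {results[name]:.2f} years")
```

The MAE values printed by this sketch reflect only the simulated noise level and say nothing about the performance reported in the study.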

Keywords: Age; Massively parallel sequencing; Methylation; Semen; Y-chromosome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child
  • Child, Preschool
  • Chromosomes, Human, Y
  • CpG Islands / genetics
  • DNA Methylation*
  • Female
  • Humans
  • Linear Models
  • Male
  • Reproducibility of Results
  • Semen*