MADloy: robust detection of mosaic loss of chromosome Y from genotype-array-intensity data

BMC Bioinformatics. 2020 Nov 23;21(1):533. doi: 10.1186/s12859-020-03768-z.

Abstract

Background: Accurate protocols and methods to robustly detect the mosaic loss of chromosome Y (mLOY) are needed given its reported role in cancer, several age-related disorders and overall male mortality. Intensity SNP-array data have been used to infer mLOY status and to determine its prominent role in male disease. However, discrepancies of reported findings can be due to the uncertainty and variability of the methods used for mLOY detection and to the differences in the tissue-matrix used.

Results: We created a publicly available software tool called MADloy (Mosaic Alteration Detection for LOY) that incorporates existing methods and includes a new robust approach, allowing efficient calling in large studies and comparisons between methods. MADloy optimizes mLOY calling by correctly modeling the underlying reference population with no-mLOY status and incorporating B-deviation information. We observed improvements in the calling accuracy to previous methods, using experimentally validated samples, and an increment in the statistical power to detect associations with disease and mortality, using simulation studies and real dataset analyses. To understand discrepancies in mLOY detection across different tissues, we applied MADloy to detect the increment of mLOY cellularity in blood on 18 individuals after 3 years and to confirm that its detection in saliva was sub-optimal (41%). We additionally applied MADloy to detect the down-regulation genes in the chromosome Y in kidney and bladder tumors with mLOY, and to perform pathway analyses for the detection of mLOY in blood.

Conclusions: MADloy is a new software tool implemented in R for the easy and robust calling of mLOY status across different tissues aimed to facilitate its study in large epidemiological studies.

Keywords: Bioconductor; Loss of chromosome Y; SNP array.

MeSH terms

  • Chromosomes, Human, Y / genetics*
  • Down-Regulation / genetics
  • Genotype
  • Humans
  • Male
  • Mosaicism*
  • Polymorphism, Single Nucleotide / genetics
  • Reproducibility of Results
  • Software*
  • Statistics as Topic
  • Transcriptome / genetics