m6ACali: machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

Haokai Ye; Tenglong Li; Daniel J Rigden; Zhen Wei

doi:10.1093/nar/gkae280

m6ACali: machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

Nucleic Acids Res. 2024 Apr 18:gkae280. doi: 10.1093/nar/gkae280. Online ahead of print.

Authors

Haokai Ye^{1

2}, Tenglong Li^{3

4}, Daniel J Rigden², Zhen Wei^{1

5}

Affiliations

¹ Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.
² Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK.
³ Wisdom Lake Academy of Pharmacy, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China.
⁴ Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA.
⁵ Institute of Life Course and Medical Sciences, University of Liverpool, L7 8TX Liverpool, UK.

PMID: 38634812
DOI: 10.1093/nar/gkae280

Abstract

We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.

Abstract

Grants and funding