Genome-wide methylation analysis of retrocopy-associated CpG islands and their genomic environment

Epigenetics. 2016 Mar 3;11(3):216-26. doi: 10.1080/15592294.2016.1145330. Epub 2016 Feb 18.

Abstract

Gene duplication by retrotransposition, i.e., the reverse transcription of an mRNA and integration of the cDNA into the genome, is an important mechanism in evolution. Based on whole-genome bisulfite sequencing of monocyte DNA, we have investigated the methylation state of all CpG islands (CGIs) associated with a retrocopy (n = 1,319), their genomic environment, as well as the CGIs associated with the ancestral genes. Approximately 10% of retrocopies are associated with a CGI. Whereas almost all CGIs of the human genome are unmethylated, 68% of the CGIs associated with a retrocopy are methylated. In retrocopies resulting from multiple retrotranspositions of the same ancestral gene, the methylation state of the CGI often differs. There is a strong positive correlation between the methylation state of the CGI/retrocopy and their genomic environment, suggesting that the methylation state of the integration site determined the methylation state of the CGI/retrocopy, or that methylation of the retrocopy by a host defense mechanism has spread into the adjacent regions. Only a minor fraction of CGI/retrocopies (n = 195) has intermediate methylation levels. Among these, the previously reported CGI/retrocopy in intron 2 of the RB1 gene (PPP1R26P1) as well as the CGI associated with the retrocopy RPS2P32 identified in this study carry a maternal methylation imprint. In conclusion, these findings shed light on the evolutionary dynamics and constraints of DNA methylation.

Keywords: CpG islands; DNA methylation; RPS2P32; imprinting; retrotransposition.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • CpG Islands / genetics
  • DNA Methylation / genetics*
  • Genome, Human*
  • Genomic Imprinting / genetics*
  • Humans
  • Introns / genetics
  • Maternal Inheritance / genetics
  • Monocytes / metabolism
  • Promoter Regions, Genetic
  • Retinoblastoma Binding Proteins / genetics*
  • Retroelements / genetics
  • Sequence Analysis, DNA
  • Ubiquitin-Protein Ligases / genetics*

Substances

  • RB1 protein, human
  • Retinoblastoma Binding Proteins
  • Retroelements
  • Ubiquitin-Protein Ligases